Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, EMM, ASJ-H |
2022-11-22 13:00 |
Online |
Online |
[Fellow Memorial Lecture]
Security and Privacy Preservation for Speech Signal -- Approach from speech information hiding technology -- Masashi Unoki (JAIST) EA2022-60 EMM2022-60 |
Non-authentic but skillfully fabricated artificial replicas of authentic media in the real world are known as “media clo... [more] |
EA2022-60 EMM2022-60 pp.99-104 |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 10:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluation of sentence-level generation in Japanese dialect speech synthesis using accent latent variables Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2021-79 SIP2021-106 SP2021-64 |
Japanese dialect speech synthesis is useful for personalized speech synthesis systems. However, inability to prepare acc... [more] |
EA2021-79 SIP2021-106 SP2021-64 pp.96-101 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
[Poster Presentation]
Initial analysis of oral reading skills obtained from large scale subjective evaluation Takuya Ozuru (Univ. of Tokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) EA2019-135 SIP2019-137 SP2019-84 |
The speech of professional newscasters readily suggests their occupation, that is, newscaster. So far, we have analyzed pr... [more] |
EA2019-135 SIP2019-137 SP2019-84 pp.195-200 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2019-12-06 10:35 |
Tokyo |
NHK Science & Technology Research Labs. |
[Invited Talk]
Progress and prospects of statistical speech synthesis Keiichi Tokuda (Nagoya Inst. of Tech.) SP2019-35 |
The basic problem of statistical speech synthesis is quite simple: we have a speech database for training, i.e., a set o... [more] |
SP2019-35 pp.11-12 |
SP |
2019-08-28 14:40 |
Kyoto |
Kyoto Univ. |
[Poster Presentation]
Analysis of prosodic differences between a newscaster and amateur speakers using partial-substituted synthetic speech Takuya Ozuru (Univ. of Tokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) SP2019-11 |
This paper analyzes prosodic differences between a professional newscaster and amateur speakers that affect listeners’... [more] |
SP2019-11 pp.13-18 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
F0 estimation using TV-CAR speech analysis based on Regularized LP Keiichi Funaki (Univ. of the Ryukyus) EA2018-152 SIP2018-158 SP2018-114 |
Linear prediction (LP) analysis is a speech analysis method that estimates AR (auto-regressive) coefficients to represent the all-pol... [more] |
EA2018-152 SIP2018-158 SP2018-114 pp.311-316 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
Perceptual influence of spectral envelope and aperiodicity quantization for encoding high-quality speech Genta Miyashita, Masanori Morise (Univ. of Yamanashi) EA2017-145 SIP2017-154 SP2017-128 |
In this paper, we investigate the relationship between the degradation of sound quality and the parameter quantization i... [more] |
EA2017-145 SIP2017-154 SP2017-128 pp.241-244 |
KBSE |
2018-03-01 17:00 |
Okinawa |
|
Analysis of Specification in Japanese using Natural Language Processing and Review Supporting with Speech Synthesis Kozo Okano, Kazuma Takahashi, Yusuke Naka, Shinpei Ogata (Shinshu Univ.), Toshifusa Sekizawa (Nihon Univ.) KBSE2017-52 |
The requirement specification for software is often described in a natural language and thus may include ambiguity and ... [more] |
KBSE2017-52 pp.79-84 |
SP, ASJ-H |
2018-01-20 13:00 |
Tokyo |
The University of Tokyo |
An extended log domain pulse model for VOCODERs Hideki Kawahara (Wakayama Univ.) SP2017-66 |
We propose a new procedure to design excitation source signals for the analysis-and-synthesis systems without preserving... [more] |
SP2017-66 pp.1-4 |
SP, ASJ-H |
2018-01-20 14:55 |
Tokyo |
The University of Tokyo |
[Poster Presentation]
Influence of frame shift in speech parameters on sound quality by high-quality speech analysis/synthesis system Genta Miyashita, Masanori Morise (Yamanashi Univ.) SP2017-72 |
Sound quality deterioration occurs when analyzing and synthesizing high-quality speech by using a vocoder. We conduct ... [more] |
SP2017-72 pp.35-38 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-21 12:50 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Poster Presentation]
Realtime analysis and display of voice source periodicity Hideki Kawahara (Wakayama Univ.), Ken-Ichi Sakakibara (Health Sciences Univ. Hokkaido) SP2017-59 |
This article introduces a real-time procedure for extraction and display of deviations from pure periodicity in voice ex... [more] |
SP2017-59 pp.21-22 |
SP |
2017-08-30 11:00 |
Kyoto |
Kyoto Univ. |
[Poster Presentation]
Improvement of intelligibility improvement method based on waveform processing to emphasize dynamic characteristic of speech Hiroki Kohara, Hideki Banno, Kensaku Asahi (Meijo Univ.) SP2017-26 |
This paper describes an intelligibility improvement method for speech signals with high quality by subband waveform processi... [more] |
SP2017-26 pp.33-36 |
SP |
2017-01-21 15:35 |
Tokyo |
The University of Tokyo |
Conversational Speech Synthesis dealing with Sequence of Sentences Ishin Fukuoka, Kazuhiko Iwata, Tetsunori Kobayashi (Waseda Univ.) SP2016-74 |
We proposed a conversational speech synthesis system that takes account of dialogue structure-based features. Convention... [more] |
SP2016-74 pp.59-64 |
EA, EMM |
2016-11-18 14:05 |
Oita |
Compal Hall (Oita) |
Basic Study of the Sound Quality Improvement using the Signal Enhancement of the Sound from inside of the Body Masatoshi Tsukiashi, Yoichi Midorikawa, Masanori Akita (Oita Univ.) EA2016-64 EMM2016-70 |
We have studied the detection of sleepiness or the detection of the change of feelings using NAM microphones. NAM microp... [more] |
EA2016-64 EMM2016-70 pp.95-100 |
IN, MoNA, CNR (Joint) |
2016-11-17 15:00 |
Kagoshima |
Kirishima-kanko Hotel |
[Invited Talk]
The collaboration between Digital IT platform and Cognitive solution - IBM Bluemix + Watson Misaki Utou (IBM Japan) MoNA2016-25 |
Cognitive systems and cloud platforms are IBM’s high-priority focus areas in 2016. IBM defines that Cognitive sy... [more] |
MoNA2016-25 p.17 |
SP |
2016-01-14 15:10 |
Kanagawa |
Sunpian Kawasaki |
Objective evaluation of synthetic speech using association between dimensions within spectral features Yusuke Ijima, Taichi Asami (NTT), Hideyuki Mizuno (TUSS) SP2015-90 |
This paper proposes a novel objective evaluation technique for statistical parametric speech synthesis. A novel point of... [more] |
SP2015-90 pp.27-32 |
SP |
2016-01-14 15:35 |
Kanagawa |
Sunpian Kawasaki |
Pitch-synchronous band group delay vocoder for high quality speech synthesis Masatsune Tamura, Ryo Morinaka, Masahiro Morita (Toshiba) SP2015-91 |
This paper presents a speech analysis and synthesis method that can precisely synthesize speech waveforms for high quali... [more] |
SP2015-91 pp.33-38 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-02 11:15 |
Aichi |
Nagoya Inst of Tech. |
Evaluation and Analysis of Duration Correction for Non-Native Speech Based on Waveform Modification Shinya Kura, Shinnosuke Takamichi (NAIST), Tomoki Toda (NAIST/Nagoya Univ.), Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2015-73 |
There are several attempts at correcting durational patterns of non-native speech towards language learning. One of the ... [more] |
SP2015-73 pp.19-24 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-03 09:25 |
Aichi |
Nagoya Inst of Tech. |
Deep Auto-encoder based Low-dimensional Feature Extraction using FFT Spectral Envelopes in Statistical Parametric Speech Synthesis Shinji Takaki, Junichi Yamagishi (NII) SP2015-81 |
In the state-of-the-art statistical parametric speech synthesis system, a speech analysis module, e.g. STRAIGHT spectral... [more] |
SP2015-81 pp.99-104 |
SP |
2014-11-13 17:20 |
Fukuoka |
Kyushu Univ. Chikushi Campus |
Emotional speech synthesis for long words by generalizing accent types Yuko Aoyama (hakase.com), Tsuyoshi Moriyama (Tokyo Polytechnic Univ.) SP2014-96 |
More desirable speech synthesis methods require a smaller number of training samples in synthesizing various kinds and stren... [more] |
SP2014-96 pp.37-40 |