IEICE Technical Report

Online edition: ISSN 2432-6380

Volume 119, Number 321

Speech

Workshop Date : 2019-12-06 / Issue Date : 2019-11-29

[PREV] [NEXT]

[TOP] | [2016] | [2017] | [2018] | [2019] | [2020] | [2021] | [2022] | [Japanese] / [English]

[PROGRAM] [BULK PDF DOWNLOAD]


Table of contents

SP2019-34
Time-domain convolutional denoising autoencoder for multi-channel speech enhancement
Naohiro Tawara, Tetsunori Kobayashi, Tetsuji Ogawa (Waseda Univ.)
pp. 1 - 6

SP2019-35
[Invited Talk] Progress and prospects of statistical speech synthesis
Keiichi Tokuda (Nagoya Inst. of Tech.)
pp. 11 - 12

SP2019-36
[Poster Presentation] Assessment of pronunciation proficiency for second language learners using phoneme posterior probabilities of language pairs
Rintaro Mori, Akinobu Lee (Nitech)
pp. 43 - 48

SP2019-37
[Poster Presentation] Effectiveness of sequence-to-sequence acoustic modeling by using automatic generated labels
Kiyoshi Kurihara, Nobumasa Seiyama, Tadashi Kumano (NHK)
pp. 49 - 54

SP2019-38
[Poster Presentation] Synthetic speech-based sound masking for privacy protection when speaking to smartphones in public space
Takahiro Tsugui, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.)
pp. 55 - 60

SP2019-39
[Poster Presentation] Analysis and Subjective Labeling for Emotional Speech Database JTES
Mai Yamanaka, Takashi Nose, Yuya Chiba, Akinori Ito (Tohoku Univ.)
pp. 61 - 66

SP2019-40
[Poster Presentation] Estimation of three-dimensional tongue shape from midsagittal tongue contour using regression models
Tatsuya Kitamura (Konan Univ.), Hisanori Makinae (NRIPS), Masashi Ito (TIT)
pp. 67 - 72

SP2019-41
[Poster Presentation] Time-Varying Complex AR speech analysis based on l2-norm regularization
Keiichi Funaki (Univ. of the Ryukyus)
pp. 73 - 77

SP2019-42
A comparison of neural vocoders in singing voice synthesis
Sota Wada, Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.)
pp. 85 - 90

SP2019-43
An evaluation of representation learning using phoneme posteriorgrams and data augmentation in speech emotion recognition
Shintaro Okada (Nagoya Univ.), Atsushi Ando (Nagoya Univ./NTT), Tomoki Toda (Nagoya Univ.)
pp. 91 - 96

Note: Each article is a technical report without peer review, and its polished version will be published elsewhere.


The Institute of Electronics, Information and Communication Engineers (IEICE), Japan