IEICE Technical Committee Submission System
Conference Schedule
Online Proceedings
[Sign in]
Tech. Rep. Archives
    [Japanese] / [English] 
( Committee/Place/Topics  ) --Press->
 
( Paper Keywords:  /  Column:Title Auth. Affi. Abst. Keyword ) --Press->

All Technical Committee Conferences  (Searched in: All Years)

Search Results: Conference Papers
 Conference Papers (Available on Advance Programs)  (Sort by: Date Descending)
 Results 1 - 20 of 28  /  [Next]  
Committee Date Time Place Paper Title / Authors Abstract Paper #
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] 2023-12-03
10:00
Tokyo Kikai-Shinko-Kaikan Bldg.
(Primary: On-site, Secondary: Online)
Improvement of Tacotron2 text-to-speech model based on masking operation and positional attention mechanism
Tong Ma, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) NLC2023-17 SP2023-37
 [more] NLC2023-17 SP2023-37
pp.19-24
SP, IPSJ-MUS, IPSJ-SLP [detail] 2023-06-24
13:50
Tokyo
(Primary: On-site, Secondary: Online)
Fast Neural Waveform Generation Model With Fully Connected Upsampling
Haruki Yamashita (Kobe cniv/NICT), Takuma Okamoto (NICT), Ryoichi Takashima (Kobe Univ), Yamato Ohtani (NICT), Tetsuya Takiguchi (Kobe Univ), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-15
In recent years, in text-to-speech synthesis, it is required to improve the inference speed while keeping the quality.
... [more]
SP2023-15
pp.73-78
SP, IPSJ-MUS, IPSJ-SLP [detail] 2023-06-24
13:50
Tokyo
(Primary: On-site, Secondary: Online)
Evaluation of multi-speaker text-to-speech synthesis using a corpus for speech recognition with x-vectors for various speech styles
Koki Hida (Wakayama Univ/NICT), Takuma Okamoto (NICT), Ryuichi Nisimura (Wakayama Univ), Yamato Ohtani (NICT), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-25
We have implemented multi-speaker end-to-end text-to-speech synthesis based on JETS using x-vectors as speaker embedding... [more] SP2023-25
pp.125-130
PRMU, IBISML, IPSJ-CVIM [detail] 2023-03-02
15:10
Hokkaido Future University Hakodate
(Primary: On-site, Secondary: Online)
[Invited Talk] --
Yuma Koizumi (Google Research) PRMU2022-87 IBISML2022-94
Machine learning tasks that deal with acoustic signals can be broadly classified into "recognizing sounds" and "generati... [more] PRMU2022-87 IBISML2022-94
p.149
SP, IPSJ-SLP, EA, SIP [detail] 2023-02-28
09:30
Okinawa
(Primary: On-site, Secondary: Online)
MS-FC-HiFiGAN : Fast Neural Waveform Generation Model With Learnable Lightweight Upsampling
Haruki Yamashita (Kobe Univ/NICT), Takuma Okamoto (NICT), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) EA2022-76 SIP2022-120 SP2022-40
In recent years, in text-to-speech synthesis, it is required to improve the inference speed while keeping the quality.
... [more]
EA2022-76 SIP2022-120 SP2022-40
pp.7-12
SP, IPSJ-SLP, EA, SIP [detail] 2023-02-28
13:00
Okinawa
(Primary: On-site, Secondary: Online)
[Invited Talk] Multiple sound spot synthesis meets multilingual speech synthesis -- Implementation is really all we need --
Takuma Okamoto (NICT) EA2022-87 SIP2022-131 SP2022-51
A multilingual multiple sound spot synthesis system is implemented as a user interface for real-time speech translation ... [more] EA2022-87 SIP2022-131 SP2022-51
pp.73-76
SP, IPSJ-SLP, EA, SIP [detail] 2023-03-01
11:00
Okinawa
(Primary: On-site, Secondary: Online)
Representation and Prediction of Accent Phrase Prosodic Features in Japanese Text-to-Speech
Masaki Sato, Shinnosuke Takamichi, Hiroshi Saruwatari (The Univ. of Tokyo) EA2022-108 SIP2022-152 SP2022-72
In order to use speech synthesis in a variety of situations such as dialogue systems and emotional expression in audiobo... [more] EA2022-108 SIP2022-152 SP2022-72
pp.197-202
SP, IPSJ-MUS, IPSJ-SLP [detail] 2022-06-18
10:50
Online Online [Invited Talk] Crazy vocoder is unbreakable -- But let's talk about an informal vision of the future --
Masanori Morise (Meiji Univ.) SP2022-15
When current speech synthesis researchers refer to Vocoder in their papers, they are most likely referring to Neural voc... [more] SP2022-15
pp.61-66
SP, IPSJ-MUS, IPSJ-SLP [detail] 2022-06-18
13:00
Online Online [Poster Presentation] Proposal of Speech Content Conversion and the Initial Trial: Conversion of Linguistic Information Depending on Situations
Kohei Takita, Saizo Aoyagi, Tatsunori Hirai (Komazawa Univ.) SP2022-19
It is important to speak dialects, honorifics, and simple words for listeners and the environment in order to smooth com... [more] SP2022-19
pp.82-87
AI 2022-02-28
10:00
Miyazaki Youth Hostel Sunflower MIYAZAKI
(Primary: On-site, Secondary: Online)
AI2021-12 (To be available after the conference date) [more] AI2021-12
pp.1-6
SP, IPSJ-SLP, IPSJ-MUS 2021-06-19
15:00
Online Online Neural speech synthesis using local phrase dependency structure information
Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura (NIST) SP2021-23
In order to synthesize Japanese speech with natural prosody, we introduce an end-to-end TTS with new prosodic symbol rep... [more] SP2021-23
pp.107-112
SP 2019-08-28
14:40
Kyoto Kyoto Univ. [Poster Presentation] An investigation on training of WaveNet vocoder in end-to-end text-to-speech
Kazuki Yasuhara, Tomoki Hayashi, Tomoki Toda (Nagoya Univ.) SP2019-14
In this paper, we investigate the training of WaveNet vocoder in end-to-end text-to-speech. Tacotron 2, which is an end-... [more] SP2019-14
pp.31-36
SP 2019-01-27
09:00
Ishikawa Kanazawa-Harmonie [Tutorial Invited Lecture] Software components towards end-to-end speech synthesis at NII -- Tutorial for Tacotron and WaveNet --
Yusuke Yasuda, Xin Wang (NII) SP2018-56
This presentation describes recent advances of end-to-end speech synthesis. We introduce major approaches and our method... [more] SP2018-56
p.21
NLC, IPSJ-NL, SP, IPSJ-SLP
(Joint) [detail]
2017-12-22
13:00
Tokyo Waseda Univ. Green Computing Systems Research Organization [Invited Talk] Expressive Speech Synthesis: Approaches to Text-to-Speech with Diverse Voices and Styles
Takao Kobayashi (Tokyo Tech.) SP2017-64
As the performance of smart devices and information systems becomes higher, more advanced speech interfaces are requeste... [more] SP2017-64
pp.85-86
SP, IPSJ-SLP, NLC, IPSJ-NL
(Joint) [detail]
2016-12-20
15:10
Tokyo NTT Musashino R&D [Poster Presentation] Improvement of accent sandhi rules based on accent dictionary for Japanese text-to-speech systems
Hiroto Aoyama, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-54
In order to synthesize more natural speech in Japanese text-to-speech systems, we improved accent sandhi rules. Conventi... [more] SP2016-54
pp.31-36
NLC, IPSJ-NL, SP, IPSJ-SLP
(Joint) [detail]
2015-12-03
09:00
Aichi Nagoya Inst of Tech. Evaluation of text-to-speech system construction for unknown-pronunciation languages
Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2015-80
This paper discusses a method to construction of text-to-speech (TTS) systems for unknown-pronunciation languages. There... [more] SP2015-80
pp.93-98
TL 2015-10-04
13:20
Tokyo WASEDA University An N-gram-based approach for input text synthesis depending on text-to-speech system
Wang Le, Kacper Radzikowski, Yoshie Osamu (Waseda Univ.) TL2015-34
Plenty of researches on text-to-speech(TTS) system have been made, in which prosodic information plays an important role... [more] TL2015-34
pp.1-5
WIT 2015-03-13
10:50
Ibaraki Kasuga Campus, Tsukuba University of Technology Comparative Evaluation of the Movie with Audio Description narrated with Text-to-speech
Kayo Omori, Rio Nakagawa (TWCU), Michiaki Yasumura (Keio Univ.), Takayuki Watanabe (TWCU) WIT2014-88
We made the audio description narrated with TTS and compared it with that with human voice. We found that (1) the guide ... [more] WIT2014-88
pp.17-22
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD
(Joint) [detail]
2014-12-15
14:00
Kanagawa Tokyo Institute of Technology (Suzukakedai Campus) [Invited Talk] Statistical approach to flexible speech synthesis -- towards human-like talking machines --
Keiichi Tokuda (NITech/Google) SP2014-109
This talk will give an overview of statistical approach to
flexible speech synthesis. For constructing human-like
tal... [more]
SP2014-109
p.31
SP 2014-11-13
16:55
Fukuoka Kyushu Univ. Chikushi Campus Emphasized Accent Phrase Prediction from Advertisement Text towards Expressive Text-to-speech Synthesis
Hideharu Nakajima, Hideyuki Mizuno, Sumitaka Sakauchi (NTT) SP2014-95
Realizing Expressive Text-to-speech synthesis needs developments of both text processing and the rendering of natural ex... [more] SP2014-95
pp.31-36
 Results 1 - 20 of 28  /  [Next]  
Choose a download format for default settings. [NEW !!]
Text format pLaTeX format CSV format BibTeX format
Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)


[Return to Top Page]

[Return to IEICE Web Page]


The Institute of Electronics, Information and Communication Engineers (IEICE), Japan