Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
PRMU, IBISML, IPSJ-CVIM [detail] |
2023-03-02 15:10 |
Hokkaido |
Future University Hakodate (Primary: On-site, Secondary: Online) |
[Invited Talk]
-- Yuma Koizumi (Google Research) PRMU2022-87 IBISML2022-94 |
Machine learning tasks that deal with acoustic signals can be broadly classified into "recognizing sounds" and "generati... [more] |
PRMU2022-87 IBISML2022-94 p.149 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-02-28 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Multi-stream FC-HiFi-GAN:Fast Neural Vocoder Model Using Learnable Lightweight Upsampling Haruki Yamashita (Kobe Univ/NICT), Takuma Okamoto (NICT), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) EA2022-76 SIP2022-120 SP2022-40 |
In recent years, in text-to-speech synthesis, it is required to improve the inference speed while keeping the quality.
... [more] |
EA2022-76 SIP2022-120 SP2022-40 pp.7-12 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-02-28 13:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
[Invited Talk]
Multiple sound spot synthesis meets multilingual neural speech synthesis
-- Implementation is really all we need -- Takuma Okamoto (NICT) EA2022-87 SIP2022-131 SP2022-51 |
A multilingual multiple sound spot synthesis system is implemented as a user interface for real-time speech translation ... [more] |
EA2022-87 SIP2022-131 SP2022-51 pp.73-76 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Representation and Prediction of Accent Phrase Prosodic Features in Japanese Text-to-Speech Masaki Sato, Shinnosuke Takamichi, Hiroshi Saruwatari (The Univ. of Tokyo) EA2022-108 SIP2022-152 SP2022-72 |
In order to use speech synthesis in a variety of situations such as dialogue systems and emotional expression in audiobo... [more] |
EA2022-108 SIP2022-152 SP2022-72 pp.197-202 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 10:50 |
Online |
Online |
[Invited Talk]
Crazy vocoder is unbreakable
-- But let's talk about an informal vision of the future -- Masanori Morise (Meiji Univ.) SP2022-15 |
When current speech synthesis researchers refer to Vocoder in their papers, they are most likely referring to Neural voc... [more] |
SP2022-15 pp.61-66 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 13:00 |
Online |
Online |
[Poster Presentation]
Proposal of Speech Content Conversion and the Initial Trial: Conversion of Linguistic Information Depending on Situations Kohei Takita, Saizo Aoyagi, Tatsunori Hirai (Komazawa Univ.) SP2022-19 |
It is important to speak dialects, honorifics, and simple words for listeners and the environment in order to smooth com... [more] |
SP2022-19 pp.82-87 |
AI |
2022-02-28 10:00 |
Miyazaki |
Youth Hostel Sunflower MIYAZAKI (Primary: On-site, Secondary: Online) |
AI2021-12 |
(To be available after the conference date) [more] |
AI2021-12 pp.1-6 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 15:00 |
Online |
Online |
Neural speech synthesis using local phrase dependency structure information Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura (NIST) SP2021-23 |
In order to synthesize Japanese speech with natural prosody, we introduce an end-to-end TTS with new prosodic symbol rep... [more] |
SP2021-23 pp.107-112 |
SP |
2019-08-28 14:40 |
Kyoto |
Kyoto Univ. |
[Poster Presentation]
An investigation on training of WaveNet vocoder in end-to-end text-to-speech Kazuki Yasuhara, Tomoki Hayashi, Tomoki Toda (Nagoya Univ.) SP2019-14 |
In this paper, we investigate the training of WaveNet vocoder in end-to-end text-to-speech. Tacotron 2, which is an end-... [more] |
SP2019-14 pp.31-36 |
SP |
2019-01-27 09:00 |
Ishikawa |
Kanazawa-Harmonie |
[Tutorial Invited Lecture]
Software components towards end-to-end speech synthesis at NII
-- Tutorial for Tacotron and WaveNet -- Yusuke Yasuda, Xin Wang (NII) SP2018-56 |
This presentation describes recent advances of end-to-end speech synthesis. We introduce major approaches and our method... [more] |
SP2018-56 p.21 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-22 13:00 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Invited Talk]
Expressive Speech Synthesis: Approaches to Text-to-Speech with Diverse Voices and Styles Takao Kobayashi (Tokyo Tech.) SP2017-64 |
As the performance of smart devices and information systems becomes higher, more advanced speech interfaces are requeste... [more] |
SP2017-64 pp.85-86 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Improvement of accent sandhi rules based on accent dictionary for Japanese text-to-speech systems Hiroto Aoyama, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-54 |
In order to synthesize more natural speech in Japanese text-to-speech systems, we improved accent sandhi rules. Conventi... [more] |
SP2016-54 pp.31-36 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-03 09:00 |
Aichi |
Nagoya Inst of Tech. |
Evaluation of text-to-speech system construction for unknown-pronunciation languages Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2015-80 |
This paper discusses a method to construction of text-to-speech (TTS) systems for unknown-pronunciation languages. There... [more] |
SP2015-80 pp.93-98 |
TL |
2015-10-04 13:20 |
Tokyo |
WASEDA University |
An N-gram-based approach for input text synthesis depending on text-to-speech system Wang Le, Kacper Radzikowski, Yoshie Osamu (Waseda Univ.) TL2015-34 |
Plenty of researches on text-to-speech(TTS) system have been made, in which prosodic information plays an important role... [more] |
TL2015-34 pp.1-5 |
WIT |
2015-03-13 10:50 |
Ibaraki |
Kasuga Campus, Tsukuba University of Technology |
Comparative Evaluation of the Movie with Audio Description narrated with Text-to-speech Kayo Omori, Rio Nakagawa (TWCU), Michiaki Yasumura (Keio Univ.), Takayuki Watanabe (TWCU) WIT2014-88 |
We made the audio description narrated with TTS and compared it with that with human voice. We found that (1) the guide ... [more] |
WIT2014-88 pp.17-22 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 14:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Invited Talk]
Statistical approach to flexible speech synthesis
-- towards human-like talking machines -- Keiichi Tokuda (NITech/Google) SP2014-109 |
This talk will give an overview of statistical approach to
flexible speech synthesis. For constructing human-like
tal... [more] |
SP2014-109 p.31 |
SP |
2014-11-13 16:55 |
Fukuoka |
Kyushu Univ. Chikushi Campus |
Emphasized Accent Phrase Prediction from Advertisement Text towards Expressive Text-to-speech Synthesis Hideharu Nakajima, Hideyuki Mizuno, Sumitaka Sakauchi (NTT) SP2014-95 |
Realizing Expressive Text-to-speech synthesis needs developments of both text processing and the rendering of natural ex... [more] |
SP2014-95 pp.31-36 |
SP, IPSJ-MUS |
2014-05-25 11:30 |
Tokyo |
|
Text-to-speech prosody synthesis based on probabilistic model for F0 contour Kento Kadowaki, Tatsuma Ishihara, Nobukatsu Hojo (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT) SP2014-28 |
This paper deals with the problem of generating the fundamental frequency (F0) contour of speech from a text input for t... [more] |
SP2014-28 pp.309-314 |
SP, IPSJ-SLP |
2013-12-20 15:25 |
Tokyo |
|
[Fellow Memorial Lecture]
Toward Speech Synthesis with Diverse Voices and Styles: Approaches and Issues Takao Kobayashi (Tokyo Tech.) SP2013-93 |
Recently, hidden Markov model-based (HMM-based) speech synthesis has been widely studied in the text-to-speech (TTS) syn... [more] |
SP2013-93 pp.119-122 |
ET |
2012-09-29 11:25 |
Okayama |
|
A Study on the Utilization of the English Text-to-Speech Software in the Blended Instruction and its Effect Noritake Fujishiro (Okayama Higashi Commercial H.S.), Isao Miyaji (Okayama Univ. of Science) ET2012-30 |
In Foreign Language Activities in elementally school and English language education in junior high school, we verified t... [more] |
ET2012-30 pp.13-16 |