Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-03 11:00 |
Online |
Online (Online) |
Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47 |
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... [more] |
NLC2021-26 SP2021-47 pp.42-47 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 14:05 |
Online |
Online (Online) |
[Poster Presentation]
Investigation of DNN-based speech synthesis utilizing oral reading skills obtained from large scale subjective evaluation Shun Akui (UTokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2020-71 SIP2020-102 SP2020-36 |
So far, we have been suggested the value of `oral reading skill' based on a listening evaluation experiment as a quantit... [more] |
EA2020-71 SIP2020-102 SP2020-36 pp.68-73 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 17:10 |
Online |
Online (Online) |
An investigation of rhythm-based speaker embeddings for phoneme duration modeling Kenichi Fujita, Atsushi Ando, Yusuke Ijima (NTT) EA2020-77 SIP2020-108 SP2020-42 |
In this study, we propose a speaker embedding method suitable for modeling phoneme duration length for each individual i... [more] |
EA2020-77 SIP2020-108 SP2020-42 pp.103-108 |
SP, EA, SIP |
2020-03-02 13:00 |
Okinawa |
Okinawa Industry Support Center (Okinawa) (Cancelled but technical report was issued) |
The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi (UTokyo), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UTokyo) EA2019-112 SIP2019-114 SP2019-61 |
In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of... [more] |
EA2019-112 SIP2019-114 SP2019-61 pp.65-70 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Okinawa) (Cancelled but technical report was issued) |
[Poster Presentation]
Initial analysis of oral reading skills obtained from large scale subjective evaluation Takuya Ozuru (Univ. of Tokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) EA2019-135 SIP2019-137 SP2019-84 |
Speech of professional newscasters easily suggest us his/her occupation, that is newscaster. So far, we have analyzed pr... [more] |
EA2019-135 SIP2019-137 SP2019-84 pp.195-200 |
SP |
2019-08-28 14:40 |
Kyoto |
Kyoto Univ. (Kyoto) |
[Poster Presentation]
Analysis of prosodic differences between a newscaster and amateur speakers using partial-substituted synthetic speech Takuya Ozuru (Univ. of Tokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) SP2019-11 |
This paper analyzes prosodic differences between a professional newscaster and amateur speakers which affects listeners’... [more] |
SP2019-11 pp.13-18 |
WIT, SP |
2018-10-27 13:00 |
Fukuoka |
Kyushu Institute of Technology(Kitakyushu) (Fukuoka) |
An investigation of multi-speaker modeling for DNN-based speech synthesis incorporating generative adversarial networks Hiroki Kanagawa, Yusuke Ijima (NTT MD Lab.) SP2018-32 WIT2018-20 |
[more] |
SP2018-32 WIT2018-20 pp.1-6 |
SP, IPSJ-SLP (Joint) |
2018-07-26 14:30 |
Shizuoka |
Sago-Royal-Hotel (Hamamatsu) (Shizuoka) |
[Invited Talk]
Docomo AI Agent: Open Partner Initiative
-- Project SEBASTIEN -- Takanobu Oba, Takashi Yoshikawa (Docomo), Takaaki Fukutomi, Kiyoaki Matsui, Yusuke Ijima (NTT) SP2018-17 |
[more] |
SP2018-17 pp.7-8 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 10:25 |
Okinawa |
(Okinawa) |
Non-parallel and Many-to-Many Voice Conversion Using Variational Autoencoder Conditioned by Phonetic Posteriorgrams and d-vectors Yuki Saito (NTT/Univ. of Tokyo), Yusuke Ijima, Kyosuke Nishida (NTT), Shinnosuke Takamichi (Univ. of Tokyo) EA2017-105 SIP2017-114 SP2017-88 |
This paper proposes novel frameworks for non-parallel and many-to-many voice conversion (VC) using variational autoencod... [more] |
EA2017-105 SIP2017-114 SP2017-88 pp.21-26 |
PRMU, SP |
2017-06-22 15:15 |
Miyagi |
(Miyagi) |
Comparisons on Transplant Emotional Expressions in DNN-based TTS Synthesis Katsuki Inoue, Sunao Hara, Masanobu Abe (Okayama Univ.), Nobukatsu Hojo, Yusuke Ijima (NTT) PRMU2017-29 SP2017-5 |
Recent studies have shown that DNN-based speech synthesis can generate more natural synthesized speech than the conventi... [more] |
PRMU2017-29 SP2017-5 pp.23-28 |
SP, SIP, EA |
2017-03-01 12:40 |
Okinawa |
Okinawa Industry Support Center (Okinawa) |
[Poster Presentation]
An investigation of speaker adaptation method for DNN-based speech synthesis using speaker codes Nobukatsu Hojo, Yusuke Ijima (NTT) EA2016-108 SIP2016-163 SP2016-103 |
In this work, we conducted objective evaluation experiments on the conventional speaker adaptation methods for DNN-based... [more] |
EA2016-108 SIP2016-163 SP2016-103 pp.147-152 |
SP, SIP, EA |
2017-03-01 12:40 |
Okinawa |
Okinawa Industry Support Center (Okinawa) |
[Poster Presentation]
Prosodic Word Embeddings for DNN-based speech synthesis Yusuke Ijima, Nobukatsu Hojo, Ryo Masumura, Taichi Asami (NTT) EA2016-109 SIP2016-164 SP2016-104 |
This paper proposed a novel word embeddings with prosodic information (prosodic word embeddings) for DNN-based speech sy... [more] |
EA2016-109 SIP2016-164 SP2016-104 pp.153-158 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 16:40 |
Tokyo |
NTT Musashino R&D (Tokyo) |
Generative Adversarial Network-based Postfiltering for Statistical Parametric Speech Synthesis Takuhiro Kaneko, Hirokazu Kameoka, Nobukatsu Hojo, Yusuke Ijima, Kaoru Hiramatsu, Kunio Kashino (NTT) SP2016-61 |
In the field of speech synthesis, statistical parametric speech synthesis has been widely used due to the flexibility an... [more] |
SP2016-61 pp.89-94 |
SP, IPSJ-SLP (Joint) |
2016-07-28 15:45 |
Yamagata |
Takinoyu Hotel (Yamagata) |
On the Use of Speaker Codes for Multi-Speaker Modeling in DNN-based Speech Synthesis Nobukatsu Hojo, Yusuke Ijima (NTT), Hideyuki Mizuno (Tokyo University of Science, Suwa) SP2016-22 |
Recent studies have shown that DNN-based speech synthesis can generate more natural synthesized speech than the conventi... [more] |
SP2016-22 pp.13-18 |
SP |
2016-01-14 15:10 |
Kanagawa |
Sunpian Kawasaki (Kanagawa) |
Objective evaluation of synthetic speech using association between dimensions within spectral features Yusuke Ijima, Taichi Asami (NTT), Hideyuki Mizuno (TUSS) SP2015-90 |
This paper proposes a novel objective evaluation technique for statistical parametric speech synthesis. A novel point of... [more] |
SP2015-90 pp.27-32 |