Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
NLC, IPSJ-NL |
2023-03-18 16:40 |
Okinawa |
OIST (Primary: On-site, Secondary: Online) |
Collection of Textual Expressions in the Wild Toward Voice-quality Control from Free Description Aya Watanabe, Shinnosuke Takamichi, Yuki Saito, Hiroshi Saruwatari (UTokyo) NLC2022-29 |
[more] |
NLC2022-29 pp.55-60 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-02-28 16:15 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Visual onoma-to-wave: environmental sound synthesis from visual onomatopoeias and sound-source images Hien Ohnaka (NITTC), Shinnosuke Takamichi (UT), Keisuke Imoto (DU), Yuki Okamoto (Rits), Kazuki Fujii, Hiroshi Saruwatari (UT) EA2022-90 SIP2022-134 SP2022-54 |
(To be available after the conference date) [more] |
EA2022-90 SIP2022-134 SP2022-54 pp.83-88 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Representation and Prediction of Accent Phrase Prosodic Features in Japanese Text-to-Speech Masaki Sato, Shinnosuke Takamichi, Hiroshi Saruwatari (The Univ. of Tokyo) EA2022-108 SIP2022-152 SP2022-72 |
In order to use speech synthesis in a variety of situations such as dialogue systems and emotional expression in audiobo... [more] |
EA2022-108 SIP2022-152 SP2022-72 pp.197-202 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 14:50 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Corpus construction toward multi-domain empathetic dialogue speech synthesis Yuki Saito, Eiji Iimori, Shinnosuke Takamichi (UT), Kentaro Tachibana (LINE), Hiroshi Saruwatari (UT) |
(To be available after the conference date) [more] |
|
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 10:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluation of sentence-level generation in Japanese dialect speech synthesis using accent latent variables Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2021-79 SIP2021-106 SP2021-64 |
Japanese dialect speech synthesis is useful for personalized speech synthesis systems. However, inability to prepare acc... [more] |
EA2021-79 SIP2021-106 SP2021-64 pp.96-101 |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 12:00 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluating the robustness of signal processing-based pseudonymization using parameter optimization against inversion attack. Hiroto Kai (Tokyo Metro. Univ.), Shinnosuke Takamichi (The Univ. of Tokyo), Sayaka Shiota, Hitoshi Kiya (Tokyo Metro. Univ.) EA2021-82 SIP2021-109 SP2021-67 |
[more] |
EA2021-82 SIP2021-109 SP2021-67 pp.114-119 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-03 11:00 |
Online |
Online |
Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47 |
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... [more] |
NLC2021-26 SP2021-47 pp.42-47 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 14:05 |
Online |
Online |
[Poster Presentation]
End-to-end incremental TTS with lookahead generation with large pretrained language model Takaaki Saeki, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2020-74 SIP2020-105 SP2020-39 |
(To be available after the conference date) [more] |
EA2020-74 SIP2020-105 SP2020-39 pp.85-90 |
SP, IPSJ-MUS, IPSJ-SLP |
2020-06-07 15:45 |
Online |
Online |
HumanGAN: generative adversarial network with human-based discriminator and its naturalness evaluation in synthesized voice Kazuki Fujii (NITTC), Yuki Saito, Shinnosuke Takamichi (UTokyo), Yukino Baba (UTsukuba), Hiroshi Saruwatari (UTokyo) SP2020-6 |
[more] |
SP2020-6 pp.15-20 |
SP, EA, SIP |
2020-03-02 13:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi (UTokyo), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UTokyo) EA2019-112 SIP2019-114 SP2019-61 |
In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of... [more] |
EA2019-112 SIP2019-114 SP2019-61 pp.65-70 |
SP |
2019-06-13 15:25 |
Kanagawa |
Tokyo Institute of Technology |
[Invited Talk]
Constructing voice corpus for next-generation speech research Shinnosuke Takamichi (UTokyo) SP2019-5 |
Thanks to developments of machine learning techniques including deep learning, solving more diverse issues is required i... [more] |
SP2019-5 p.25 |
EA, ASJ-H, EMM, IPSJ-MUS [detail] |
2018-11-21 13:30 |
Ishikawa |
Hotel Koshuen |
Evaluation of DNN-based Low-Musical-Noise Speech Enhancement Using Kurtosis Matching Satoshi Mizoguchi, Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2018-66 EMM2018-66 |
This paper proposes DNN-based speech enhancement with low musical noise by kurtosis matching. Musical noise, artifacts g... [more] |
EA2018-66 EMM2018-66 pp.19-24 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 10:00 |
Okinawa |
|
Experimental Evaluation of Multichannel Audio Source Separation Based on IDLMA Daichi Kitamura, Hayato Sumino, Norihiro Takamune, Shinnosuke Takamichi, Hiroshi Saruwatari (Univ. of Tokyo), Nobutaka Ono (Tokyo Metropolitan Univ.) EA2017-104 SIP2017-113 SP2017-87 |
In this paper, we propose a new informed multichannel audio source separation called independent deeply learned matrix a... [more] |
EA2017-104 SIP2017-113 SP2017-87 pp.13-20 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 10:25 |
Okinawa |
|
Non-parallel and Many-to-Many Voice Conversion Using Variational Autoencoder Conditioned by Phonetic Posteriorgrams and d-vectors Yuki Saito (NTT/Univ. of Tokyo), Yusuke Ijima, Kyosuke Nishida (NTT), Shinnosuke Takamichi (Univ. of Tokyo) EA2017-105 SIP2017-114 SP2017-88 |
This paper proposes novel frameworks for non-parallel and many-to-many voice conversion (VC) using variational autoencod... [more] |
EA2017-105 SIP2017-114 SP2017-88 pp.21-26 |
SP, IPSJ-SLP (Joint) |
2017-07-27 16:15 |
Miyagi |
Akiu Resort Hotel Crescent |
Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities and Evaluation of Dual Learning Hiroyuki Miyoshi, Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari (Univ. of Tokyo) SP2017-17 |
Voice conversion (VC) using sequence-to-sequence learning of context posterior probabilities is proposed. Conventional V... [more] |
SP2017-17 pp.9-14 |
EA, US (Joint) |
2017-01-25 13:00 |
Kyoto |
Doshisha Univ. |
[Poster Presentation]
Study on efficient solver for independent low-rank matrix analysis with sparse time-series-activity regularization Yoshiki Mitsui (Univ. Tokyo), Daichi Kitamura (SOKENDAI), Shinnosuke Takamichi (Univ. Tokyo), Nobutaka Ono (NII/SOKENDAI), Hiroshi Saruwatari (Univ. Tokyo) EA2016-72 |
In this paper, we propose a new blind source separation (BSS) method based on independent low-rank matrix analysis (ILRM... [more] |
EA2016-72 pp.25-30 |
SP |
2017-01-21 11:00 |
Tokyo |
The University of Tokyo |
[Poster Presentation]
Evaluation of DNN-Based Voice Conversion Deceiving Anti-spoofing Verification Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari (UT) SP2016-69 |
This paper proposes a novel training algorithm for high-quality Deep Neural Network (DNN)-based voice conversion. To imp... [more] |
SP2016-69 pp.29-34 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-02 11:15 |
Aichi |
Nagoya Inst of Tech. |
Evaluation and Analysis of Duration Correction for Non-Native Speech Based on Waveform Modification Shinya Kura, Shinnosuke Takamichi (NAIST), Tomoki Toda (NAIST/Nagoya Univ.), Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2015-73 |
There are several attempts at correcting durational patterns of non-native speech towards language learning. One of the ... [more] |
SP2015-73 pp.19-24 |
SIP, EA, SP |
2015-03-02 11:15 |
Okinawa |
|
Modulation spectrum-constrained trajectory training algorithm for statistical parametric speech synthesis Shinnosuke Takamichi (NAIST/CMU), Tomoki Toda (NAIST), Alan W. Black (CMU), Satoshi Nakamura (NAIST) EA2014-77 SIP2014-118 SP2014-140 |
[more] |
EA2014-77 SIP2014-118 SP2014-140 pp.31-36 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Prosody Correction Preserving Speaker Individuality in English-Read-By-Japanese Speech Synthesis Based on HMM Yuji Oshima, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2014-112 |
To build an English acoustic model that well captures speaker individuality of each Japanese speaker, a framework using ... [more] |
SP2014-112 pp.63-68 |