Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP |
2019-08-28 14:40 |
Kyoto |
Kyoto Univ. |
[Poster Presentation]
An investigation on training of WaveNet vocoder in end-to-end text-to-speech Kazuki Yasuhara, Tomoki Hayashi, Tomoki Toda (Nagoya Univ.) SP2019-14 |
In this paper, we investigate the training of WaveNet vocoder in end-to-end text-to-speech. Tacotron 2, which is an end-... [more] |
SP2019-14 pp.31-36 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Robustness of statistical voice conversion based on waveform modification against external noise Yusuke Kurita, Kazuhiro Kobayashi, Kazuya Takeda (Nagoya Univ.), Tomoki Toda (Nagoya Univ./JST PRESTO) EA2018-153 SIP2018-159 SP2018-115 |
In this report, we investigate the statistical voice conversion (VC) under noisy environments.
VC achieves conversion f... [more] |
EA2018-153 SIP2018-159 SP2018-115 pp.317-322 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
An Evaluation of Underdetermined Source Separation Based on Multichannel Variational Autoencoder Shogo Seki (Nagoya Univ.), Hirokazu Kameoka (NTT), Li Li (Univ. Tsukuba), Tomoki Toda, Kazuya Takeda (Nagoya Univ.) EA2018-154 SIP2018-160 SP2018-116 |
This paper deals with a multichannel audio source separation problem under underdetermined conditions. Multichannel Non-... [more] |
EA2018-154 SIP2018-160 SP2018-116 pp.323-328 |
SP |
2018-08-27 11:35 |
Kyoto |
Kyoto Univ. |
[Poster Presentation]
Discrimination of pharyngeal residue using swallowing sound in dysphagia diagnosis Tatsunori Uchino, Atsushi Hashizume, Masahisa Katsuno, Tomoki Toda (Nagoya Univ.) SP2018-27 |
The measurement of pharyngeal residue with X-ray fluoroscopy is widely used as a typical diagnosis method of swallowing ... [more] |
SP2018-27 pp.23-27 |
SP |
2018-08-27 15:55 |
Kyoto |
Kyoto Univ. |
Sound Event Encoder Using Onomatopoeic Representations based on End-to-End Approach Koichi Miyazaki, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2018-30 |
In this paper, we propose a sound event encoder for converting sound events into their onomatopoeic representations. The... [more] |
SP2018-30 pp.37-42 |
EA, ASJ-H |
2018-08-23 12:55 |
Miyagi |
Tohoku Gakuin Univ. |
Self-produced speech enhancement and suppression method with wearable air- and body-conductive microphones Moe Takada, Shogo Seki, Tomoki Toda (Nagoya Univ.) EA2018-29 |
This paper presents a self-produced speech enhancement and suppression method for multichannel signals recorded with bot... [more] |
EA2018-29 pp.7-12 |
PRMU, SP |
2018-06-28 15:10 |
Nagano |
|
Multimodal voice conversion using deep bottleneck features and deep canonical correlation analysis Satoshi Tamura, Kento Horio, Hajime Endo, Satoru Hayamizu (Gifu Univ.), Tomoki Toda (Nagoya Univ.) PRMU2018-24 SP2018-4 |
In this paper, we aim at improving the speech quality in voice conversion and propose a novel multi-modal voice conversi... [more] |
PRMU2018-24 SP2018-4 pp.13-18 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
Development of NU Voice Conversion System 2018 Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi (Nagoya Univ.), Tomoki Toda (Nagoya Univ./JST PRESTO) EA2017-138 SIP2017-147 SP2017-121 |
This paper presents NU (Nagoya University) voice conversion (VC) system for the HUB task of Voice
Conversion Challenge ... [more] |
EA2017-138 SIP2017-147 SP2017-121 pp.203-208 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
An investigation of singing voice separation methods for a statistical approach to singing voice modification in music Tomoya Yamada, Shogo Seki, Kazuhiro Kobayashi, Tomoki Toda (Nagoya Univ.) EA2017-139 SIP2017-148 SP2017-122 |
[more] |
EA2017-139 SIP2017-148 SP2017-122 pp.209-214 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
A Hybrid Approach on Electrolaryngeal Speech Enhancement based on Spectral Differential Features and Noise Suppression Mohammad Eshghi, Kazuhiro Kobayashi, Tomoki Toda (Nagoya Univ.) EA2017-141 SIP2017-150 SP2017-124 |
This work presents a hybrid approach for enhancing the quality of the electrolaryngeal (EL) speech. Current hybrid enhan... [more] |
EA2017-141 SIP2017-150 SP2017-124 pp.221-226 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 14:45 |
Okinawa |
|
Development of NU non-parallel Voice Conversion System 2018 Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda (Nagoya Univ.) EA2017-172 SIP2017-181 SP2017-155 |
This paper introduces the NU non-parallel voice conversion (VC) system developed at Nagoya University for SPOKE task of ... [more] |
EA2017-172 SIP2017-181 SP2017-155 pp.385-390 |
SP, ASJ-H |
2018-01-21 13:30 |
Tokyo |
The University of Tokyo |
[Invited Talk]
Impact of WaveNet on Speech Synthesis Research Tomoki Toda (Nagoya Univ./JST) SP2017-80 |
[more] |
SP2017-80 p.79 |
SP, ASJ-H |
2018-01-21 14:45 |
Tokyo |
The University of Tokyo |
An investigation of multi-speaker WaveNet vocoder Tomoki Hayashi, Kazuhiro Kobayashi, Akira Tamamori, Kazuya Takeda, Tomoki Toda (Nagoya Univ.) SP2017-81 |
In this paper, we investigate a multi-speaker WaveNet vocoder. In our previous work, we have demonstrated that our propo... [more] |
SP2017-81 pp.81-86 |
SP, ASJ-H |
2018-01-21 15:10 |
Tokyo |
The University of Tokyo |
Statistical voice conversion with WaveNet vocoder Kazuhiro Kobayashi, Tomoki Hayashi, Akira Tamamori, Tomoki Toda (Nagoya Univ.) SP2017-82 |
[more] |
SP2017-82 pp.87-92 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-21 12:50 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Poster Presentation]
Development of Speaker/Environment-Dependent Acoustic Model for Non-Audible Murmur Recognition Based on DNN Adaptation Seita Noda, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2017-56 |
In this research, we aim to improve the performance of non-audible murmur (NAM) recognition towards the development of s... [more] |
SP2017-56 pp.7-10 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-21 12:50 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Poster Presentation]
An Evaluation of Speech Waveform Modification Methods towards Improvement of Speech Intelligibility in Noisy Environment Tomohiro Takeyama, Kazuhiro Kobayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2017-57 |
In this research, in order to improve speech intelligibility for a listener under the noisy environment, we propose a te... [more] |
SP2017-57 pp.11-16 |
EA, ASJ-H |
2017-07-20 13:40 |
Hokkaido |
Hokkaido Univ. |
Explicit Event Duration-Controlled BLSTM-HSMM Hybrid Model for Polyphonic Sound Event Detection Tomoki Hayashi (Nagoya Univ.), Shinji Watanabe (MERL), Tomoki Toda (Nagoya Univ.), Takaaki Hori, JonathanLe Roux (MERL), Kazuya Takeda (Nagoya Univ.) EA2017-2 |
This paper presents a new BLSTM-HSMM hybrid approach for polyphonic Sound Event Detection (SED). It builds upon a state-... [more] |
EA2017-2 pp.9-14 |
SP, SIP, EA |
2017-03-01 09:20 |
Okinawa |
Okinawa Industry Support Center |
Speech waveform synthesis based on WaveNet considering speech generation process Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) EA2016-82 SIP2016-137 SP2016-77 |
Our aim is to realize a new vocoder, which can resolve various constraints imposed on source-filter model and deal with ... [more] |
EA2016-82 SIP2016-137 SP2016-77 pp.1-6 |
SP, SIP, EA |
2017-03-01 09:45 |
Okinawa |
Okinawa Industry Support Center |
Nonaudible murmur enhancement based on non-negative tensor factorization with segment feature regularization in noisy environments Yusuke Tajiri (Nagoya Univ.), Hirokazu Kameoka (NTT), Tomoki Toda (Nagoya Univ.) EA2016-83 SIP2016-138 SP2016-78 |
Towards the development of silent speech communication, there has been studied a statistical approach to enhancing nonau... [more] |
EA2016-83 SIP2016-138 SP2016-78 pp.7-12 |
SP, SIP, EA |
2017-03-01 10:50 |
Okinawa |
Okinawa Industry Support Center |
Missing Component Restoration for Speech Spectrogram Based on Time-domain Signal Estimation Shogo Seki (Nagoya Univ.), Hirokazu Kameoka (NTT), Tomoki Toda, Kazuya Takeda (Nagoya Univ.) EA2016-85 SIP2016-140 SP2016-80 |
This study proposes a missing component restoration method for time-frequency masked speech spectrogram based on time-do... [more] |
EA2016-85 SIP2016-140 SP2016-80 pp.19-24 |