Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Voice conversion based on deep neural network with multiple output sub-networks Tetsuya Hashimoto, Yosuke Kashiwagi, Daisuke Saito, Keikichi Hirose, Nobuaki Minematsu (Univ. of Tokyo) SP2014-117 |
(Advance abstract in Japanese is available) [more] |
SP2014-117 pp.99-104 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Speaker adaptation using speaker-normalized DNN based on speaker codes Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-118 |
Recently, deep neural network (DNN) becomes one of the main streams of acoustic modeling for automatic speech recognitio... [more] |
SP2014-118 pp.105-110 |
SP |
2014-11-14 10:30 |
Fukuoka |
Kyushu Univ. Chikushi Campus |
A study of speaker normalization based on voice conversion for statistical acoustic-to-articulatory mapping Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-100 |
[more] |
SP2014-100 pp.59-64 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Visualization of World Englishes pronunciations from a speaker's self-centered viewpoint using attributes of accent, gender, and age Yuji Kawase, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (UTokyo), Han-Ping Shen (NCKU) SP2014-12 |
English is the only language available for global communication and is known to have a large diversity of pronunciations... [more] |
SP2014-12 pp.127-132 |
SP |
2014-02-28 11:00 |
Tokushima |
The University of Tokushima |
Prediction of pronunciation distances based on structural representation for clustering World Englishes Shun Kasahara, Nobuaki Minematsu (Univ. of Tokyo), HanPing Shen (NCKU), Takehiko Makino (Chuo univ.), Daisuke Saito, Keikichi Hirose (Univ. of Tokyo) SP2013-109 |
The term of World Englishes is often used to indicate the current state of English as international language.It claims t... [more] |
SP2013-109 pp.13-18 |
SP, IPSJ-SLP |
2013-12-20 16:40 |
Tokyo |
|
Speech synthesis technologies that supports OJAD
-- An example of applying speech synthesis techniques not to synthsizing speech -- Nobuaki Minematsu, Ibuki Nakamura, Hiroya Hashimoto, Keikichi Hirose (Univ. Tokyo) SP2013-95 |
To support Japanese prosody instruction, the Online Japanese Accent Dictionary (OJAD) has been developed by using NLP an... [more] |
SP2013-95 pp.129-134 |
SP |
2013-11-22 10:15 |
Nara |
Nara Institute of Science and Technology |
[Invited Talk]
Computer assisted foreign language teaching and learning by using speech technologies Nobuaki Minematsu (Univ. of Tokyo) SP2013-77 |
Early works aiming at technical support of teaching and learning pronunciation of foreign language can be found in the 1... [more] |
SP2013-77 pp.37-42 |
SP |
2013-11-22 13:30 |
Nara |
Nara Institute of Science and Technology |
Automatic Prediction of Intelligibility of Spoken Words in Japanese Accented English Teeraphon Pongkittiphan, Nobuaki Minematsu (Univ. Tokyo), Takehiko Makino (Chuo Univ.), Han-Ping Shen (National Cheng Kung Univ.), Keikichi Hirose (Univ. Tokyo) SP2013-79 |
This study examines automatic prediction of the words that will be unintelligible if they are spoken by Japanese speaker... [more] |
SP2013-79 pp.47-52 |
SP, IPSJ-SLP (Joint) |
2013-07-25 17:30 |
Miyagi |
Soho (togatta spa) |
Voice conversion using piecewise linear transformation with long-span features and state discrimination of the target speaker Jun Ikeshima, Masayuki Suzuki, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2013-56 |
A method has been proposed for voice conversion. It is based on piecewise linear transformation with long time-span fea... [more] |
SP2013-56 pp.19-24 |
SP, EA, SIP |
2013-05-16 14:25 |
Okayama |
|
Artificial bandwidth extension based on REDIAL Nguyen Ducduy, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) EA2013-7 SIP2013-7 SP2013-7 |
Artificial Bandwidth Extension (ABE) has been introduced to improve perceived speech quality and intelligibility of narr... [more] |
EA2013-7 SIP2013-7 SP2013-7 pp.37-42 |
SP |
2013-02-28 13:10 |
Aichi |
Daido University |
Development of an online accent dictionary and a reading tutor to support teaching and learning of Japanese prosody Nobuaki Minematsu, Ibuki Nakamura, Masayuki Suzuki (Univ. of Tokyo), Hiroko Hirano (JHFLI), Chieko Nakagawa (Waseda Univ.), Noriko Nakamura (Keio Univ.), Yukinori Tagawa (Waseda Univ.), Hiroya Hashimoto, Keikichi Hirose (Univ. of Tokyo) SP2012-115 |
Through tight collaboration with teachers of Japanese, an online accent dictionary and a reading tutor are developed to ... [more] |
SP2012-115 pp.1-6 |
SP |
2013-02-28 13:40 |
Aichi |
Daido University |
Speaker-based pronunciation clustering of World Englishes based on pronunciation structure analysis Han-Ping Shen (National Cheng Kung Univ.), Nobuaki Minematsu (Univ. of Tokyo), Steven Weinberger (George Mason Univ.), Takehiko Makino (Chuo Univ.), Josef Novak, Teeraphon Pongkittiphan (Univ. of Tokyo), Chung-Hsien Wu (National Cheng Kung Univ.) SP2012-116 |
English is the only language available for global communication. Due to the influence of the students’ mother tongue, ho... [more] |
SP2012-116 pp.7-12 |
SP |
2013-02-28 14:10 |
Aichi |
Daido University |
Automatic detection of English sentence stress using discriminative models Shuhei Kato, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo), Yutaka Yamauchi (Tokyo International Univ.), Megumi Nishikawa (Tokai Univ.) SP2012-117 |
[more] |
SP2012-117 pp.13-18 |
SP |
2013-02-28 15:00 |
Aichi |
Daido University |
[Poster Presentation]
Voice femininity estimation for MtF patients using supervectors and SVR Chengshuo Wang, Masayuki Suzuki, Nobuaki Minematsu (Univ. of Tokyo), Kyoko Sakuraba (Dokkyo Medical Univ. Hospital), Keikichi Hirose (Univ. of Tokyo) SP2012-120 |
Femininity estimation of MtF (Male to Female) voices is technically implemented.Speaker characteristics are extracted as... [more] |
SP2012-120 pp.23-24 |
SP |
2013-01-30 16:15 |
Kyoto |
Doshisha Univ. |
Improvement of context label in HMM-based speech synthesis for Japanese Hiroya Hashimoto, Keikichi Hirose, Nobuaki Minematsu (Univ. of Tokyo) SP2012-103 |
An improved set of context labels was proposed for HMM-based speech synthesis of Japanese. The conventional labels incl... [more] |
SP2012-103 pp.31-36 |
SP, IPSJ-SLP |
2012-12-21 15:25 |
Tokyo |
TITECH(Ookayama) |
Syllable nucleus detection using waveform envelopes and modeling of the word acquisition process using word structures and syllable nuclei Yousuke Ozaki, Nobuaki Minematsu, Keikichi Hirose (The Univ. of Tokyo), Donna Erickson (Showa Univ. of Music) SP2012-94 |
Simulation of language acquisition processes is an active research area in speech and computer science. Here, models and... [more] |
SP2012-94 pp.113-118 |
SP |
2012-11-08 10:30 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Effects of speaker adaptive training on arbitrary speaker conversion based on tensor representation Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2012-72 |
In this paper, speaker adaptive training techniques are introduced to tensor-based arbitrary speaker conversion. In voic... [more] |
SP2012-72 pp.1-6 |
SP, IPSJ-SLP (Joint) |
2012-07-19 14:30 |
Yamagata |
Hotel Takinoyu (Yamagata Pref.) |
Automatic pronunciation error detecting of Chinese using SVM with structural features Tongmu Zhao (Univ. of Tokyo), Akemi Hoshino (Toyama National College of Tech.), Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2012-50 |
Pronunciation errors are often made by learners of a foreign language. To build a CALL (Computer-Aided Language Learnin... [more] |
SP2012-50 pp.1-6 |
SP |
2012-06-14 11:30 |
Kanagawa |
NTT Atsugi R&D Center |
Eigenvoice-based character conversion and its evaluation Teeraphon Pongkittiphan, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2012-34 |
This paper describes a new method of voice conversion,
which aims at character conversion based on eigenvoice
GMM (EV-... [more] |
SP2012-34 pp.7-12 |
EA, SP, SIP |
2012-05-25 14:10 |
Osaka |
Osaka Univ. Nakanoshima Center |
SPLICE leveraging lip image information for ASR in noisy environments. Yosuke Kashiwagi, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) EA2012-27 SIP2012-27 SP2012-27 |
[more] |
EA2012-27 SIP2012-27 SP2012-27 pp.155-160 |