Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
WIT, SP, ASJ-H, PRMU |
2015-06-18 16:00 |
Niigata |
(Niigata) |
Noise-robust Prediction of Pronunciation Distances Aiming at Clustering of World Englishes Using a Learner's Self-centered Viewpoint Yuichi Sato, Yosuke Kashiwagi, Shun Kasahara, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (UT) PRMU2015-45 SP2015-14 WIT2015-14 |
In recent years, we have had more and more international tourists, and in 2020 we will have the Tokyo Olympic Games. For communicating... [more] |
PRMU2015-45 SP2015-14 WIT2015-14 pp.77-82 |
SP |
2015-01-22 14:40 |
Gifu |
Juroku Plaza (Gifu) |
Automatic prediction of intelligibility of English words spoken with Japanese accents
-- Comparative study of features and models used for prediction -- Teeraphon Pongkittiphan, Nobuaki Minematsu (Univ Tokyo), Takehiko Makino (Chuo Univ.), Daisuke Saito, Keikichi Hirose (Univ Tokyo) SP2014-132 |
This study investigates automatic prediction of the words in given sentences that will be unintelligible to American lis... [more] |
SP2014-132 pp.31-36 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 19:20 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) (Kanagawa) |
An experimental study of definitions of reference pronunciation distances and acoustic features used for distance prediction with the aim of pronunciation clustering Shun Kasahara (Univ. of Tokyo), Tianze Shi (Tsinghua Univ.), Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (Univ. of Tokyo) SP2014-110 |
“World Englishes” indicates well one aspect of the current state of English as an international language, which claims t... [more] |
SP2014-110 pp.47-52 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) (Kanagawa) |
Noise robust speech recognition by non-negative matrix factorization using GMM clustering in MFCC domain Kentaro Fujigaki, Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-113 |
Exemplar-based feature enhancement by non-negative matrix factorization (NMF) was proposed for noise-robust speech recog... [more] |
SP2014-113 pp.69-74 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) (Kanagawa) |
Voice conversion based on deep neural network with multiple output sub-networks Tetsuya Hashimoto, Yosuke Kashiwagi, Daisuke Saito, Keikichi Hirose, Nobuaki Minematsu (Univ. of Tokyo) SP2014-117 |
(Advance abstract in Japanese is available) [more] |
SP2014-117 pp.99-104 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) (Kanagawa) |
Speaker adaptation using speaker-normalized DNN based on speaker codes Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-118 |
Recently, the deep neural network (DNN) has become one of the mainstream approaches to acoustic modeling for automatic speech recognitio... [more] |
SP2014-118 pp.105-110 |
SP |
2014-11-14 10:30 |
Fukuoka |
Kyushu Univ. Chikushi Campus (Fukuoka) |
A study of speaker normalization based on voice conversion for statistical acoustic-to-articulatory mapping Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-100 |
[more] |
SP2014-100 pp.59-64 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
(Tokyo) |
Visualization of World Englishes pronunciations from a speaker's self-centered viewpoint using attributes of accent, gender, and age Yuji Kawase, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (UTokyo), Han-Ping Shen (NCKU) SP2014-12 |
English is the only language available for global communication and is known to have a large diversity of pronunciations... [more] |
SP2014-12 pp.127-132 |
SP |
2014-02-28 11:00 |
Tokushima |
The University of Tokushima (Tokushima) |
Prediction of pronunciation distances based on structural representation for clustering World Englishes Shun Kasahara, Nobuaki Minematsu (Univ. of Tokyo), Han-Ping Shen (NCKU), Takehiko Makino (Chuo Univ.), Daisuke Saito, Keikichi Hirose (Univ. of Tokyo) SP2013-109 |
The term World Englishes is often used to indicate the current state of English as an international language. It claims t... [more] |
SP2013-109 pp.13-18 |
SP, IPSJ-SLP |
2013-12-20 16:40 |
Tokyo |
(Tokyo) |
Speech synthesis technologies that support OJAD
-- An example of applying speech synthesis techniques not to synthesizing speech -- Nobuaki Minematsu, Ibuki Nakamura, Hiroya Hashimoto, Keikichi Hirose (Univ. Tokyo) SP2013-95 |
To support Japanese prosody instruction, the Online Japanese Accent Dictionary (OJAD) has been developed by using NLP an... [more] |
SP2013-95 pp.129-134 |
SP |
2013-11-22 13:30 |
Nara |
Nara Institute of Science and Technology (Nara) |
Automatic Prediction of Intelligibility of Spoken Words in Japanese Accented English Teeraphon Pongkittiphan, Nobuaki Minematsu (Univ. Tokyo), Takehiko Makino (Chuo Univ.), Han-Ping Shen (National Cheng Kung Univ.), Keikichi Hirose (Univ. Tokyo) SP2013-79 |
This study examines automatic prediction of the words that will be unintelligible if they are spoken by Japanese speaker... [more] |
SP2013-79 pp.47-52 |
SP, IPSJ-SLP (Joint) |
2013-07-25 17:30 |
Miyagi |
Soho (togatta spa) (Miyagi) |
Voice conversion using piecewise linear transformation with long-span features and state discrimination of the target speaker Jun Ikeshima, Masayuki Suzuki, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2013-56 |
A method has been proposed for voice conversion. It is based on piecewise linear transformation with long time-span fea... [more] |
SP2013-56 pp.19-24 |
SP, EA, SIP |
2013-05-16 14:25 |
Okayama |
(Okayama) |
Artificial bandwidth extension based on REDIAL Nguyen Ducduy, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) EA2013-7 SIP2013-7 SP2013-7 |
Artificial Bandwidth Extension (ABE) has been introduced to improve perceived speech quality and intelligibility of narr... [more] |
EA2013-7 SIP2013-7 SP2013-7 pp.37-42 |
SP, EA, SIP |
2013-05-16 14:50 |
Okayama |
(Okayama) |
Data Adaptive Pre-filtering with Spectral Weighting
-- Application to Single Channel Speech Enhancement -- Md. Khademul Islam Molla, Keikichi Hirose (Univ. of Tokyo) EA2013-8 SIP2013-8 SP2013-8 |
The spectral weighting for speech enhancement works better when the level of speech energy is available a priori. Diffe... [more] |
EA2013-8 SIP2013-8 SP2013-8 pp.43-48 |
SP |
2013-02-28 13:10 |
Aichi |
Daido University (Aichi) |
Development of an online accent dictionary and a reading tutor to support teaching and learning of Japanese prosody Nobuaki Minematsu, Ibuki Nakamura, Masayuki Suzuki (Univ. of Tokyo), Hiroko Hirano (JHFLI), Chieko Nakagawa (Waseda Univ.), Noriko Nakamura (Keio Univ.), Yukinori Tagawa (Waseda Univ.), Hiroya Hashimoto, Keikichi Hirose (Univ. of Tokyo) SP2012-115 |
Through tight collaboration with teachers of Japanese, an online accent dictionary and a reading tutor are developed to ... [more] |
SP2012-115 pp.1-6 |
SP |
2013-02-28 14:10 |
Aichi |
Daido University (Aichi) |
Automatic detection of English sentence stress using discriminative models Shuhei Kato, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo), Yutaka Yamauchi (Tokyo International Univ.), Megumi Nishikawa (Tokai Univ.) SP2012-117 |
[more] |
SP2012-117 pp.13-18 |
SP |
2013-02-28 15:00 |
Aichi |
Daido University (Aichi) |
[Poster Presentation]
Voice femininity estimation for MtF patients using supervectors and SVR Chengshuo Wang, Masayuki Suzuki, Nobuaki Minematsu (Univ. of Tokyo), Kyoko Sakuraba (Dokkyo Medical Univ. Hospital), Keikichi Hirose (Univ. of Tokyo) SP2012-120 |
Femininity estimation of MtF (Male to Female) voices is technically implemented. Speaker characteristics are extracted as... [more] |
SP2012-120 pp.23-24 |
SP |
2013-01-30 16:15 |
Kyoto |
Doshisha Univ. (Kyoto) |
Improvement of context label in HMM-based speech synthesis for Japanese Hiroya Hashimoto, Keikichi Hirose, Nobuaki Minematsu (Univ. of Tokyo) SP2012-103 |
An improved set of context labels was proposed for HMM-based speech synthesis of Japanese. The conventional labels incl... [more] |
SP2012-103 pp.31-36 |
SP, IPSJ-SLP |
2012-12-21 15:25 |
Tokyo |
TITECH (Ookayama) (Tokyo) |
Syllable nucleus detection using waveform envelopes and modeling of the word acquisition process using word structures and syllable nuclei Yousuke Ozaki, Nobuaki Minematsu, Keikichi Hirose (The Univ. of Tokyo), Donna Erickson (Showa Univ. of Music) SP2012-94 |
Simulation of language acquisition processes is an active research area in speech and computer science. Here, models and... [more] |
SP2012-94 pp.113-118 |
SP |
2012-11-08 10:30 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology (Miyagi) |
Effects of speaker adaptive training on arbitrary speaker conversion based on tensor representation Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2012-72 |
In this paper, speaker adaptive training techniques are introduced to tensor-based arbitrary speaker conversion. In voic... [more] |
SP2012-72 pp.1-6 |