Wed, Jan 30 PM 13:30 - 15:00 |
(1) |
13:30-14:00 |
A Preliminary Investigation on Improving Chinese Pinyin-to-character Conversion Using MI Based Automatic Lexical Formation |
Jinsong Zhang (Beijing Language and Culture Univ./NICT), Wei Li (Beijing Language and Culture Univ.), Xiaoyun Wang, Masafumi Nishida, Seiichi Yamamoto (Doshisha Univ.) |
(2) |
14:00-14:30 |
A Study on Perceptual Training of Mandarin Tone 2 and Tone 3 by Japanese Learners |
Jinsong Zhang (Beijing Language and Culture Univ./NICT), Yue Sun (Beijing Language and Culture Univ.), Ting Zou (Leiden Univ.), Xiaoyun Wang, Masafumi Nishida, Seiichi Yamamoto (Doshisha Univ.) |
(3) |
14:30-15:00 |
Detection Method of Utterances out-of-Scope for Dialogue-based CALL Systems trained with Learner Corpus |
Yu Nagai, Xiaoyun Wang, Masafumi Nishida, Seiichi Yamamoto (Doshisha Univ.) |
|
15:00-15:15 |
Break ( 15 min. ) |
Wed, Jan 30 PM 15:15 - 17:15 |
(4) |
15:15-15:45 |
Speaker Recognition Using Formant of Vowels |
Naoyuki Urakami, Yuta Shoji, Jun Shiraishi, Hironori Yamauchi, Yohei Fukumizu, Tomonori Izumi (Ritsumeikan Univ) |
(5) |
15:45-16:15 |
A Study on Speaker Recognition Based on Decomposition of Periodic and Aperiodic Components |
Yuki Ishikawa, Masafumi Nishida (Doshisha Univ.), Masakiyo Fujimoto (NTT), Seiichi Yamamoto (Doshisha Univ.) |
(6) |
16:15-16:45 |
Improvement of context label in HMM-based speech synthesis for Japanese |
Hiroya Hashimoto, Keikichi Hirose, Nobuaki Minematsu (Univ. of Tokyo) |
(7) |
16:45-17:15 |
F0 contour generation using rich context models in HMM-based speech synthesis |
Shinnosuke Takamichi, Tomoki Toda (NAIST), Yoshinori Shiga (NICT), Sakriani Sakti, Graham Neubig, Satoshi Nakamura (NAIST) |
Thu, Jan 31 AM 10:00 - 12:00 |
(8) |
10:00-10:30 |
Consideration of relationship between auditory impression and acoustic feature of death growl and scream singing voice |
Keizo Kato, Akinori Ito (Tohoku Univ.) |
(9) |
10:30-11:00 |
The construction of an evaluation scale for singing voice of popular music
-- in Amateur singing voice -- |
Ai Kanato, Hideaki Kikuchi (Waseda Univ.) |
(10) |
11:00-11:30 |
Investigation of correlation between temporal fluctuations of F0 and spectrum in scream vocal style |
Hironobu Nishiwaki, Hideki Banno, Kensaku Asahi (Meijo Univ.) |
(11) |
11:30-12:00 |
Proposals of vibrato feature to reflect magnitude of fluctuations of fundamental frequency and power in singing voice and evaluation method of the feature |
Chifumi Suzuki, Hideki Banno, Kensaku Asahi, Fumitada Itakura (Meijo Univ), Masanori Morise (Ritsumeikan Univ) |
Thu, Jan 31 PM 13:00 - 14:00 |
(12) |
13:00-14:00 |
[Invited Talk]
Speaker and style diversification in statistical parametric speech synthesis |
Takashi Nose (Tokyo Inst. of Tech.) |
|
14:00-14:15 |
Break ( 15 min. ) |
Thu, Jan 31 PM 14:15 - 16:45 |
(13) |
14:15-14:45 |
A study on speaker-normalized style conversion for arbitrary speaker's expressive speech synthesis |
Hiroki Kanagawa, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) |
(14) |
14:45-15:15 |
A Study on Style Control Based on Multiple-Regression HSMM for Synthesizing Singing Voices with Various Expressivity |
Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) |
(15) |
15:15-15:45 |
A Study on Multi-class Local Prosodic Context for Expressive Prosody Generation |
Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama (Tokyo Inst. of Tech.), Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka (NTT) |
(16) |
15:45-16:15 |
Labeling of Spoken Dialog for Paralinguistic Information Processing |
Tomoyuki Shimakawa, Masanori Morise, Yoichi Yamashita (Ritsumeikan Univ.) |
(17) |
16:15-16:45 |
Evaluation and Automatic Estimation of Voice Characteristic Similarity Using Isolated Vowels |
Shohei Tsujimura, Masanori Morise, Yoichi Yamashita (Ritsumeikan Univ.) |