|
Chair |
|
Satoru Hayamizu (Gifu Univ.) |
Vice Chair |
|
Hisashi Kawai (KDDI Labs.) |
Secretary |
|
Hiroki Mori (Utsunomiya Univ.), Motoyuki Suzuki (Osaka Inst. of Tech.) |
Assistant |
|
Masakiyo Fujimoto (NTT), Yamato Ohtani (Toshiba) |
|
Conference Date |
Wed, Jan 30, 2013 13:30 - 17:15
Thu, Jan 31, 2013 10:00 - 16:45 |
Topics |
Speech, Language, and Dialogue, etc. |
Conference Place |
Kyotanabe Campus, Doshisha University |
Address |
1-3 Tatara Miyakodani, Kyotanabe City, 610-0394 Japan |
Transportation Guide |
15 minutes on foot from the Kintetsu Kodo Station http://www.doshisha.ac.jp/english/access/tanabe-access.html |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Wed, Jan 30 PM 13:30 - 15:00 |
(1) |
13:30-14:00 |
A Preliminary Investigation on Improving Chinese Pinyin-to-character Conversion Using MI Based Automatic Lexical Formation SP2012-98 |
Jinsong Zhang (Beijing Language and Culture Univ./NICT), Wei Li (Beijing Language and Culture Univ.), Xiaoyun Wang, Masafumi Nishida, Seiichi Yamamoto (Doshisha Univ.) |
(2) |
14:00-14:30 |
A Study on Perceptual Training of Mandarin Tone 2 and Tone 3 by Japanese Learners SP2012-99 |
Jinsong Zhang (Beijing Language and Culture Univ./NICT), Yue Sun (Beijing Language and Culture Univ.), Ting Zou (Leiden Univ.), Xiaoyun Wang, Masafumi Nishida, Seiichi Yamamoto (Doshisha Univ.) |
(3) |
14:30-15:00 |
Detection Method of Utterances out-of-Scope for Dialogue-based CALL Systems trained with Learner Corpus SP2012-100 |
Yu Nagai, Xiaoyun Wang, Masafumi Nishida, Seiichi Yamamoto (Doshisha Univ.) |
|
15:00-15:15 |
Break ( 15 min. ) |
Wed, Jan 30 PM 15:15 - 17:15 |
(4) |
15:15-15:45 |
Speaker Recognition Using Formant of Vowels SP2012-101 |
Naoyuki Urakami, Yuta Shoji, Jun Shiraishi, Hironori Yamauchi, Yohei Fukumizu, Tomonori Izumi (Ritsumeikan Univ) |
(5) |
15:45-16:15 |
A Study on Speaker Recognition Based on Decomposition of Periodic and Aperiodic Components SP2012-102 |
Yuki Ishikawa, Masafumi Nishida (Doshisha Univ.), Masakiyo Fujimoto (NTT), Seiichi Yamamoto (Doshisha Univ.) |
(6) |
16:15-16:45 |
Improvement of context label in HMM-based speech synthesis for Japanese SP2012-103 |
Hiroya Hashimoto, Keikichi Hirose, Nobuaki Minematsu (Univ. of Tokyo) |
(7) |
16:45-17:15 |
F0 contour generation using rich context models in HMM-based speech synthesis SP2012-104 |
Shinnosuke Takamichi, Tomoki Toda (NAIST), Yoshinori Shiga (NICT), Sakriani Sakti, Graham Neubig, Satoshi Nakamura (NAIST) |
Thu, Jan 31 AM 10:00 - 12:00 |
(8) |
10:00-10:30 |
Consideration of relationship between auditory impression and acoustic feature of death growl and scream singing voice SP2012-105 |
Keizo Kato, Akinori Ito (Tohoku Univ.) |
(9) |
10:30-11:00 |
The construction of an evaluation scale for singing voice of popular music
-- in Amateur singing voice -- SP2012-106 |
Ai Kanato, Hideaki Kikuchi (Waseda Univ.) |
(10) |
11:00-11:30 |
Investigation of correlation between temporal fluctuations of F0 and spectrum in scream vocal style SP2012-107 |
Hironobu Nishiwaki, Hideki Banno, Kensaku Asahi (Meijo Univ.) |
(11) |
11:30-12:00 |
Proposals of vibrato feature to reflect magnitude of fluctuations of fundamental frequency and power in singing voice and evaluation method of the feature SP2012-108 |
Chifumi Suzuki, Hideki Banno, Kensaku Asahi, Fumitada Itakura (Meijo Univ), Masanori Morise (Ritsumeikan Univ) |
Thu, Jan 31 PM 13:00 - 14:00 |
(12) |
13:00-14:00 |
[Invited Talk]
Speaker and style diversification in statistical parametric speech synthesis SP2012-109 |
Takashi Nose (Tokyo Inst. of Tech.) |
|
14:00-14:15 |
Break ( 15 min. ) |
Thu, Jan 31 PM 14:15 - 16:45 |
(13) |
14:15-14:45 |
A study on speaker-normalized style conversion for arbitrary speaker's expressive speech synthesis SP2012-110 |
Hiroki Kanagawa, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) |
(14) |
14:45-15:15 |
A Study on Style Control Based on Multiple-Regression HSMM for Synthesizing Singing Voices with Various Expressivity SP2012-111 |
Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) |
(15) |
15:15-15:45 |
A Study on Multi-class Local Prosodic Context for Expressive Prosody Generation SP2012-112 |
Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama (Tokyo Inst. of Tech.), Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka (NTT) |
(16) |
15:45-16:15 |
Labeling of Spoken Dialog for Paralinguistic Information Processing SP2012-113 |
Tomoyuki Shimakawa, Masanori Morise, Yoichi Yamashita (Ritsumeikan Univ.) |
(17) |
16:15-16:45 |
Evaluation and Automatic Estimation of Voice Characteristic Similarity Using Isolated Vowels SP2012-114 |
Shohei Tsujimura, Masanori Morise, Yoichi Yamashita (Ritsumeikan Univ.) |
Announcement for Speakers |
General Talk | Each speech will have 25 minutes for presentation and 5 minutes for discussion. |
Invited Talk | Each speech will have 50 minutes for presentation and 10 minutes for discussion. |
Contact Address and Latest Schedule Information |
SP |
Technical Committee on Speech (SP) [Latest Schedule]
|
Contact Address |
Hiroki Mori (Utsunomiya University)
E-: spee- |
Last modified: 2013-01-25 09:40:32
|
Notification: Mail addresses are partially hidden against SPAM.
|