Mon, Dec 21 AM 10:00 - 10:10 |
|
- |
|
Mon, Dec 21 AM 10:10 - 11:50 |
(1) |
10:10-10:35 |
Speaker Adaptation Using Nonlinear Spectral Transformation For Speech Recognition. |
Toyohiro Hayashi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) |
(2) |
10:35-11:00 |
Experimental study of acoustic modeling using speaker-invariant speech contrast as modeling unit |
Daisuke Saito, Ryo Matsuura, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) |
(3) |
11:00-11:25 |
|
(4) |
11:25-11:50 |
|
Mon, Dec 21 PM 13:00 - 13:50 |
(5) |
13:00-13:25 |
Recent Evaluations of a WFST-Based Speech Recognition Decoder |
Paul R. Dixon, Josef R. Novak, Tasuku Oonishi, Sadaoki Furui (Tokyo Inst. of Tech.) |
(6) |
13:25-13:50 |
Evaluation of Search Error Risk Minimization in Viterbi Beam Search |
Takaaki Hori, Shinji Watanabe, Atsushi Nakamura (NTT Corp.) |
Mon, Dec 21 PM 13:50 - 14:40 |
(7) |
13:50-14:15 |
|
(8) |
14:15-14:40 |
|
Mon, Dec 21 PM 14:55 - 15:45 |
(9) |
14:55-15:45 |
[Invited Talk]
Something is Missing in Automatic Speech Recognition Research. |
Sadaoki Furui (Tokyo Inst. of Tech.) |
Mon, Dec 21 PM 15:45 - 17:00 |
|
- |
|
Tue, Dec 22 AM 09:30 - 10:45 |
(10) |
09:30-09:55 |
Detection of Irritation Using Pitch-Delta inside vowels and Utterance Intervals
-- Targeting the Accurate Detection -- |
Kazuhide Okada (Toyota) |
(11) |
09:55-10:20 |
Voice activity detection using conditional random fields with multiple features |
Akira Saito, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) |
(12) |
10:20-10:45 |
|
Tue, Dec 22 AM 11:00 - 12:15 |
(13) |
11:00-11:25 |
Sentence generation from keywords using N-gram for Spoken Dialog System |
Yoshitaka Yoshimi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) |
(14) |
11:25-11:50 |
|
(15) |
11:50-12:15 |
|
Tue, Dec 22 PM 13:15 - 14:05 |
(16) |
13:15-14:05 |
|
Tue, Dec 22 PM 14:20 - 15:35 |
(17) |
14:20-14:45 |
|
(18) |
14:45-15:10 |
Spoken Term Detection by Query Term Extension using Vocabulary on Web for Speech Query Terms |
Go Kuriki, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame (Iwate Pref. Univ.), Kazuyo Tanaka (Tsukuba Univ.), Shi-wook Lee (AIST) |
(19) |
15:10-15:35 |
|
Tue, Dec 22 PM 15:50 - 17:50 |
(20) |
15:50-17:50 |
|
(21) |
15:50-17:50 |
|
(22) |
15:50-17:50 |
|
(23) |
15:50-17:50 |
Spectral Subtraction Based on Series Expansion of Orthogonal Functions |
Taiji Akasaka, Tetsuya Shimamura (Saitama Univ.) |
(24) |
15:50-17:50 |
Spectral Representation of Double Autocorrelation Functions for Speech Signals and Its Application to Noisy Word Recognition System |
Nguyen Ngoc Dinh, Tetsuya Shimamura (Saitama Univ.) |
(25) |
15:50-17:50 |
HMM-based Speech Synthesis Using Quantized-F0-based Prosodic Context |
Koujirou Ooki, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) |
(26) |
15:50-17:50 |
Speech Conversion Method into Intelligible English Using Utterance of User |
Yuichi Koshiba, Akira Kurematsu, Katsuhiko Shirai (Waseda Univ.) |
(27) |
15:50-17:50 |
|
(28) |
15:50-17:50 |
Analysis and Synthesis of Voice with Distance Perspective |
Motoi Omachi, Kazuhiko Iwata, Tetsunori Kobayashi (Waseda Univ.) |
(29) |
15:50-17:50 |
A study on speech synthesis by modeling harmonics structure with Multi Beta Mixture Model |
Toru Nakashika (Kobe Univ.), Ryuki Tachibana, Masafumi Nishimura (IBM Japan), Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) |
(30) |
15:50-17:50 |
A study on Voice Conversion Based on F0 Quantization and Non-parallel Training |
Yuhei Ota, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) |
(31) |
15:50-17:50 |
Factor analysis models representing various voice characteristics for HMM based speech synthesis |
Kyosuke Kazumi, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) |
(32) |
15:50-17:50 |
|
(33) |
15:50-17:50 |
|
(34) |
15:50-17:50 |
Dysarthric Speech Recognition Using Pose-Robust Lip Area Feature Extraction Based on AAM and Acoustic Features |
Chikoto Miyamoto, Yuto Komai, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.), Ichao Li (Otemon Gakuin Univ.), Toshitaka Nakabayashi (Kobe Univ.) |
(35) |
15:50-17:50 |
|
(36) |
15:50-17:50 |
A speech-oriented information kiosk based on user-generated dialog contents |
Toshinori Fukuta, Yoshitaka Yoshimi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) |
(37) |
15:50-17:50 |
|
(38) |
15:50-17:50 |
|
(39) |
15:50-17:50 |
On relationship between speech bandwidth and word intelligibility in noisy environment |
Sachiko Kurihara, Yusuke Hiwasaki, Shigeaki Sasaki, Yoichi Haneda (NTT Corp.) |
(40) |
15:50-17:50 |
One study of reasonable acoustic attachment for mute image
-- To Target accurate Automatic Iipreading -- |
Kazuhide Okada (Toyota) |
(41) |
15:50-17:50 |
|