Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, NLC |
2009-12-21 10:10 |
Tokyo |
Univ. of Tokyo |
Speaker Adaptation Using Nonlinear Spectral Transformation For Speech Recognition. Toyohiro Hayashi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) NLC2009-12 SP2009-76 |
This paper proposes a speaker adaptation technique using nonlinear spectral transform based on GMMs.
One of the most po... [more] |
NLC2009-12 SP2009-76 pp.1-6 |
SP, NLC |
2009-12-21 10:35 |
Tokyo |
Univ. of Tokyo |
Experimental study of acoustic modeling using speaker-invariant speech contrast as modeling unit Daisuke Saito, Ryo Matsuura, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) NLC2009-13 SP2009-77 |
Speech acoustics vary due to differences in age, gender, vocal tract length, microphone, and so on. The authors recently... [more] |
NLC2009-13 SP2009-77 pp.7-12 |
SP, NLC |
2009-12-21 13:00 |
Tokyo |
Univ. of Tokyo |
Recent Evaluations of a WFST-Based Speech Recognition Decoder Paul R. Dixon, Josef R. Novak, Tasuku Oonishi, Sadaoki Furui (Tokyo Inst. of Tech.) NLC2009-14 SP2009-78 |
This paper describes the latest performance evaluations on the Tokyo Tech Transducer-based (T3) speech decoder. These ev... [more] |
NLC2009-14 SP2009-78 pp.25-30 |
SP, NLC |
2009-12-21 13:25 |
Tokyo |
Univ. of Tokyo |
Evaluation of Search Error Risk Minimization in Viterbi Beam Search Takaaki Hori, Shinji Watanabe, Atsushi Nakamura (NTT Corp.) NLC2009-15 SP2009-79 |
[more] |
NLC2009-15 SP2009-79 pp.31-35 |
SP, NLC |
2009-12-21 14:55 |
Tokyo |
Univ. of Tokyo |
[Invited Talk]
Something is Missing in Automatic Speech Recognition Research. Sadaoki Furui (Tokyo Inst. of Tech.) NLC2009-16 SP2009-80 |
I have been working on automatic speech recognition (ASR) research for almost 40 years. Although ASR technology has mad... [more] |
NLC2009-16 SP2009-80 pp.49-54 |
SP, NLC |
2009-12-22 09:30 |
Tokyo |
Univ. of Tokyo |
Detection of Irritation Using Pitch-Delta inside vowels and Utterance Intervals
-- Targeting the Accurate Detection -- Kazuhide Okada (Toyota) NLC2009-17 SP2009-81 |
As one function of human interface detection by voice there is a irritation detection which requires to encourage or com... [more] |
NLC2009-17 SP2009-81 pp.55-58 |
SP, NLC |
2009-12-22 09:55 |
Tokyo |
Univ. of Tokyo |
Voice activity detection using conditional random fields with multiple features Akira Saito, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) NLC2009-18 SP2009-82 |
Voice Activity Detection (VAD) which is a technique to distinguish between speech and non-speech is used in noisy enviro... [more] |
NLC2009-18 SP2009-82 pp.59-64 |
SP, NLC |
2009-12-22 11:00 |
Tokyo |
Univ. of Tokyo |
Sentence generation from keywords using N-gram for Spoken Dialog System Yoshitaka Yoshimi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) NLC2009-19 SP2009-83 |
A probabilistic answer selection on spoken dialog system requires a lot of question-and-answer pair as a training data. ... [more] |
NLC2009-19 SP2009-83 pp.71-76 |
SP, NLC |
2009-12-22 14:45 |
Tokyo |
Univ. of Tokyo |
Spoken Term Detection by Query Term Extension using Vocabulary on Web for Speech Query Terms Go Kuriki, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame (Iwate Pref. Univ.), Kazuyo Tanaka (Tsukuba Univ.), Shi-wook Lee (AIST) NLC2009-20 SP2009-84 |
According to the increase of multimedia contents, many users want a new function to watch the video section of their int... [more] |
NLC2009-20 SP2009-84 pp.99-104 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
Spectral Subtraction Based on Series Expansion of Orthogonal Functions Taiji Akasaka, Tetsuya Shimamura (Saitama Univ.) NLC2009-21 SP2009-85 |
In this paper, spectral subtraction based on series expansion of orthogonal functions is proposed. we estimate the power... [more] |
NLC2009-21 SP2009-85 pp.129-134 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
Spectral Representation of Double Autocorrelation Functions for Speech Signals and Its Application to Noisy Word Recognition System Nguyen Ngoc Dinh, Tetsuya Shimamura (Saitama Univ.) NLC2009-22 SP2009-86 |
In this paper, we propose a new spectral representation for noisy word recognition system, that we call Spectral Represe... [more] |
NLC2009-22 SP2009-86 pp.135-140 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
HMM-based Speech Synthesis Using Quantized-F0-based Prosodic Context Koujirou Ooki, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) NLC2009-23 SP2009-87 |
This paper describes a technique for an HMM-based speech synthesis without using any manual labeling of accent informati... [more] |
NLC2009-23 SP2009-87 pp.141-146 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
Speech Conversion Method into Intelligible English Using Utterance of User Yuichi Koshiba, Akira Kurematsu, Katsuhiko Shirai (Waseda Univ.) NLC2009-24 SP2009-88 |
Due to recent international and diverse use of English, unintelligible English utterance is increasing. To solve this pr... [more] |
NLC2009-24 SP2009-88 pp.147-152 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
Analysis and Synthesis of Voice with Distance Perspective Motoi Omachi, Kazuhiko Iwata, Tetsunori Kobayashi (Waseda Univ.) NLC2009-25 SP2009-89 |
This paper describes the characteristics of voices uttered with different distance perspectives.
When we use a loud voi... [more] |
NLC2009-25 SP2009-89 pp.159-163 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
A study on speech synthesis by modeling harmonics structure with Multi Beta Mixture Model Toru Nakashika (Kobe Univ.), Ryuki Tachibana, Masafumi Nishimura (IBM Japan), Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) NLC2009-26 SP2009-90 |
There are currently some researches related to speech synthesis, but here we present a new framework
for speech synthes... [more] |
NLC2009-26 SP2009-90 pp.165-170 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
A study on Voice Conversion Based on F0 Quantization and Non-parallel Training Yuhei Ota, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) NLC2009-27 SP2009-91 |
This paper presents a novel voice conversion technique using HMM-based phoneme recognition and speech synthesis with non... [more] |
NLC2009-27 SP2009-91 pp.171-176 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
Factor analysis models representing various voice characteristics for HMM based speech synthesis Kyosuke Kazumi, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) NLC2009-28 SP2009-92 |
This paper describes factor analysis models for realizing
various voice characteristics in the HMM-based speech synthe... [more] |
NLC2009-28 SP2009-92 pp.177-182 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
Dysarthric Speech Recognition Using Pose-Robust Lip Area Feature Extraction Based on AAM and Acoustic Features Chikoto Miyamoto, Yuto Komai, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.), Ichao Li (Otemon Gakuin Univ.), Toshitaka Nakabayashi (Kobe Univ.) NLC2009-29 SP2009-93 |
We investigated the speech recognition of a person with articulation disorders resulting from athetoid cerebral palsy. T... [more] |
NLC2009-29 SP2009-93 pp.195-200 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
A speech-oriented information kiosk based on user-generated dialog contents Toshinori Fukuta, Yoshitaka Yoshimi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inst. of Tech.) NLC2009-30 SP2009-94 |
On the development of a spoken dialog system, the system developer has to build and customize the contents for the targe... [more] |
NLC2009-30 SP2009-94 pp.207-212 |
SP, NLC |
2009-12-22 15:50 |
Tokyo |
Univ. of Tokyo |
On relationship between speech bandwidth and word intelligibility in noisy environment Sachiko Kurihara, Yusuke Hiwasaki, Shigeaki Sasaki, Yoichi Haneda (NTT Corp.) NLC2009-31 SP2009-95 |
To quantify the relationship between intelligibility and frequency bandwidth of speech in transmission systems, word int... [more] |
NLC2009-31 SP2009-95 pp.225-230 |