Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, NLC |
2008-12-10 15:05 |
Tokyo |
Waseda Univ. |
An automatic transcription system for creation of meeting records in the Japanese Congress Yuya Akita, Masato Mimura, Tatsuya Kawahara (Kyoto Univ.) NLC2008-44 SP2008-99 |
We have been developping an automatic speech recognition (ASR) system for creation of meeting records of the National Co... [more] |
NLC2008-44 SP2008-99 pp.121-126 |
SP, NLC |
2008-12-10 15:30 |
Tokyo |
Waseda Univ. |
Effect of punctuation marks for speech translatio unit boundary detection Tohru Shimizu (NICT/ATR), Satoshi Nakamura (National Institute of Information and Communication), Tatsuya Kawahara (Kyoto University) NLC2008-45 SP2008-100 |
As automatic speech recognition and translation of long and complicated utterance cause more errors, there is increasing... [more] |
NLC2008-45 SP2008-100 pp.127-131 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Characteristics of pitch accents in infant-directed speech
-- An analysis of Riken Japanese Mother-Infant Conversation Corpus -- Mafuyu Kitahara (Waseda Univ.), Ken'ya Nishikawa (RIKEN/Keio Univ.), Yosuke Igarashi (NIJL/RIKEN), Takahito Shinya (Sophi Univ./RIKEN), Reiko Mazuka (RIKEN/Duke Univ.) NLC2008-46 SP2008-101 |
Pitch in R-JMICC and list reading data from the same group of speakers
were analyzed. Accentual pitch and maximum pitch... [more] |
NLC2008-46 SP2008-101 pp.133-136 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
The effect of associated conditions on the received emotional information transferred by sound effects Mari Sato, Kiyoaki Aikawa (Univ. of Technology) NLC2008-47 SP2008-102 |
This report describes an emotional communication model regarding music or pictures as code words incorporating the envir... [more] |
NLC2008-47 SP2008-102 pp.137-142 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Physical Model of the Vocal Tract with Flexible Velum Takayuki Arai, Kimi Tanaka (Sophia Univ.), Ryuta Kataoka (Showa Univ.) NLC2008-48 SP2008-103 |
We developed a physical model of the human vocal tract with a flexible velum using gel-type material. In this model, whi... [more] |
NLC2008-48 SP2008-103 pp.143-148 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Articulatory feature extraction based on 3-stage MLNs and Inhibition/Enhancement Network Mohammad Nurul Huda, Hiroaki Kawashima, Tsuneo Nitta (Toyohashi Univ. of Tech.) NLC2008-49 SP2008-104 |
This paper describes an articulatory feature extractor for accurate phoneme recognition. The proposed method comprises t... [more] |
NLC2008-49 SP2008-104 pp.149-154 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Parameter optimization for a fundamental frequency extractor based on TANDEM-STRAIGHT Hanae Itagaki, Masanori Morise, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara (Wakayama Univ.) NLC2008-50 SP2008-105 |
Abstract A fundamental frequency extractor based on a temporally stable power spectral representation for peri-odic sign... [more] |
NLC2008-50 SP2008-105 pp.155-160 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Study on Spectro-Temporal Features Based on Gradient Histograms Takashi Muroi, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) NLC2008-51 SP2008-106 |
This paper proposes a novel feature extraction method for speech recognition based on gradient features on 2-D time-freq... [more] |
NLC2008-51 SP2008-106 pp.161-165 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Automatic Speech Character Identification using Vocal Tract information Yusuke Watanabe, Naoki Matsumoto (Meiji Univ.) NLC2008-52 SP2008-107 |
In this paper, we propose a speech character identification system based on vocal tract information and three-layered ne... [more] |
NLC2008-52 SP2008-107 pp.167-172 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Evaluation of speaker identification/verification method using phase information Longbiao Wang (Shizuoka Univ.), Kazue Minami, Kazumasa Yamamoto, Seiichi Nakagawa (Toyohashi Univ. of Tech.) NLC2008-53 SP2008-108 |
[more] |
NLC2008-53 SP2008-108 pp.173-178 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Dialect-based speaker classification of Chinese using acoustic features invariant with extra-linguistic factors XueBin Ma, Nobuaki Minematsu, Yu Qiao, Keikichi Hirose (Univ. of Tokyo), Akira Nemoto (Nankai Univ.), Feng Shi (nankai Univ.) NLC2008-54 SP2008-109 |
Chinese dialects-based speaker classification using modern speech technologies is really a challenge, not only because t... [more] |
NLC2008-54 SP2008-109 pp.179-184 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method Tatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nitech) NLC2008-55 SP2008-110 |
[more] |
NLC2008-55 SP2008-110 pp.185-190 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Sudden noise reduction using dynamic speech feature model Nobuyuki Miyake, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) NLC2008-56 SP2008-111 |
This paper describes a method for reducing sudden noise using a static and dynamic speech feature model. We have propoes... [more] |
NLC2008-56 SP2008-111 pp.191-196 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Speech period detection using Hough transform of distance matrix images Hiroyuki Nishi, Yoshimasa Kimura, Nguyen Van Don (Sojo Univ.) NLC2008-57 SP2008-112 |
In this report, using an input voice and a standard patterns as both axes, the analysis method that uses the distance ma... [more] |
NLC2008-57 SP2008-112 pp.197-202 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Isolated word recognition based on speech structures and discriminant analysis Satoshi Asakawa, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) NLC2008-58 SP2008-113 |
Non-linguistic factors of speech such as vocal tract sizes and recording devices easily change
acoustic features of spe... [more] |
NLC2008-58 SP2008-113 pp.203-208 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Speech recognition using localized affine invariant features Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) NLC2008-59 SP2008-114 |
This paper proposes localized affine invariant features (LAIFs) for speaker-independent automatic speech recognition. Th... [more] |
NLC2008-59 SP2008-114 pp.209-214 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Tying covariance parameters for HMM-based speech synthesis Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Inusitute of Technology) NLC2008-60 SP2008-115 |
In this paper, we proposed a tying covariance technique in hidden Markov model (HMM) basedspeech
synthesis system. In r... [more] |
NLC2008-60 SP2008-115 pp.215-220 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Speech Recognition Based on Statistical Models Including Multiple Decision Trees Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Institute of Technology) NLC2008-61 SP2008-116 |
This paper proposes a speech recoginition technique using multiple decision trees. In the use of context dependent model... [more] |
NLC2008-61 SP2008-116 pp.221-226 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Recording system for controlling speaking rate (ReCoK5) and public domain speech database with speaking rate variations (SRV-DB) Kota Takahashi, Keigo Tsutaki, Toru Yoshihara (The University of Electro-Communications) NLC2008-62 SP2008-117 |
A specialized speech database is required for the studies of an efficient listening method for general people or a relia... [more] |
NLC2008-62 SP2008-117 pp.227-232 |
SP, NLC |
2008-12-10 16:10 |
Tokyo |
Waseda Univ. |
Speaking rate estimation and utterance analysis of fast speech for high-speed reproduction
-- A practical example of speech database with speaking rate variations -- Toru Yoshihara, Keigo Tsutaki, Kota Takahashi (The University of Electro-Communications) NLC2008-63 SP2008-118 |
A quantitative index is needed to estimate the optimum reproduction speed for high speed reproduction of various voices ... [more] |
NLC2008-63 SP2008-118 pp.233-238 |