Print edition: ISSN 0913-5685
Online edition: ISSN 2432-6380
SP2012-85
Automatic Vocabulary Adaptation for Speech Recognition based on Semantic Similarity and Confidence Measure
Shoko Yamahata, Yoshikazu Yamaguchi, Atsunori Ogawa, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi (NTT)
pp. 1 - 6
SP2012-86
[Invited Talk]
Towards Integrated Processing of Speech and Image Information
Yasuo Ariki (Kobe University)
pp. 27 - 32
SP2012-87
[Invited Talk]
Making A Technology Seem Natural
Eric Chang (Microsoft Research Asia)
pp. 33 - 34
SP2012-88
Recent efforts for high-performance multi-modal speech recognition
Satoshi Tamura, Peng Shen, Hiroya Okuda, Naoya Ukai, Takuya Kawasaki, Takumi Seko, Satoru Hayamizu (Gifu Univ.)
pp. 41 - 46
SP2012-89
Normalization of EMA data
-- tongue movement during articulation of consonant clusters --
Seiya Funatsu (Prefectural Univ. of Hiroshima), Masako Fujimoto (NINJAL)
pp. 59 - 64
SP2012-90
Fundamental frequency estimation combining air conducted speech with bone conducted speech
Kosuke Osa, Tetsuya Shimamura (Saitama Univ.)
pp. 65 - 70
SP2012-91
Relative amplitude between consonant and vowel of Bone Conducted speech
Tatsuya Kato, Tetsuya Shimamura (Saitama Univ.)
pp. 71 - 74
SP2012-92
Interpolation of unlearned position based on local regression for single-channel talker localization using acoustic transfer function
Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.)
pp. 75 - 80
SP2012-93
Reduction of cross spectrum for feature-domain sound source separation
Atsushi Ando (Nagoya Univ.), Kenta Niwa (NTT), Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.)
pp. 107 - 112
SP2012-94
Syllable nucleus detection using waveform envelopes and modeling of the word acquisition process using word structures and syllable nuclei
Yousuke Ozaki, Nobuaki Minematsu, Keikichi Hirose (The Univ. of Tokyo), Donna Erickson (Showa Univ. of Music)
pp. 113 - 118
SP2012-95
Sparse Coding-Based Voice Conversion from Lip Information
Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.)
pp. 119 - 124
SP2012-96
Two-step Correction of the Speech Recognition Result based on Syntax and Semantics
Ryohei Nakatani, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.)
pp. 149 - 154
SP2012-97
Automatic Speech Translation System Selecting Target Language by Direction of Arrival Information
Masanori Tsujikawa, Koji Okabe, Ken Hanazawa (NEC)
pp. 161 - 165
Note: Each article is a technical report without peer review, and its polished version will be published elsewhere.