IEICE Technical Report

Print edition: ISSN 0913-5685 / Online edition: ISSN 2432-6380

Speech

Workshop Date: 2011-12-19 - 2011-12-20 / Issue Date: 2011-12-12

SP2011-81
Extraction of new abbreviated words using Crowdsourcing System
Toshihiko Sakai (Kyushu Univ.), Masayuki Ashikawa (Toshiba), Sachio Hirokawa (Kyushu Univ.)
pp. 13 - 17

SP2011-82
Telephone conversations retrieval using Line Detection method in the Distance Matrix Images(LD-DMI)
Hiroyuki Nishi, Yuuki Yokobayashi, Haiyen, Yoshimasa Kimura, Toshio Kakinoki (Sojo Univ.)
pp. 33 - 38

SP2011-83
A study on language identification using non-negative matrix factorization as an extractor of phonotactic information
Tsuyoshi Ogata, Kazuyuki Takagi (UEC Tokyo)
pp. 45 - 48

SP2011-84
Phoneme Recognition based on AF-HMMs with Optimal State Configuration
Narpendyah W. Ariwardhani, Yurie Iribe, Kouichi Katsurada, Tsuneo Nitta (Toyohashi Univ. of Tech.)
pp. 49 - 54

SP2011-85
Concise representation of a matrix of basis functions for speech analysis and synthesis by using segmental NMF
Cheol Lee, Kazunori Mano (Shibaura Inst. of Tech.)
pp. 55 - 60

SP2011-86
Speaker identification using closed caption for scene retrieval in television broadcasting
Keita Yamamuro, Katunobu Itou (Hosei Univ.)
pp. 61 - 66

SP2011-87
Speaker Clustering Using Speaker Subspace Obtained Dynamically based on Variance of Intra-Utterance
Yuki Ishikawa, Masafumi Nishida, Seiichi Yamamoto (Doshisha Univ.)
pp. 67 - 71

SP2011-88
Study on extraction of vocal part in music signal by using non-negative matrix algorithm
Yuta Yasui, Hideki Banno, Fumitada Itakura (Meijo Univ.)
pp. 73 - 78

SP2011-89
A proposal of acoustic feature related to voice quality for estimation of similarity in singing voice
Chifumi Suzuki, Hideki Banno, Fumitada Itakura (Meijo Univ.), Masanori Morise (Ritsumeikan Univ.)
pp. 79 - 84

SP2011-90
[Poster Presentation] Digital Signals Gave Birth to the Grammars and the Abstract Concepts -- Autogenetic Multi-Stage Development of Human Vocal Communication System --
Kimiaki Tokumaru (System Engineer)
pp. 107 - 112

SP2011-91
Simultaneous application of speaker adaptation and noise mixture model estimation for noise suppression
Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani (NTT)
pp. 113 - 118

SP2011-92
GIF-SP: Improvement of Speech Recognition Using General and Discriminative Feature
Satoshi Tamura, Yoji Tagami, Satoru Hayamizu (Gifu Univ.)
pp. 119 - 124

SP2011-93
Speaker Verification Using MMAP Adaptation
Sangeeta Biswas, Johan Rohdin, Koichi Shinoda, Sadaoki Furui (Tokyo Inst. of Tech.)
pp. 133 - 137

SP2011-94
Error Correction Using CRF for Mis-Recognition around OOV Words on Speech Recognition Result
Ryohei Nakatani (Kobe Univ.), Naoto Iwahashi (NICT), Mikio Nakano (HRI-JP), Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.)
pp. 139 - 144

SP2011-95
[Invited Talk] Development of a framework for constructing spoken dialogue systems based on user-generated content
Keiichi Tokuda (NITech)
pp. 153 - 157

SP2011-96
An Open-Source Toolkit for Building Attractive Voice Interaction Systems -- MMDAgent
Akinobu Lee, Keiichiro Oura, Keiichi Tokuda (NITech)
pp. 159 - 164

SP2011-97
An MRHSMM-based conversational speech synthesis with controllability of paralinguistic information
Tomohiro Nagata, Hiroki Mori (Utsunomiya Univ.), Takashi Nose (Tokyo Tech)
pp. 179 - 184

SP2011-98
On the use of prosodic-event-based HMM in F0 generation of conversational speech
Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech)
pp. 185 - 190

SP2011-99
A Study on Speaker Independent Style Conversion in HMM Speech Synthesis
Hiroki Kanagawa, Takashi Nose, Takao Kobayashi (Tokyo Tech)
pp. 191 - 196

SP2011-100
A study on modeling phone duration using dynamic features for HMM-based speech synthesis
Takashi Nose, Takao Kobayashi (Tokyo Tech)
pp. 197 - 202

Note: Each article is a technical report without peer review, and its polished version will be published elsewhere.

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan