Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-MUS |
2014-05-24 08:50 |
Tokyo |
|
"Ongaku" Symposium 2014: The 2nd Symposium on Any Topics Related to Acoustics, Audition and Natural Language Hirokazu Kameoka (Univ. of Tokyo/NTT), Eriko Aiba (UEC), Yasunori Ohishi (NTT), Tetsuro Kitahara (Nihon Univ.), Tatsuya Kitamura (Konan Univ.), Shoei Sato (NHK), Masahito Togami (Hitachi), Tomoki Toda (NAIST), Kazuyoshi Yoshii (Kyoto Univ.) SP2014-1 |
[more] |
SP2014-1 pp.1-3 |
SP, IPSJ-MUS |
2014-05-24 09:00 |
Tokyo |
|
[Invited Talk]
Speaker adaptation technologies for speech synthesis and its application to assistive technology Junichi Yamagishi (NII) SP2014-2 |
[more] |
SP2014-2 pp.5-6 |
SP, IPSJ-MUS |
2014-05-24 10:30 |
Tokyo |
|
[Invited Talk]
Infinite data analysis and Bayesian nonparametrics for audio signal processing Masahiro Nakano (NTT) SP2014-3 |
Bayesian nonparametrics has been an active area of research. Although we can construct a variety of models based on famou... [more] |
SP2014-3 pp.13-18 |
SP, IPSJ-MUS |
2014-05-24 15:30 |
Tokyo |
|
[Invited Talk]
From multimodal spatial hearing to engineering applications to cope with severe disasters
-- Our recent research results on spatial acoustic information sciences -- Yo-iti Suzuki, Shuichi Sakamoto (Tohoku Univ.) SP2014-4 |
[more] |
SP2014-4 pp.19-20 |
SP, IPSJ-MUS |
2014-05-25 09:00 |
Tokyo |
|
[Invited Talk]
Behavioral neurosciences of vocal control and learning
-- using the songbird as a model system -- Ryosuke O. Tachibana (Univ. of Tokyo) SP2014-5 |
This article provides an introductory review of behavioral neuroscience studies on vocal control and learning of songbir... [more] |
SP2014-5 pp.29-34 |
SP, IPSJ-MUS |
2014-05-25 09:45 |
Tokyo |
|
[Invited Talk]
Machine Translation
-- Why couldn't we do it? Why are we starting to be able to now? -- Graham Neubig (NAIST) SP2014-6 |
While machine translation has been a long-held dream dating back to the appearance of the first computers, for many year... [more] |
SP2014-6 pp.35-38 |
SP, IPSJ-MUS |
2014-05-25 10:30 |
Tokyo |
|
[Invited Talk]
Applications and Advances of Deep Learning for Automatic Speech Recognition Yotaro Kubo (Amazon) SP2014-7 |
In this presentation, I will introduce recent advances in deep learning methods, and several instances of application of... [more] |
SP2014-7 pp.39-44 |
SP, IPSJ-MUS |
2014-05-25 15:30 |
Tokyo |
|
[Invited Talk]
R&D of Music Information Retrieval Technology and Issues for its Deployment to Practical Applications Keiichiro Hoashi (KDDI Labs) SP2014-8 |
[more] |
SP2014-8 p.45 |
SP, IPSJ-MUS |
2014-05-25 16:15 |
Tokyo |
|
[Invited Talk]
What Higher-Order Statistics Tell Us?
-- Acoustic Signal Processing Based on Unsupervised Learning -- Hiroshi Saruwatari (Univ. of Tokyo) SP2014-9 |
[more] |
SP2014-9 pp.47-52 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
A Consideration of Evaluation Measurements in Spoken Term Detection Satoshi Oshima, Yoshiaki Itoh (Iwate Prefectural Univ.) SP2014-10 |
[more] |
SP2014-10 pp.117-121 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Robustness of Speaker Identification Using Pseudo Pitch Synchronized Phase Information Yuta Kawakami, Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.), Seiichi Nakagawa (Toyohashi Univ. of Tech.) SP2014-11 |
The phase information is useful for the speaker recognition task, but MFCC ignores that. In this work, we conducted spea... [more] |
SP2014-11 pp.123-126 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Visualization of World Englishes pronunciations from a speaker's self-centered viewpoint using attributes of accent, gender, and age Yuji Kawase, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (UTokyo), Han-Ping Shen (NCKU) SP2014-12 |
English is the only language available for global communication and is known to have a large diversity of pronunciations... [more] |
SP2014-12 pp.127-132 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Native language recognition using machine learning Ryota Sakagami, Kouki Takeshita, Longbiao Wang, Masahiro Iwahashi (Nagaoka Univ. of Tech.) SP2014-13 |
The difference in pronunciation occurs in a non-native speaker and a native speaker. Therefore, communication is difficu... [more] |
SP2014-13 pp.139-141 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Language recognition in reverberant environments Kouki Takeshita, Ryota Sakagami, Longbiao Wang, Masahiro Iwahashi (Nagaoka Univ. of Tech.) SP2014-14 |
[more] |
SP2014-14 pp.143-145 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Discriminative training of acoustic models for system combination Yuuki Tachioka (Mitsubishi Electric), Shinji Watanabe, Jonathan Le Roux, John R. Hershey (MERL) SP2014-15 |
In discriminative training methods, the objective function is designed to improve the performance of automatic speech re... [more] |
SP2014-15 pp.147-152 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Distant-talking Speech Recognition with Asynchronous Speech Recording Shunta Teraoka, Yuma Ueda (Shizuoka Univ.), Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai, Taku Fukushima (Shizuoka Univ.) SP2014-16 |
Although applications using mobile terminals have attracted increasing attention, there are few studies that focus on di... [more] |
SP2014-16 pp.153-157 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
[Research Introduction] A spectrogram-patch-input DNN model for detection and classification of acoustic events robust to speech overlapping scenarios Miquel Espi, Masakiyo Fujimoto, Yotaro Kubo, Tomohiro Nakatani (NTT) SP2014-17 |
This paper presents an acoustic event detection and classification method that learns features from spectrogram patches ... [more] |
SP2014-17 pp.171-176 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Development of environmental sound collection system using smart devices based on crowd-sourcing approach Sunao Hara, Akinori Kasai, Masanobu Abe (Okayama Univ.), Noboru Sonehara (NII) SP2014-18 |
In this study, we aimed to construct environmental sound database for various sounds as Wisdom of the crowds. For exampl... [more] |
SP2014-18 pp.177-180 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
ROCKON: Environmental sound collection and recognition system using smartphones Minori Matsuyama, Takahiko Tsuda, Ryuichi Nisimura, Hideki Kawahara (Wakayama Univ.), Junnosuke Yamada (NTT), Toshio Irino (Wakayama Univ.) SP2014-19 |
[more] |
SP2014-19 pp.181-186 |
SP, IPSJ-MUS |
2014-05-24 11:30 |
Tokyo |
|
Underdetermined Blind Separation of Moving Sources Based on Probabilistic Modeling Takuya Higuchi, Norihiro Takamune, Tomohiko Nakamura (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT) SP2014-20 |
This paper deals with the problem of the underdetermined blind separation and tracking of moving sources. In practical s... [more] |
SP2014-20 pp.211-216 |