Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-SLP (Joint) |
2015-07-16 13:30 |
Nagano |
Katakura Suwako Hotel |
Acoustic data-driven pronunciation lexicon for non-native speech recognition Satoshi Tsujioka (NAIST), Liang Lu (University of Edinburgh), Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura (NAIST) SP2015-36 |
Nowadays, the English language is often used as a tool to facilitate communication at international meetings. Consequent... [more] |
SP2015-36 pp.1-6 |
SP, IPSJ-SLP (Joint) |
2015-07-16 14:00 |
Nagano |
Katakura Suwako Hotel |
A Spoken term detection method matching at a frame level Ryota Konno, Kazunori Kojima (IPU), Shi-wook Lee (AIST), Kazuyo Tanaka (Univ. of Tsukuba), Yoshiaki Itoh (IPU) SP2015-37 |
This paper proposes a spoken term detection method matching at a frame level. [more] |
SP2015-37 pp.7-12 |
SP, IPSJ-SLP (Joint) |
2015-07-16 15:10 |
Nagano |
Katakura Suwako Hotel |
A study on discriminative approach for estimation of the divergence between distributions and its application to language identification Yosuke Kashiwagi, Congying Zhang, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) SP2015-38 |
In this paper, we propose a method for estimating the statistical divergence between probability distributions by a disc... [more] |
SP2015-38 pp.13-18 |
SP, IPSJ-SLP (Joint) |
2015-07-16 16:20 |
Nagano |
Katakura Suwako Hotel |
Sequence Discriminative Training for Low-Rank Deep Neural Networks Yuuki Tachioka (Mitsubishi Electric), Shinji Watanabe, Jonathan Le Roux, John Hershey (MERL) SP2015-39 |
Deep neural network (DNN) acoustic models outperform conventional Gaussian mixture model (GMM) but the number of paramet... [more] |
SP2015-39 pp.19-24 |
SP, IPSJ-SLP (Joint) |
2015-07-16 16:50 |
Nagano |
Katakura Suwako Hotel |
A Feature-Space Adaptation Technique using Regression Tree-based Multiple Transformation Matrices Hiroki Kanagawa, Yuuki Tachioka (Mitsubishi Electric Corp.), Shinji Watanabe (MERL), Jun Ishii (Mitsubishi Electric Corp.) SP2015-40 |
(To be available after the conference date) [more] |
SP2015-40 pp.25-30 |
SP, IPSJ-SLP (Joint) |
2015-07-16 17:20 |
Nagano |
Katakura Suwako Hotel |
Experimental evaluation of network size effect in speaker adaptive trained DNNs embedding linear transformation networks Tsubasa Ochiai (Doshisha Univ./NICT), Shigeki Matsuda (Doshisha Univ.), Hideyuki Watanabe, Xugang Lu, Hisashi Kawai (NICT), Shigeru Katagiri (Doshisha Univ.) SP2015-41 |
Recently we proposed a novel speaker adaptation method that applied the Speaker Adaptive Training
(SAT) concept to DNN-... [more] |
SP2015-41 pp.31-36 |
SP, IPSJ-SLP (Joint) |
2015-07-16 17:50 |
Nagano |
Katakura Suwako Hotel |
Speaker Adaptation Technique for Speech Recognition using a Feature Augmentation Framework Hiroshi Fujimura, Takashi Masuko (TOSHIBA) SP2015-42 |
Deep Neural Networks (DNNs) are powerful machine learning models.Nevertheless, the performance degrades for out-of domai... [more] |
SP2015-42 pp.37-42 |
SP, IPSJ-SLP (Joint) |
2015-07-17 09:00 |
Nagano |
Katakura Suwako Hotel |
Spoken Language Identification based on Language Modeling of Tandem-MLP Features Ryo Masumura, Taichi Asami, Hirokazu Masataki, Sumitaka Sakauchi (NTT) SP2015-43 |
[more] |
SP2015-43 pp.43-48 |
SP, IPSJ-SLP (Joint) |
2015-07-17 09:30 |
Nagano |
Katakura Suwako Hotel |
Multiple Feed-forward Deep Neural Networks for Statistical Parametric Speech Synthesis Shinji Takaki (NII), SangJin Kim (Naver Labs), Junichi Yamagishi (NII), JongJin Kim (Naver Labs) SP2015-44 |
In this paper, we investigate a combination of several feed-forward deep neural networks (DNNs) for a high-quality stati... [more] |
SP2015-44 pp.49-54 |
SP, IPSJ-SLP (Joint) |
2015-07-17 10:10 |
Nagano |
Katakura Suwako Hotel |
[Invited Talk]
Image feature extraction and transfer learning using deep convolutional neural networks Hideki Nakayama (Univ. of Tokyo) SP2015-45 |
Convolutional neural network (CNN) has attracted more and more attention for its remarkable performance in visual recogn... [more] |
SP2015-45 pp.55-59 |
SP, IPSJ-SLP (Joint) |
2015-07-17 11:10 |
Nagano |
Katakura Suwako Hotel |
[Invited Talk]
Aspects of feature extraction in DNN acoustic models Takuya Yoshioka, Marc Delcroix, Masakiyo Fujimoto, Tomohiro Nakatani (NTT) SP2015-46 |
Since the advent of acoustic models based on deep neural networks (DNNs), a vast amount of efforts have been made to fur... [more] |
SP2015-46 pp.61-65 |
SP, IPSJ-SLP (Joint) |
2015-07-17 13:10 |
Nagano |
Katakura Suwako Hotel |
A study on effectiveness of pop noise for speaker verification Shiori Nakano, Ryosuke Nakanishi, Sayaka Shiota, Hitoshi Kiya (Tokyo Metro Univ.) SP2015-47 |
This paper investigates an effectiveness of pop noise, which is unconsciously caused by human breath, for automatic spea... [more] |
SP2015-47 pp.67-72 |
SP, IPSJ-SLP (Joint) |
2015-07-17 13:40 |
Nagano |
Katakura Suwako Hotel |
Voice liveness detection based on frequency characteristics for speaker verification Sayaka Shiota (Tokyo Metro. Univ.), Fernando Villaviencio, Junichi Yamagishi, Nobutaka Ono, Isao Echizen (NII), Tomoko Matsui (ISM) SP2015-48 |
[more] |
SP2015-48 pp.73-78 |
SP, IPSJ-SLP (Joint) |
2015-07-17 14:10 |
Nagano |
Katakura Suwako Hotel |
Investigation of privacy-preserving sounds to degrade automatic speaker verification performance Kei Hashimoto (NITECH), Junichi Yamagishi, Isao Echizen (NII) SP2015-49 |
Sharing speech without permission and identifying the individual from the speech by speaker recognition lead to problems... [more] |
SP2015-49 pp.79-84 |