Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 09:55 |
Tokyo |
NTT Musashino R&D |
Hard-Attentional Neural Network Models for Emphasis Speech Translation Quoc Truong Do, Sakriani Sakti, Graham Neubig, Satoshi Nakamura (NAIST) SP2016-50 |
Traditional speech translation systems are oblivious to paralinguistic information. A recent work has tried to tackle th... [more] |
SP2016-50 pp.7-8 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 10:20 |
Tokyo |
NTT Musashino R&D |
Constructing a Japanese multimodal corpus from emotional monologues and dialogues Nurul Lubis (NAIST), Randy Gomez (HRI), Sakriani Sakti (NAIST), Keisuke Nakamura (HRI), Koichiro Yoshino, Satoshi Nakamura (NAIST), Kazuhiro Nakadai (HRI) SP2016-51 |
To fully incorporate emotion into human-computer interaction, rich sets of labeled emotional data is prerequisite. Howev... [more] |
SP2016-51 pp.9-10 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 11:20 |
Tokyo |
NTT Musashino R&D |
Speaker Recognition Based on Features through 1-Dimensional Convolutional Neural Network Shohei Sonoda, Yufu Kasahara, Masato Inoue (Waseda Univ) SP2016-52 |
Most of the speaker recognition methods utilize the voice features of the mel-frequency cepstrum coefficients (MFCCs) an... [more] |
SP2016-52 pp.17-21 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 11:45 |
Tokyo |
NTT Musashino R&D |
Study on i-vector based speaker verification using rank for short utterances Misaki Tsujikawa (Panasonic/Sokendai), Tsuyoki Nishikawa (Panasonic), Tomoko Matsui (ISM) SP2016-53 |
Generally, short utterance test data seriously degrades the accuracy of speaker verification. However, in many voice-ope... [more] |
SP2016-53 pp.23-26 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Improvement of accent sandhi rules based on accent dictionary for Japanese text-to-speech systems Hiroto Aoyama, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-54 |
In order to synthesize more natural speech in Japanese text-to-speech systems, we improved accent sandhi rules. Conventi... [more] |
SP2016-54 pp.31-36 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
F0 control by modeling differential features in DNN-based speech synthesis Shuhei Yamada, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-55 |
We have been developing ``tailor-made speech synthesis,'' a framework which enables users to modify synthetic speech nat... [more] |
SP2016-55 pp.37-42 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Fast algorithm for statistical phrase/accent command estimation based on generative model incorporating spectral features Ryotaro Sato (The Univ. of Tokyo), Hirokazu Kameoka, Kunio Kashino (NTT) SP2016-56 |
On the basis of the Fujisaki model, we propose a fast algorithm for estimating the model parameters, namely, the timings... [more] |
SP2016-56 pp.43-48 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Development of the Julius-compatible interface for the speech recognition engine of Kaldi toolkit Yusuke Yamada, Takashi Nose, Yuya Chiba, Akinori Ito (Tohoku Univ.) SP2016-57 |
[more] |
SP2016-57 pp.49-51 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Deep Neural Network Using Fundamental Frequency For Noise Robust Speaker Recognition Yoshihiro Suzuki, Yosuke Sugiura, Tetsuya Shimamura (Saitama Univ.) SP2016-58 |
In this paper, we propose a neural network architecture for speaker recognition to simplify learning process. In the pro... [more] |
SP2016-58 pp.53-56 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Quantization Noise Reduction of Speech by Using Denoising Auto-encoder Shohei Oouchi, Kazunori Mano (SIT) SP2016-59 |
A quantization noise reduction technique based on Denoising Auto-encoder (DAE) was studied. DAE is a neural network to c... [more] |
SP2016-59 pp.57-58 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
Remote Sensing Applied to the Input Interface of Spoken Dialogue System Makoto Koike (MK Microwave Researh) SP2016-60 |
I propose herein that remote sensing is applied to the input interface of spoken language systems. The remote sensing r... [more] |
SP2016-60 pp.59-64 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 16:40 |
Tokyo |
NTT Musashino R&D |
Generative Adversarial Network-based Postfiltering for Statistical Parametric Speech Synthesis Takuhiro Kaneko, Hirokazu Kameoka, Nobukatsu Hojo, Yusuke Ijima, Kaoru Hiramatsu, Kunio Kashino (NTT) SP2016-61 |
In the field of speech synthesis, statistical parametric speech synthesis has been widely used due to the flexibility an... [more] |
SP2016-61 pp.89-94 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-21 09:20 |
Tokyo |
NTT Musashino R&D |
Proposal of a robust and high-performance F0 estimator and its evaluation Masanori Morise (Univ. of Yamanashi) SP2016-62 |
[more] |
SP2016-62 pp.107-112 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-21 09:45 |
Tokyo |
NTT Musashino R&D |
Deep Learning-based ASR using Cochleogram and Spectrogram Features Combination Andros Tjandra, Sakriani Sakti (NAIST), Mirna Adriani (UI), Satoshi Nakamura (NAIST) SP2016-63 |
[more] |
SP2016-63 pp.113-114 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-21 10:45 |
Tokyo |
NTT Musashino R&D |
[Invited Talk]
Towards Fusion of Speech and Natural Language Processing Research
-- Past and Future of Speech Translation Research -- Satoshi Nakamura (NAIST) SP2016-64 |
(To be available after the conference date) [more] |
SP2016-64 pp.121-122 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-21 13:15 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Design and analysis of dialogue acts for the Kyutech corpus Masato Hino, Takasi Yamamura, Kazutaka Shimada (Kyutech) NLC2016-29 |
Conversation understanding is one of the most important tasks in natural language processing. Recently, many researchers... [more] |
NLC2016-29 pp.1-6 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-21 13:15 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Toward the effective collection of dialogues using gamification Shin Kanouchi, Mamoru Komachi (TMU) NLC2016-30 |
In non-task-oriented dialogue systems, researches and developments using large-scale dialogue data have been thriving.
... [more] |
NLC2016-30 pp.7-12 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-21 13:15 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Constructing taxonomy of implicit information in dialogue by multiple working groups Koh Mitsuda, Ryuichiro Higashinaka, Yoshihiro Matsuo (NTT) NLC2016-31 |
[more] |
NLC2016-31 pp.13-18 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-21 14:45 |
Tokyo |
NTT Musashino R&D |
[Invited Lecture]
EMNLP 2016 Report (1) Kei Wakabayashi (Univ. of Tsukuba) NLC2016-32 |
[more] |
NLC2016-32 p.51 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-21 15:05 |
Tokyo |
NTT Musashino R&D |
[Invited Lecture]
International Conference Report EMNLP 2016 (2) Sho Takase (Tohoku Univ.) NLC2016-33 |
(To be available after the conference date) [more] |
NLC2016-33 p.53 |