Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, SIP, EA |
2017-03-02 09:00 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
Acoustic-to-articulatory inversion mapping with variational latent trajectory Gaussian mixture model Patrick Lumban Tobing (Nagoya Univ.), Hirokazu Kameoka (NTT), Tomoki Toda (Nagoya Univ.) EA2016-134 SIP2016-189 SP2016-129 |
[more] |
EA2016-134 SIP2016-189 SP2016-129 pp.291-296 |
SP |
2016-08-24 13:00 |
Kyoto |
ACCMS, Kyoto Univ. |
Adaptation Methods for Daily Activity Recognition Based on Deep Neural Network Tomoki Hayashi (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2016-27 |
Our objective is to build a monitoring system which enables elderly people to live actively, and the key technology to a... [more] |
SP2016-27 pp.1-6 |
SP |
2016-08-24 13:25 |
Kyoto |
ACCMS, Kyoto Univ. |
Daily Activity Recognition Based on Recurrent Neural Network Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2016-28 |
Our goal is to build an automatic surveillance system for elderly people and the core technique is daily activity recogn... [more] |
SP2016-28 pp.7-12 |
EA, SP, SIP |
2016-03-28 09:00 |
Oita |
Beppu International Convention Center B-ConPlaza |
Method of presenting sound signals for selective listening system Tomomi Suzuki (Nagoya Univ.), Takanori Nishino (Mie Univ.), Yoshio Ishiguro, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) EA2015-66 SIP2015-115 SP2015-94 |
We are able to selectively concentrate on the desired acoustic information in the environmental sounds.
The aim of this... [more] |
EA2015-66 SIP2015-115 SP2015-94 pp.1-6 |
EA, SP, SIP |
2016-03-28 13:15 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
An evaluation of F0 transformation for statistical singing voice conversion based on spectral differential filtering Kazuhiro Kobayashi (NAIST), Tomoki Toda (Nagoya Univ./NAIST), Satoshi Nakamura (NAIST) EA2015-84 SIP2015-133 SP2015-112 |
In this report, we propose a technique for cross-gender statistical singing voice conversion (SVC) with direct waveform ... [more] |
EA2015-84 SIP2015-133 SP2015-112 pp.105-110 |
EA, SP, SIP |
2016-03-28 13:15 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
An evaluation of acoustic-to-articulatory inversion mapping with latent trajectory Gaussian mixture model Patrick Lumban Tobing (NAIST), Tomoki Toda (Nagoya Univ./NAIST), Hirokazu Kameoka (NTT), Satoshi Nakamura (NAIST) EA2015-85 SIP2015-134 SP2015-113 |
In this report, we present an evaluation of acoustic-to-articulatory inversion mapping based on latent trajectory
Gauss... [more] |
EA2015-85 SIP2015-134 SP2015-113 pp.111-116 |
EA, SP, SIP |
2016-03-28 13:15 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
Nonaudible murmur enhancement based on non-negative tensor factorization of air- and body-conducted signals in real environments Yusuke Tajiri (NAIST), Hirokazu Kameoka (NTT), Tomoki Toda (Nagoya Univ./NAIST), Satoshi Nakamura (NAIST) EA2015-86 SIP2015-135 SP2015-114 |
Nonaudible murmur (NAM) recorded with a special body-conductive microphone called NAM microphone is one of the promising... [more] |
EA2015-86 SIP2015-135 SP2015-114 pp.117-122 |
EA, SP, SIP |
2016-03-29 14:40 |
Oita |
Beppu International Convention Center B-ConPlaza |
Product-of-Experts approach to integration of F0 generative process model to statistical F0 prediction for electrolaryngeal speech enhancement Kou Tanaka (NAIST), Hirokazu Kameoka (NTT), Tomoki Toda (The University of Nagoya/NAIST), Satoshi Nakamura (NAIST) EA2015-133 SIP2015-182 SP2015-161 |
We have previously proposed a statistical fundamental frequency (F0) prediction method that makes it possible to predict... [more] |
EA2015-133 SIP2015-182 SP2015-161 pp.373-377 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-02 11:15 |
Aichi |
Nagoya Inst of Tech. |
Evaluation and Analysis of Duration Correction for Non-Native Speech Based on Waveform Modification Shinya Kura, Shinnosuke Takamichi (NAIST), Tomoki Toda (NAIST/Nagoya Univ.), Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2015-73 |
There are several attempts at correcting durational patterns of non-native speech towards language learning. One of the ... [more] |
SP2015-73 pp.19-24 |
EA, EMM |
2015-11-12 17:00 |
Kumamoto |
Kumamoto Univ. |
Noise suppression method for body-conducted soft speech based on external noise monitoring Yusuke Tajiri (NAIST), Tomoki Toda (Nagoya Univ.), Satoshi Nakamura (NAIST) EA2015-31 EMM2015-52 |
As one of the silent speech interfaces, nonaudible murmur (NAM) microphone has been developed for detecting an extremely... [more] |
EA2015-31 EMM2015-52 pp.41-46 |
SP |
2015-10-15 13:25 |
Hyogo |
Kobe Univ. |
Statistical singing voice conversion based on direct waveform modification and its parameter generation algorithms Kazuhiro Kobayashi, Tomoki Toda, Satoshi Nakamura (NAIST) SP2015-60 |
This report presents a novel statistical singing voice conversion (SVC) technique with direct waveform modification base... [more] |
SP2015-60 pp.7-12 |
EA, ASJ-H |
2015-08-03 13:00 |
Miyagi |
Tohoku Univ., Research Inst. of Electrical Communication |
Aliasing reduction in L-F model implementation for an interactive tool applicable to speech science education Hideki Kawahara (Wakayama Univ.), Ken-Ichi Sakakibara (Health Sciences Univ. Hokkaido), Hideki Banno (Meijo Univ.), Masanori Morise (Univ. Yamanashi), Tomoki Toda (NAIST), Toshio Irino (Wakayama Univ.) EA2015-10 |
[more] |
EA2015-10 pp.1-6 |
SP, IPSJ-SLP (Joint) |
2015-07-16 13:30 |
Nagano |
Katakura Suwako Hotel |
Acoustic data-driven pronunciation lexicon for non-native speech recognition Satoshi Tsujioka (NAIST), Liang Lu (University of Edinburgh), Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura (NAIST) SP2015-36 |
Nowadays, the English language is often used as a tool to facilitate communication at international meetings. Consequent... [more] |
SP2015-36 pp.1-6 |
WIT, SP, ASJ-H, PRMU |
2015-06-18 15:15 |
Niigata |
|
Implementation of Direct F0 Control of an Electrolarynx based on Real-time Excitation Prediction Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) PRMU2015-40 SP2015-9 WIT2015-9 |
[more] |
PRMU2015-40 SP2015-9 WIT2015-9 pp.47-52 |
WIT, SP, ASJ-H, PRMU |
2015-06-18 14:25 |
Niigata |
|
Non-Audible Murmur Enhancement Method using Air- and Body-Conductive Microphones in Noisy Environments and its Evaluation Yusuke Tajiri, Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) PRMU2015-42 SP2015-11 WIT2015-11 |
As one of the silent speech interfaces, Non-Audible Murmur (NAM) microphone which can detect an extremely soft whispered... [more] |
PRMU2015-42 SP2015-11 WIT2015-11 pp.59-64 |
SIP, EA, SP |
2015-03-02 11:15 |
Okinawa |
|
Modulation spectrum-constrained trajectory training algorithm for statistical parametric speech synthesis Shinnosuke Takamichi (NAIST/CMU), Tomoki Toda (NAIST), Alan W. Black (CMU), Satoshi Nakamura (NAIST) EA2014-77 SIP2014-118 SP2014-140 |
[more] |
EA2014-77 SIP2014-118 SP2014-140 pp.31-36 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-15 09:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Recognition and Analysis of Emotion in Indonesian Conversational Speech Nurul Lubis, Sakriani Sakti, Graham Neubig, Tomoki Toda (NAIST), Dessi Lestari, Ayu Purwarianti (ITB), Satoshi Nakamura (NAIST) SP2014-106 |
The importance of incorporating emotional aspect in human computer interaction continues to arise. Unfortunately, explor... [more] |
SP2014-106 pp.1-6 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Articulatory Controllable Speech Modification using Sequential Inversion and Production Mapping with Gaussian Mixture Models Patrick Lumban Tobing, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST), Ayu Purwarianti (ITB) SP2014-111 |
In this report, we propose an articulatory controllable speech modification framework using statistical inversion and pr... [more] |
SP2014-111 pp.57-62 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
Prosody Correction Preserving Speaker Individuality in English-Read-By-Japanese Speech Synthesis Based on HMM Yuji Oshima, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2014-112 |
To build an English acoustic model that well captures speaker individuality of each Japanese speaker, a framework using ... [more] |
SP2014-112 pp.63-68 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 11:00 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
HMM-Based Speech Synthesis System with Prosody Modification Based on Speech Input Yuri Nishigaki, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2014-115 |
As a creative activity using speech synthesis technologies has been grown rapidly, it is desired to develop an interface... [more] |
SP2014-115 pp.81-86 |