Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP |
2012-11-08 10:30 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Effects of speaker adaptive training on arbitrary speaker conversion based on tensor representation Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2012-72 |
In this paper, speaker adaptive training techniques are introduced to tensor-based arbitrary speaker conversion. In voic... [more] |
SP2012-72 pp.1-6 |
SP |
2012-11-08 11:00 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Implementation of real-time statistical voice conversion on a DSP Takuto Moriguchi, Tomoki Toda (NAIST), Motoaki Sano, Hiroshi Sato (FOSTER), Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2012-73 |
Real-time statistical voice conversion is one promising approach to developing technology for body-conducted s... [more] |
SP2012-73 pp.7-12 |
SP |
2012-11-08 11:30 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Intra-speaker spectral parameter variation between utterances of the same sentence and its prediction Tatsuo Inukai, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST) SP2012-74 |
In spectral conversion of statistical voice conversion technologies, distance measures between the converted and target ... [more] |
SP2012-74 pp.13-18 |
SP |
2012-11-08 13:00 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Measurement of temporal change of vocal tract volume during production of plosive and fricative consonants Tatsuya Kitamura, Hiroaki Hatano (Konan Univ.) SP2012-75 |
The volume of the vocal tract of a male speaker during production of voiced and voiceless plosives and fricatives was me... [more] |
SP2012-75 pp.19-23 |
SP |
2012-11-08 13:30 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
A study for estimating the length and sectional area of the vocal tract from formants using an acoustic sensitivity function Yuki Sakamoto, Tokihiko Kaburagi (Kyushu Univ.) SP2012-76 |
In this paper, we consider a method for estimating the vocal-tract area function from given formant frequencies. A sensi... [more] |
SP2012-76 pp.25-30 |
SP |
2012-11-08 14:00 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Analysis on speaker's intentions conveyed to the listeners by sentence-final particles and their intonations Kazuhiko Iwata, Tetsunori Kobayashi (Waseda Univ.) SP2012-77 |
We investigated the speaker's intentions that the listeners perceived depending on the combinations of a sentence-final ... [more] |
SP2012-77 pp.31-36 |
SP |
2012-11-08 14:45 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Improvements of HMM-based speech synthesis using rich context models Shinnosuke Takamichi, Tomoki Toda (NAIST), Yoshinori Shiga (NICT), Sakriani Sakti, Graham Neubig, Satoshi Nakamura (NAIST) SP2012-78 |
In traditional HMM-based speech synthesis, generated speech parameters tend to be excessively smoothed. To allevia... [more] |
SP2012-78 pp.37-42 |
SP |
2012-11-08 15:15 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Modeling of local variance of spectral features and its application to parameter generation in HMM-based speech synthesis Takashi Nose, Vataya Chunwijitra, Takao Kobayashi (Tokyo Tech) SP2012-79 |
In this paper, we describe a technique for modeling local variance (LV) of speech features and propose a novel paramete... [more] |
SP2012-79 pp.43-48 |
SP |
2012-11-08 16:00 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Prosody Generation based on HMM using Two-stage Clustering Yasuyuki Mitsui, Reishi Kondo, Masanori Kato (NEC) SP2012-80 |
HMM-based speech synthesis can generate highly natural prosody, but there is a problem that the pitch patterns represent... [more] |
SP2012-80 pp.49-54 |
SP |
2012-11-08 16:30 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
One-click vocal tract mapping interface and its applications to signal conversion Kohichi Ogata, Kentaro Yamashita (Kumamoto Univ.) SP2012-81 |
This paper describes the development of a vocal tract mapping interface to produce vowel sounds. The interface is a kind... [more] |
SP2012-81 pp.55-60 |
SP |
2012-11-08 17:00 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
Three-dimensional rectangular vocal-tract model representing the characteristics of asymmetric vocal-tract wall impedances Kunitoshi Motoki (Hokkai-Gakuen Univ.) SP2012-82 |
A method to compute the acoustic characteristics of a simplified three-dimensional vocal-tract model with wall impedance... [more] |
SP2012-82 pp.61-66 |
SP |
2012-11-09 09:45 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
[Invited Talk] Speech synthesis -- Modeling human-like speech output capabilities -- Yoshinori Sagisaka (Waseda Univ.) SP2012-83 |
Research activities in speech synthesis are introduced on four topics: (1) Tone control, (2) Timing control, (3) Corpus-base... [more] |
SP2012-83 pp.67-72 |
SP |
2012-11-09 11:00 |
Miyagi |
Ichibancho Lobby, Tohoku Institute of Technology |
[Invited Talk] Trend of Speech Production Research and its Aspects for Speech Synthesis Masaaki Honda (Waseda Univ.) SP2012-84 |
|
SP2012-84 pp.73-78 |