SP 2019-06-13
Kanagawa Tokyo Institute of Technology A study on style transplantation modeling techniques for DNN-based speech synthesis
Yoshiki Hiruta (Tokyo Tech), Tomoki Koriyama (The Univ. of Tokyo), Yuuki Tachioka (Denso IT Lab), Takao Kobayashi (Tokyo Tech) SP2019-1
This paper investigates style transplantation modeling techniques for DNN-based statistical parametric speech synthesis....
EA, SIP, SP 2019-03-14
Nagasaki i+Land nagasaki (Nagasaki-shi) A Study on Speech Synthesis Based on Deep Gaussian Processes and Latent Variable Representation of Accent
Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) EA2018-129 SIP2018-135 SP2018-91
Okinawa   On the Use of Deep Gaussian Processes for GPR-based Speech Synthesis
Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) EA2017-106 SIP2017-115 SP2017-89
This paper proposes a speech synthesis framework based on deep Gaussian processes (DGPs). DGP is a Bayesian deep learn...
SP, ASJ-H 2018-01-20
Tokyo The University of Tokyo A study on statistical speech synthesis based on GP-DNN hybrid model
Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) SP2017-67
We propose a novel approach to Gaussian process regression (GPR)-based speech synthesis in this paper. Since the conve...
Tokyo Waseda Univ. Green Computing Systems Research Organization [Invited Talk] Expressive Speech Synthesis: Approaches to Text-to-Speech with Diverse Voices and Styles
Takao Kobayashi (Tokyo Tech.) SP2017-64
As the performance of smart devices and information systems becomes higher, more advanced speech interfaces are requeste...
SP 2016-01-14
Kanagawa Sunpian Kawasaki Performance evaluation of CRF/HMM-based automatic accent labeling for speech synthesis
Rina Mashiko, Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) SP2015-85
We have proposed an accent type and phrase boundary estimation technique using acoustic and language models represented ...
SP 2015-08-21
Iwate Iwate Prefectural Univ. Performance Evaluation of Large-Scale Training Sentence Set Construction Based on Entropy in Statistical Speech Synthesis
Takashi Nose (Tohoku Univ.), Yusuke Arao (DNP), Takao Kobayashi (Tokyo Tech), Komei Sugiura, Yoshinori Shiga (NICT) SP2015-57
This paper reports the evaluation results of training sentence set construction based on entropy that we previously prop...
SP, IPSJ-MUS 2014-05-25
Tokyo   A Kana Protocol Recommendation Method for Switch Input Speech Synthesis Systems
Fuming Fang, Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech) SP2014-36
Switch-to-speech interface can provide a means of interactive speech communication as a support system for people with ...
SP 2014-01-23
Aichi Meijo Univ. A study on hyperparameter optimization for speech synthesis based on Gaussian process regression
Tomoki Koriyama (Tokyo Inst. of Tech.), Takashi Nose (Tohoku Univ.), Takao Kobayashi (Tokyo Inst. of Tech.) SP2013-99
SP, IPSJ-SLP 2013-12-20
Tokyo   Automatic Estimation of Accent Phrase Boundaries Using Language and Acoustic Models
Hiroshi Suzuki, Tomoki Koriyama (Tokyo Tech), Takashi Nose (Tohoku Univ.), Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech) SP2013-89
This paper proposes a technique for automatically estimating accent phrase boundaries for text-to-speech synthesis syste...
SP, IPSJ-SLP 2013-12-20
Tokyo   [Fellow Memorial Lecture] Toward Speech Synthesis with Diverse Voices and Styles: Approaches and Issues
Takao Kobayashi (Tokyo Tech.) SP2013-93
Recently, hidden Markov model-based (HMM-based) speech synthesis has been widely studied in the text-to-speech (TTS) syn...
SP 2013-01-31
Kyoto Doshisha Univ. A study on speaker-normalized style conversion for arbitrary speaker's expressive speech synthesis
Hiroki Kanagawa, Takashi Nose, Takao Kobayashi (Tokyo Inst. of Tech.) SP2012-110
This paper proposes a technique for improving naturalness of synthetic speech using a framework of speaker adaptive trai...
SP 2013-01-31
Kyoto Doshisha Univ. A Study on Style Control Based on Multiple-Regression HSMM for Synthesizing Singing Voices with Various Expressivity
Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Takao Kobayashi (Tokyo Inst. of Tech.) SP2012-111
This paper proposes a style control technique based on multiple regression HSMM (MRHSMM) for changing styles and their ...
SP 2013-01-31
Kyoto Doshisha Univ. A Study on Multi-class Local Prosodic Context for Expressive Prosody Generation
Yu Maeno, Takashi Nose, Takao Kobayashi, Tomoki Koriyama (Tokyo Inst. of Tech.), Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka (NTT) SP2012-112
This paper describes a technique for reproducing local prosodic variability which appears in expressive speech including...
SP 2012-11-08
Miyagi Ichibancho Lobby, Tohoku Institute of Technology Modeling of local variance of spectral features and its application to parameter generation in HMM-based speech synthesis
Takashi Nose, Vataya Chunwijitra, Takao Kobayashi (Tokyo Tech) SP2012-79
In this paper, we describe a technique for modeling local variance (LV) of speech features and propose a novel paramete...
SP 2012-06-14
Kanagawa NTT Atsugi R&D Center A Study on Automatic Prosodic Context Labeling for Emphatic Speech Synthesis
Yu Maeno, Takashi Nose, Takao Kobayashi (Tokyo Tech), Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka (NTT) SP2012-33
This paper describes automatic prosodic context labeling of training data for synthesizing expressive speech in HMM-base...
SP, NLC, IPSJ-SLP 2011-12-20
Tokyo   On the use of prosodic-event-based HMM in F0 generation of conversational speech
Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-53 SP2011-98
In this paper, we propose prosodic-event-based HMM for effectively modeling F0 pattern of spontaneous conversational sp...
SP, NLC, IPSJ-SLP 2011-12-20
Tokyo   A Study on Speaker Independent Style Conversion in HMM Speech Synthesis
Hiroki Kanagawa, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-54 SP2011-99
This paper proposes a technique for synthesizing speech of a desired style using speaker-independent style conversion in...
SP, NLC, IPSJ-SLP 2011-12-20
Tokyo   A study on modeling phone duration using dynamic features for HMM-based speech synthesis
Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-55 SP2011-100
This paper proposes a technique for modeling and generating phone durations using their dynamic features to improve pre...
EA, SIP, SP 2011-05-13
Osaka Ritsumeikan Univ. Performance evaluation of contexts for conversational speech synthesis using Corpus of Spontaneous Japanese
Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) EA2011-27 SIP2011-27 SP2011-27
This paper proposes an extended context set for generating the prosodic variability of spontaneous speech in HMM-based c...
