IEICE Technical Committee Submission System
Conference Schedule
Online Proceedings
[Sign in]
Tech. Rep. Archives
    [Japanese] / [English] 
( Committee/Place/Topics  ) --Press->
 
( Paper Keywords:  /  Column:Title Auth. Affi. Abst. Keyword ) --Press->

All Technical Committee Conferences  (Searched in: All Years)

Search Results: Conference Papers
 Conference Papers (Available on Advance Programs)  (Sort by: Date Descending)
 Results 1 - 20 of 22  /  [Next]  
Committee Date Time Place Paper Title / Authors Abstract Paper #
SP, IPSJ-SLP, EA, SIP [detail] 2023-02-28
10:10
Okinawa
(Primary: On-site, Secondary: Online)
Singing voice synthesis based on a frame-driven attention mechanism considering vocal timing deviation
Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (NITech) EA2022-78 SIP2022-122 SP2022-42
This paper proposes singing voice synthesis (SVS) based on a frame-driven attention mechanism considering vocal timing d... [more] EA2022-78 SIP2022-122 SP2022-42
pp.19-24
NLC, IPSJ-NL, SP, IPSJ-SLP
(Joint) [detail]
2019-12-06
13:55
Tokyo NHK Science & Technology Research Labs. [Poster Presentation] Synthetic speech-based sound masking for privacy protection when speaking to smartphones in public space
Takahiro Tsugui, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2019-38
In this paper, we propose a synthetic speech-based sound masking method that protects the privacy when speaking to smart... [more] SP2019-38
pp.55-60
NLC, IPSJ-NL, SP, IPSJ-SLP
(Joint) [detail]
2019-12-06
16:00
Tokyo NHK Science & Technology Research Labs. A comparison of neural vocoders in singing voice synthesis
Sota Wada, Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2019-42
In this study, we compare five types of vocoders based on neural networks (neural vocoders) for singing voice synthesis.... [more] SP2019-42
pp.85-90
PRMU, SP 2018-06-29
11:00
Nagano   Speaker adaptation in speech synthesis based on neural networks including temporal structure modeling
Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT) PRMU2018-31 SP2018-11
This paper proposes a speaker adaptation technique for speech synthesis based on deep neural networks (DNNs) using a str... [more] PRMU2018-31 SP2018-11
pp.53-58
SP, ASJ-H 2018-01-20
14:55
Tokyo The University of Tokyo [Poster Presentation] TRAJECTORY TRAINING CONSIDERING POWER FOR SPEECH SYNTHESIS BASED ON NEURAL NETWORKS
Ryohei Funato, Kei Hashimoto, keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2017-74
In statistical parametric speech synthesis, a relation between acoustic features and linguistic features is modeled by s... [more] SP2017-74
pp.43-48
SP, ASJ-H 2018-01-21
15:35
Tokyo The University of Tokyo Mel-cepstrum based quantization noise shaping applied to speech synthesis based on WaveNet
Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2017-83
This paper proposes a mel-cepstrum based quantization noise shaping for improving the quality of synthetic speech genera... [more] SP2017-83
pp.93-98
SP, ASJ-H 2018-01-21
16:00
Tokyo The University of Tokyo A study on voice conversion based on WaveNet
Jumpei Niwa, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT) SP2017-84
This paper proposes a voice conversion technique based on WaveNet to directly generate target audio waveforms from acous... [more] SP2017-84
pp.99-104
SP 2017-01-21
11:00
Tokyo The University of Tokyo [Poster Presentation] Designing linguistic features for expressive speech synthesis using audiobooks
Chiaki Asai, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2016-70
In order to synthesize expressive speech, various statistical parametric speech synthesis systems have been proposed. Sp... [more] SP2016-70
pp.35-40
SP 2017-01-21
16:35
Tokyo The University of Tokyo Simultaneous modeling of acoustic feature sequences and its temporal structures for DNN-based speech synthesis
Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2016-76
In statistical parametric speech synthesis, a hidden Markov model (HMM) is widely used as an acoustic model. Recently, d... [more] SP2016-76
pp.71-76
PRMU, SP, WIT, ASJ-H 2016-06-13
09:30
Tokyo   Image recognition based on discriminative models using features generated from separable lattice HMMs
Yoshinari Tsuzuki, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2016-36 SP2016-2 WIT2016-2
One of the major problems in image recognition is degradation in the recognition performance caused by geometric variati... [more] PRMU2016-36 SP2016-2 WIT2016-2
pp.7-12
PRMU, CNR 2016-02-21
14:00
Fukuoka   Parameter sharing structures of separable lattice HMMs using mixture output distributions for image recognition
Masato Sukegawa, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2015-138 CNR2015-39
In image recognition systems, it is important to deal with geometrical variations such as size and location. Separable l... [more] PRMU2015-138 CNR2015-39
pp.37-42
NLC, IPSJ-NL, SP, IPSJ-SLP
(Joint) [detail]
2015-12-03
09:00
Aichi Nagoya Inst of Tech. Evaluation of text-to-speech system construction for unknown-pronunciation languages
Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2015-80
This paper discusses a method to construction of text-to-speech (TTS) systems for unknown-pronunciation languages. There... [more] SP2015-80
pp.93-98
SP, IPSJ-SLP
(Joint)
2015-07-17
14:10
Nagano Katakura Suwako Hotel Investigation of privacy-preserving sounds to degrade automatic speaker verification performance
Kei Hashimoto (NITECH), Junichi Yamagishi, Isao Echizen (NII) SP2015-49
Sharing speech without permission and identifying the individual from the speech by speaker recognition lead to problems... [more] SP2015-49
pp.79-84
SP 2014-01-23
16:00
Aichi Meijo Univ. Speaker recognition based on log-linear models using feature generation by variational Bayesian method
Akifumi Tsuge, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2013-98
This paper presents a speaker recognition technique based on log-linear models (LLMs) using Bayesian statistics. Since d... [more] SP2013-98
pp.13-18
PRMU 2013-02-22
09:30
Osaka   Extended separable lattice HMMs based on state duration control for recognition of images with variations
Takaya Makino, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2012-164
In this paper, an extension of separable lattice HMMs is described that (SL-HMM) introduces state duration control for d... [more] PRMU2012-164
pp.149-154
PRMU 2013-02-22
10:00
Osaka   Image recognition based on hidden Markov eigen-image models with the variational Bayesian method
Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2012-165
This paper proposes an image recognition technique based on Hidden Markov Eigen-image Models (HMEMs) using the variation... [more] PRMU2012-165
pp.155-160
PRMU 2011-11-25
13:45
Nagasaki   Face recognition based on separable lattice 2-D HMMs with variational Bayesian method
Kei Sawada, Akira Tamamori, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2011-120
This paper proposes an image recognition technique based on separable lattice 2-D hidden Markov models (SL2D-HMMs) with ... [more] PRMU2011-120
pp.125-130
SP 2011-06-23
15:30
Aichi Nagoya Univ. Bayesian speech recognition based on model structure integration
Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2011-32
This paper proposes an acoustic modeling technique using multiple model structures based on a Bayesian framework for spe... [more] SP2011-32
pp.11-16
SP, NLC 2008-12-10
09:55
Tokyo Waseda Univ. Bayesian Context Clustering Using Cross Validation for HMM-Based Speech Synthesis
Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Institute of Technology) NLC2008-36 SP2008-91
This paper proposes a prior distribution determination technique using cross validation for HMM-based speech synthesis b... [more] NLC2008-36 SP2008-91
pp.73-78
SP, NLC 2008-12-10
16:10
Tokyo Waseda Univ. Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method
Tatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nitech) NLC2008-55 SP2008-110
 [more] NLC2008-55 SP2008-110
pp.185-190
 Results 1 - 20 of 22  /  [Next]  
Choose a download format for default settings. [NEW !!]
Text format pLaTeX format CSV format BibTeX format
Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)


[Return to Top Page]

[Return to IEICE Web Page]


The Institute of Electronics, Information and Communication Engineers (IEICE), Japan