ken-system: - All Technical Committee Conferences

IEICE Technical Committee Submission System
Conference Schedule

Online Proceedings
[Sign in]
Tech. Rep. Archives

[Japanese] / [English]

(

Committee/Place/Topics

) --Press->

(

Paper Keywords: / Column:Title Auth. Affi. Abst. Keyword

) --Press->

All Technical Committee Conferences (Searched in: All Years)

Search Results: Conference Papers

Conference Papers (Available on Advance Programs) (Sort by: Date Descending)

Committee	Date Time	Place		Paper Title / Authors	Abstract	Paper #
SP, IPSJ-SLP, EA, SIP [detail]	2023-02-28 10:10	Okinawa	(Primary: On-site, Secondary: Online)	Singing voice synthesis based on a frame-driven attention mechanism considering vocal timing deviation Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (NITech) EA2022-78 SIP2022-122 SP2022-42	This paper proposes singing voice synthesis (SVS) based on a frame-driven attention mechanism considering vocal timing d... [more]	EA2022-78 SIP2022-122 SP2022-42 pp.19-24
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail]	2019-12-06 13:55	Tokyo	NHK Science & Technology Research Labs.	[Poster Presentation] Synthetic speech-based sound masking for privacy protection when speaking to smartphones in public space Takahiro Tsugui, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2019-38	In this paper, we propose a synthetic speech-based sound masking method that protects the privacy when speaking to smart... [more]	SP2019-38 pp.55-60
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail]	2019-12-06 16:00	Tokyo	NHK Science & Technology Research Labs.	A comparison of neural vocoders in singing voice synthesis Sota Wada, Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2019-42	In this study, we compare five types of vocoders based on neural networks (neural vocoders) for singing voice synthesis.... [more]	SP2019-42 pp.85-90
PRMU, SP	2018-06-29 11:00	Nagano		Speaker adaptation in speech synthesis based on neural networks including temporal structure modeling Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT) PRMU2018-31 SP2018-11	This paper proposes a speaker adaptation technique for speech synthesis based on deep neural networks (DNNs) using a str... [more]	PRMU2018-31 SP2018-11 pp.53-58
SP, ASJ-H	2018-01-20 14:55	Tokyo	The University of Tokyo	[Poster Presentation] TRAJECTORY TRAINING CONSIDERING POWER FOR SPEECH SYNTHESIS BASED ON NEURAL NETWORKS Ryohei Funato, Kei Hashimoto, keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2017-74	In statistical parametric speech synthesis, a relation between acoustic features and linguistic features is modeled by s... [more]	SP2017-74 pp.43-48
SP, ASJ-H	2018-01-21 15:35	Tokyo	The University of Tokyo	Mel-cepstrum based quantization noise shaping applied to speech synthesis based on WaveNet Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2017-83	This paper proposes a mel-cepstrum based quantization noise shaping for improving the quality of synthetic speech genera... [more]	SP2017-83 pp.93-98
SP, ASJ-H	2018-01-21 16:00	Tokyo	The University of Tokyo	A study on voice conversion based on WaveNet Jumpei Niwa, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT) SP2017-84	This paper proposes a voice conversion technique based on WaveNet to directly generate target audio waveforms from acous... [more]	SP2017-84 pp.99-104
SP	2017-01-21 11:00	Tokyo	The University of Tokyo	[Poster Presentation] Designing linguistic features for expressive speech synthesis using audiobooks Chiaki Asai, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2016-70	In order to synthesize expressive speech, various statistical parametric speech synthesis systems have been proposed. Sp... [more]	SP2016-70 pp.35-40
SP	2017-01-21 16:35	Tokyo	The University of Tokyo	Simultaneous modeling of acoustic feature sequences and its temporal structures for DNN-based speech synthesis Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2016-76	In statistical parametric speech synthesis, a hidden Markov model (HMM) is widely used as an acoustic model. Recently, d... [more]	SP2016-76 pp.71-76
PRMU, SP, WIT, ASJ-H	2016-06-13 09:30	Tokyo		Image recognition based on discriminative models using features generated from separable lattice HMMs Yoshinari Tsuzuki, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2016-36 SP2016-2 WIT2016-2	One of the major problems in image recognition is degradation in the recognition performance caused by geometric variati... [more]	PRMU2016-36 SP2016-2 WIT2016-2 pp.7-12
PRMU, CNR	2016-02-21 14:00	Fukuoka		Parameter sharing structures of separable lattice HMMs using mixture output distributions for image recognition Masato Sukegawa, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2015-138 CNR2015-39	In image recognition systems, it is important to deal with geometrical variations such as size and location. Separable l... [more]	PRMU2015-138 CNR2015-39 pp.37-42
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail]	2015-12-03 09:00	Aichi	Nagoya Inst of Tech.	Evaluation of text-to-speech system construction for unknown-pronunciation languages Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2015-80	This paper discusses a method to construction of text-to-speech (TTS) systems for unknown-pronunciation languages. There... [more]	SP2015-80 pp.93-98
SP, IPSJ-SLP (Joint)	2015-07-17 14:10	Nagano	Katakura Suwako Hotel	Investigation of privacy-preserving sounds to degrade automatic speaker verification performance Kei Hashimoto (NITECH), Junichi Yamagishi, Isao Echizen (NII) SP2015-49	Sharing speech without permission and identifying the individual from the speech by speaker recognition lead to problems... [more]	SP2015-49 pp.79-84
SP	2014-01-23 16:00	Aichi	Meijo Univ.	Speaker recognition based on log-linear models using feature generation by variational Bayesian method Akifumi Tsuge, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2013-98	This paper presents a speaker recognition technique based on log-linear models (LLMs) using Bayesian statistics. Since d... [more]	SP2013-98 pp.13-18
PRMU	2013-02-22 09:30	Osaka		Extended separable lattice HMMs based on state duration control for recognition of images with variations Takaya Makino, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2012-164	In this paper, an extension of separable lattice HMMs is described that (SL-HMM) introduces state duration control for d... [more]	PRMU2012-164 pp.149-154
PRMU	2013-02-22 10:00	Osaka		Image recognition based on hidden Markov eigen-image models with the variational Bayesian method Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2012-165	This paper proposes an image recognition technique based on Hidden Markov Eigen-image Models (HMEMs) using the variation... [more]	PRMU2012-165 pp.155-160
PRMU	2011-11-25 13:45	Nagasaki		Face recognition based on separable lattice 2-D HMMs with variational Bayesian method Kei Sawada, Akira Tamamori, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) PRMU2011-120	This paper proposes an image recognition technique based on separable lattice 2-D hidden Markov models (SL2D-HMMs) with ... [more]	PRMU2011-120 pp.125-130
SP	2011-06-23 15:30	Aichi	Nagoya Univ.	Bayesian speech recognition based on model structure integration Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) SP2011-32	This paper proposes an acoustic modeling technique using multiple model structures based on a Bayesian framework for spe... [more]	SP2011-32 pp.11-16
SP, NLC	2008-12-10 09:55	Tokyo	Waseda Univ.	Bayesian Context Clustering Using Cross Validation for HMM-Based Speech Synthesis Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Institute of Technology) NLC2008-36 SP2008-91	This paper proposes a prior distribution determination technique using cross validation for HMM-based speech synthesis b... [more]	NLC2008-36 SP2008-91 pp.73-78
SP, NLC	2008-12-10 16:10	Tokyo	Waseda Univ.	Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method Tatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nitech) NLC2008-55 SP2008-110	[more]	NLC2008-55 SP2008-110 pp.185-190

Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)

[Return to Top Page]

[Return to IEICE Web Page]

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan