IEICE Technical Committee Submission System
Conference Paper's Information
Online Proceedings
[Sign in]
Tech. Rep. Archives
 Go Top Page Go Previous   [Japanese] / [English] 

Paper Abstract and Keywords
Presentation 2014-05-25 11:30
Speech waveform generation on subband domain
Nobuyuki Nishizawa, Tsuneo Kato (KDDI R&D Labs) SP2014-35
Abstract (in Japanese) (See Japanese page) 
(in English) To reduce the computational cost for waveform generation in speech synthesis based on analysis-synthesis systems like HMM-based speech synthesizers, a method based on the subband coding, which is also used in MPEG Audio, is introduced. In the method, signal processing is performed on the subband domain rather than the time domain. Although the HMM-based speech synthesis can generate relatively high quality sound in a small footprint, the computational cost for the waveform generation process is higher than that of the conventional concatenative speech synthesis directly using waveform segments. Since a low amount of computations is often required in small systems such as embedded systems, we proposed the subband-coding-based method to reduce the computational cost in our former studies. In the method, speech waveforms are generated by combining sinusoids and band-decomposed white noises. Because subband code vectors for such waveforms result in sparse vectors, the computational cost can be reduced by processing on the subband domain even where the cost for decoding of the subband code vectors is taken into account. Moreover, a method for the computation of spectral convolutions from melcepstra using a fast discrete cosine transformation algorithm is also introduced. It would be required in HMM-based speech synthesizers with the proposed waveform generation method in practice.
Keyword (in Japanese) (See Japanese page) 
(in English) HMM-based speech synthesis / speech waveform generation / filter bank / subband coding / embedded systems / / /  
Reference Info. IEICE Tech. Rep., vol. 114, no. 52, SP2014-35, pp. 349-354, May 2014.
Paper # SP2014-35 
Date of Issue 2014-05-17 (SP) 
ISSN Print edition: ISSN 0913-5685    Online edition: ISSN 2432-6380
Copyright
and
reproduction
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)
Download PDF SP2014-35

Conference Information
Committee SP IPSJ-MUS  
Conference Date 2014-05-24 - 2014-05-25 
Place (in Japanese) (See Japanese page) 
Place (in English)  
Topics (in Japanese) (See Japanese page) 
Topics (in English)  
Paper Information
Registration To SP 
Conference Code 2014-05-SP-MUS 
Language Japanese 
Title (in Japanese) (See Japanese page) 
Sub Title (in Japanese) (See Japanese page) 
Title (in English) Speech waveform generation on subband domain 
Sub Title (in English)  
Keyword(1) HMM-based speech synthesis  
Keyword(2) speech waveform generation  
Keyword(3) filter bank  
Keyword(4) subband coding  
Keyword(5) embedded systems  
Keyword(6)  
Keyword(7)  
Keyword(8)  
1st Author's Name Nobuyuki Nishizawa  
1st Author's Affiliation KDDI R&D Laboratories, Inc. (KDDI R&D Labs)
2nd Author's Name Tsuneo Kato  
2nd Author's Affiliation KDDI R&D Laboratories, Inc. (KDDI R&D Labs)
3rd Author's Name  
3rd Author's Affiliation ()
4th Author's Name  
4th Author's Affiliation ()
5th Author's Name  
5th Author's Affiliation ()
6th Author's Name  
6th Author's Affiliation ()
7th Author's Name  
7th Author's Affiliation ()
8th Author's Name  
8th Author's Affiliation ()
9th Author's Name  
9th Author's Affiliation ()
10th Author's Name  
10th Author's Affiliation ()
11th Author's Name  
11th Author's Affiliation ()
12th Author's Name  
12th Author's Affiliation ()
13th Author's Name  
13th Author's Affiliation ()
14th Author's Name  
14th Author's Affiliation ()
15th Author's Name  
15th Author's Affiliation ()
16th Author's Name  
16th Author's Affiliation ()
17th Author's Name  
17th Author's Affiliation ()
18th Author's Name  
18th Author's Affiliation ()
19th Author's Name  
19th Author's Affiliation ()
20th Author's Name  
20th Author's Affiliation ()
Speaker Author-1 
Date Time 2014-05-25 11:30:00 
Presentation Time 240 minutes 
Registration for SP 
Paper # SP2014-35 
Volume (vol) vol.114 
Number (no) no.52 
Page pp.349-354 
#Pages
Date of Issue 2014-05-17 (SP) 


[Return to Top Page]

[Return to IEICE Web Page]


The Institute of Electronics, Information and Communication Engineers (IEICE), Japan