Paper Abstract and Keywords |
Presentation |
2012-11-08 14:45
Improvements of HMM-based speech synthesis using rich context models
Shinnosuke Takamichi, Tomoki Toda (NAIST), Yoshinori Shiga (NICT), Sakriani Sakti, Graham Neubig, Satoshi Nakamura (NAIST)
SP2012-78 |
Abstract |
(in English) |
In traditional HMM-based speech synthesis, the generated speech parameters tend to be excessively smoothed.
To alleviate this problem, we have proposed a parameter generation method with rich context models in our previous work.
This method improves speech quality while keeping the flexibility of HMM-based speech synthesis.
However, the synthetic speech still sounds muffled because the generated parameters depend strongly on the over-smoothed initial parameters used in the iterative parameter generation procedure.
In this paper, we propose an initialization method that generates less-smoothed initial parameters using context-clustered HMMs based on a large decision tree.
Experimental evaluations demonstrate that the proposed method yields significant improvements in the quality of synthetic speech. |
Keyword |
(in English) |
HMM-based speech synthesis / rich context model / parameter generation / tree-based context clustering |
Reference Info. |
IEICE Tech. Rep., vol. 112, no. 281, SP2012-78, pp. 37-42, Nov. 2012. |
Paper # |
SP2012-78 |
Date of Issue |
2012-11-01 (SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |