Paper Abstract and Keywords |
Presentation |
2020-10-22 13:00
[Invited Talk]
NHK's activities on Japanese end-to-end speech synthesis Kiyoshi Kurihara (NHK) SP2020-11 WIT2020-12 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
The main business of NHK (Japan Broadcasting Corporation) is the production and broadcasting of programs. Many programs are produced daily and a considerable amount of work goes into the production of speech content by many people including announcers, directors, and engineers. To support this work and to provide new speech services, we have been researching speech synthesis using Deep Neural Networks (DNNs). DNN speech synthesis requires a large amount of data for training purposes, so we are also involved in the research of end-to-end speech synthesis to reduce the cost of obtaining this training data and generate high-quality speech. To achieve end-to-end speech synthesis in the Japanese language, we adapted the sequence-to-sequence + attention system of speech synthesis (seq2seq speech synthesis), which has proven results in English, to Japanese and proposed a speech synthesis technique that takes character strings consisting of kana (phonetic) text and prosodic symbols as input based on JEITA IT-4006, symbols for Japanese Text-to-Speech Synthesizer. We also developed a technique that enables control of speaking style by adding tags that express speaking style to the input data of seq2seq speech synthesis. We are developing applications for a speech synthesis system that incorporates these techniques and studying their use in a variety of scenarios. This talk describes these NHK activities in speech synthesis and introduces NHK’s efforts in universal services now being researched and developed at NHK Science & Technology Research Laboratories. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Statistical parametric speech synthesis / End-to-end speech synthesis / Speaking Style / Encoder-Decoder model / / / / |
Reference Info. |
IEICE Tech. Rep., vol. 120, no. 197, SP2020-11, pp. 19-20, Oct. 2020. |
Paper # |
SP2020-11 |
Date of Issue |
2020-10-15 (SP, WIT) |
ISSN |
Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2020-11 WIT2020-12 |
Conference Information |
Committee |
WIT SP IPSJ-SLP |
Conference Date |
2020-10-22 - 2020-10-23 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Online |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
|
Paper Information |
Registration To |
SP |
Conference Code |
2020-10-WIT-SP-SLP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
NHK's activities on Japanese end-to-end speech synthesis |
Sub Title (in English) |
|
Keyword(1) |
Statistical parametric speech synthesis |
Keyword(2) |
End-to-end speech synthesis |
Keyword(3) |
Speaking Style |
Keyword(4) |
Encoder-Decoder model |
Keyword(5) |
|
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Kiyoshi Kurihara |
1st Author's Affiliation |
NHK (Japan Broadcasting Corporation) (NHK) |
2nd Author's Name |
|
2nd Author's Affiliation |
() |
3rd Author's Name |
|
3rd Author's Affiliation |
() |
4th Author's Name |
|
4th Author's Affiliation |
() |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2020-10-22 13:00:00 |
Presentation Time |
60 minutes |
Registration for |
SP |
Paper # |
SP2020-11, WIT2020-12 |
Volume (vol) |
vol.120 |
Number (no) |
no.197(SP), no.198(WIT) |
Page |
pp.19-20 |
#Pages |
2 |
Date of Issue |
2020-10-15 (SP, WIT) |
|