Paper Abstract and Keywords |
Presentation |
2015-10-15 13:50
A Study on Speaker-Independent Voice Conversion Using Spectral Differential Filter Based on Neural Network Harunori Koike, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Tech), Akinori Ito (Tohoku Univ.) SP2015-61 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
In this paper, we propose a novel technique for making the speech individuality of an arbitrary source (input) speaker. The proposed technique use neural network (NN) for many-to-one mapping and the NN is trained with the pairs of multiple source speakers and a target speaker. The conversion of speaker individuality of the input speech is conducted by spectral differential filter. In the previous studies of voice conversion, speaker-dependent approach was proposed where parallel speech data of source and target speakers are used for conversion model training. There is also another approach where the conversion model is trained by using speaker adaptation with a small amount of target speaker's speech. Recently, we proposed speaker-independent voice conversion without using a user's speech in the training step. The purpose of this study is to improve the naturalness of the converted speech in the speaker-independent voice conversion. We directly convert the waveform of the input speaker using a filter whose parameters are obtained by the differential of spectral features before and after feature mapping. An advantage is that the direct waveform conversion alleviate the quality reduction caused by the extraction error of fundamental frequency in the conventional technique. We also show that the naturalness is further improved by variance compensation with affine transformation. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
/ / / / / / / |
Reference Info. |
IEICE Tech. Rep., vol. 115, no. 253, SP2015-61, pp. 13-18, Oct. 2015. |
Paper # |
SP2015-61 |
Date of Issue |
2015-10-08 (SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2015-61 |
Conference Information |
Committee |
SP |
Conference Date |
2015-10-15 - 2015-10-16 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Kobe Univ. |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Speech interface, Synthesis, Dialogue, Application system, etc. |
Paper Information |
Registration To |
SP |
Conference Code |
2015-10-SP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
A Study on Speaker-Independent Voice Conversion Using Spectral Differential Filter Based on Neural Network |
Sub Title (in English) |
|
Keyword(1) |
|
Keyword(2) |
|
Keyword(3) |
|
Keyword(4) |
|
Keyword(5) |
|
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Harunori Koike |
1st Author's Affiliation |
Tohoku University (Tohoku Univ.) |
2nd Author's Name |
Takashi Nose |
2nd Author's Affiliation |
Tohoku University (Tohoku Univ.) |
3rd Author's Name |
Takahiro Shinozaki |
3rd Author's Affiliation |
Tokyo Institute of Technology (Tokyo Tech) |
4th Author's Name |
Akinori Ito |
4th Author's Affiliation |
Tohoku University (Tohoku Univ.) |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2015-10-15 13:50:00 |
Presentation Time |
25 minutes |
Registration for |
SP |
Paper # |
SP2015-61 |
Volume (vol) |
vol.115 |
Number (no) |
no.253 |
Page |
pp.13-18 |
#Pages |
6 |
Date of Issue |
2015-10-08 (SP) |
|