Paper Abstract and Keywords |
Presentation |
2022-10-13 14:15
Toward Improving Speech Naturalness Introducing a Capsule Structure for Speech Enhancement Networks Reito Kasuga, Tetsuya Shimamura, Yosuke Sugiura, Nozomiko Yasui (Saitama Univ.) SIS2022-12 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Although the field of speech enhancement has been extensively studied around the world, phase tends to be neglected compared to amplitude and frequency among the basic quantities handled in speech signal processing. This is because it was believed that the contribution of phase to speech quality was small, based on the perception that human hearing is insensitive to changes in phase. However, with the development of speech signal processing, the importance of phase to speech quality has become clear. In this paper, we introduce the capsule structure of the Capsule Network, which has shown excellent performance in the field of image recognition in recent years, to the speech enhancement network, and attempt to improve the performance of the speech enhancement network and the naturalness of speech by constructing a speech enhancement model that also focuses on phase information. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
speech enhancement / phase / speech quality / Capsule Network / capsule structure / naturalness of speech / / |
Reference Info. |
IEICE Tech. Rep., vol. 122, no. 209, SIS2022-12, pp. 7-12, Oct. 2022. |
Paper # |
SIS2022-12 |
Date of Issue |
2022-10-06 (SIS) |
ISSN |
Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SIS2022-12 |
Conference Information |
Committee |
SIS ITE-BCT |
Conference Date |
2022-10-13 - 2022-10-14 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Hachinohe Institute of Technology |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
|
Paper Information |
Registration To |
SIS |
Conference Code |
2022-10-SIS-BCT |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Toward Improving Speech Naturalness Introducing a Capsule Structure for Speech Enhancement Networks |
Sub Title (in English) |
|
Keyword(1) |
speech enhancement |
Keyword(2) |
phase |
Keyword(3) |
speech quality |
Keyword(4) |
Capsule Network |
Keyword(5) |
capsule structure |
Keyword(6) |
naturalness of speech |
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Reito Kasuga |
1st Author's Affiliation |
Saitama University (Saitama Univ.) |
2nd Author's Name |
Tetsuya Shimamura |
2nd Author's Affiliation |
Saitama University (Saitama Univ.) |
3rd Author's Name |
Yosuke Sugiura |
3rd Author's Affiliation |
Saitama University (Saitama Univ.) |
4th Author's Name |
Nozomiko Yasui |
4th Author's Affiliation |
Saitama University (Saitama Univ.) |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2022-10-13 14:15:00 |
Presentation Time |
20 minutes |
Registration for |
SIS |
Paper # |
SIS2022-12 |
Volume (vol) |
vol.122 |
Number (no) |
no.209 |
Page |
pp.7-12 |
#Pages |
6 |
Date of Issue |
2022-10-06 (SIS) |
|