Paper Abstract and Keywords |
Presentation |
2018-08-27 15:55
Sound Event Encoder Using Onomatopoeic Representations based on End-to-End Approach Koichi Miyazaki, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2018-30 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
In this paper, we propose a sound event encoder for converting sound events into their onomatopoeic representations. The proposed method uses Connectionist Temporal Classification (CTC) as an End-to-End approach to directly convert a sequence of feature vectors of each sound event into a corresponding onomatopoeic representation which accurately represents each sound and can be intuitively understood. Moreover, to address the issue of the ambiguity of onomatopoeic representations among different individuals, we develop a database of sound events and their corresponding typical onomatopoeic representations as accepted by multiple listeners. To evaluate the performance of our proposed method, we conduct objective and subjective evaluations. Objective evaluation results demonstrate that the proposed sound event encoder is capable of converting sound events into their onomatopoeic representations with word error rate (WER) is 46.0% and phoneme error rate (PER) is 20.49%. From subjective evaluation results demonstrate with a 74.5% subjective acceptability rate, and that use of typical onomatopoeic representations, as approved by multiple subjects, yields significant improvement, resulting in an acceptability rate of 81.8%. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Sound event / Symbolization / Onomatopoeic representation / End-to-End / Connectionist Temporal Classification / / / |
Reference Info. |
IEICE Tech. Rep., vol. 118, no. 198, SP2018-30, pp. 37-42, Aug. 2018. |
Paper # |
SP2018-30 |
Date of Issue |
2018-08-20 (SP) |
ISSN |
Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2018-30 |
Conference Information |
Committee |
SP |
Conference Date |
2018-08-27 - 2018-08-27 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Kyoto Univ. |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
|
Paper Information |
Registration To |
SP |
Conference Code |
2018-08-SP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Sound Event Encoder Using Onomatopoeic Representations based on End-to-End Approach |
Sub Title (in English) |
|
Keyword(1) |
Sound event |
Keyword(2) |
Symbolization |
Keyword(3) |
Onomatopoeic representation |
Keyword(4) |
End-to-End |
Keyword(5) |
Connectionist Temporal Classification |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Koichi Miyazaki |
1st Author's Affiliation |
Nagoya University (Nagoya Univ.) |
2nd Author's Name |
Tomoki Hayashi |
2nd Author's Affiliation |
Nagoya University (Nagoya Univ.) |
3rd Author's Name |
Tomoki Toda |
3rd Author's Affiliation |
Nagoya University (Nagoya Univ.) |
4th Author's Name |
Kazuya Takeda |
4th Author's Affiliation |
Nagoya University (Nagoya Univ.) |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2018-08-27 15:55:00 |
Presentation Time |
25 minutes |
Registration for |
SP |
Paper # |
SP2018-30 |
Volume (vol) |
vol.118 |
Number (no) |
no.198 |
Page |
pp.37-42 |
#Pages |
6 |
Date of Issue |
2018-08-20 (SP) |
|