Paper Abstract and Keywords |
Presentation |
2024-05-22 14:55
Environmental sound synthesis and creation of dataset using vocal imitations Yuki Okamoto (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Shinnosuke Takamichi (The Univ. of Tokyo/Keio Univ.), Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) EA2024-5 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
One way to represent the characteristics of environmental sounds is to imitate the environmental sounds by human voice called vocal imitation. The input information (e.g., image, text, etc.) in conventional environmental sound synthesis cannot represent the pitch and rhythm of environmental sounds.On the other hand, vocal imitation is effective in representing the pitch and rhythm of environmental sounds.We thus create a dataset of vocal imitations of environmental sounds.We also propose an environmental sound synthesis method using vocal imitation of environmental sounds.Using vocal imitations and sound event labels as input to the environmental sound synthesis model, we can control the pitch, rhythm, and sound event of the synthesized sounds.Experimental results show that using vocal imitations effectively controls the pitch and rhythm of synthesized sounds. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Environmental sound synthesis / Environmental sound conversion / Vocal imitation / Sound event label / / / / |
Reference Info. |
IEICE Tech. Rep., vol. 124, no. 42, EA2024-5, pp. 22-22, May 2024. |
Paper # |
EA2024-5 |
Date of Issue |
2024-05-15 (EA) |
ISSN |
Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
EA2024-5 |
|