Paper Abstract and Keywords |
Presentation |
2012-12-21 15:25
Syllable nucleus detection using waveform envelopes and modeling of the word acquisition process using word structures and syllable nuclei Yousuke Ozaki, Nobuaki Minematsu, Keikichi Hirose (The Univ. of Tokyo), Donna Erickson (Showa Univ. of Music) SP2012-94 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Simulation of language acquisition processes is an active research area in speech and computer science. Here, models and hypotheses proposed in developmental psychology play important roles. Problems addressed in simulation studies may be broadly classified into two categories: (1) How to segment a continuous speech stream into words? (2) How to normalize the acoustic features that vary based on speaker and environment? The main focus of this study is put on the second problem. In our previous studies, we proposed a new speech modeling technique, called speech structures, where the non-linguistic aspect of speech is cancelled well from speech acoustics and only the linguistic aspect is represented in the model. In this study, by considering infants' good sensitivity to rhythmic structure of language, automatic detection of syllable nuclei are technically implemented using waveform envelopes. Then, the detected nuclei are used in the matching module of a structure-based word recognition system. Results show that the validity of using the syllable nuclei to improve the performance. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
language acquisition / speech normalization / Speech Structure / language rhythm / sonority / waveform envelope / syllable nucleus / |
Reference Info. |
IEICE Tech. Rep., vol. 112, no. 369, SP2012-94, pp. 113-118, Dec. 2012. |
Paper # |
SP2012-94 |
Date of Issue |
2012-12-13 (SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2012-94 |
Conference Information |
Committee |
SP IPSJ-SLP |
Conference Date |
2012-12-20 - 2012-12-21 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
TITECH(Ookayama) |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
14th Symposium on Spoken Language |
Paper Information |
Registration To |
SP |
Conference Code |
2012-12-SP-SLP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Syllable nucleus detection using waveform envelopes and modeling of the word acquisition process using word structures and syllable nuclei |
Sub Title (in English) |
|
Keyword(1) |
language acquisition |
Keyword(2) |
speech normalization |
Keyword(3) |
Speech Structure |
Keyword(4) |
language rhythm |
Keyword(5) |
sonority |
Keyword(6) |
waveform envelope |
Keyword(7) |
syllable nucleus |
Keyword(8) |
|
1st Author's Name |
Yousuke Ozaki |
1st Author's Affiliation |
The University of Tokyo (The Univ. of Tokyo) |
2nd Author's Name |
Nobuaki Minematsu |
2nd Author's Affiliation |
The University of Tokyo (The Univ. of Tokyo) |
3rd Author's Name |
Keikichi Hirose |
3rd Author's Affiliation |
The University of Tokyo (The Univ. of Tokyo) |
4th Author's Name |
Donna Erickson |
4th Author's Affiliation |
Showa University of Music (Showa Univ. of Music) |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2012-12-21 15:25:00 |
Presentation Time |
90 minutes |
Registration for |
SP |
Paper # |
SP2012-94 |
Volume (vol) |
vol.112 |
Number (no) |
no.369 |
Page |
pp.113-118 |
#Pages |
6 |
Date of Issue |
2012-12-13 (SP) |
|