Paper Abstract and Keywords |
Presentation |
2009-12-21 10:35
Experimental study of acoustic modeling using speaker-invariant speech contrast as modeling unit / Daisuke Saito, Ryo Matsuura, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) / NLC2009-13, SP2009-77 |
Abstract |
(in English) |
Speech acoustics vary with age, gender, vocal tract length, microphone, and other factors. The authors recently proposed a structural, abstract representation of speech in which these variations are effectively removed, so that the representation captures only the dynamics of speech. In our previous studies, an ASR framework built on this representation, which we call structure-based ASR, was proposed and examined. However, structure-based ASR has two problems: the curse of dimensionality and the large size of the modeling unit. To address these problems, this report proposes a new acoustic modeling method based on parameter sharing among statistical structure models and efficient reuse of the shared models. The proposed method considers edge vectors, which represent speech contrasts between any two acoustic events, and shares parameters by clustering in the parametric space of edge vectors. To construct an acoustic model for a new word, the most likely edge model is selected and allocated to each edge vector in the structure of that word. Recognition experiments using continuous utterances of Japanese vowels show the validity of the proposed method. |
Keyword |
(in English) |
structural representation / invariant features / speech contrasts / clustering / acoustic modeling |
Reference Info. |
IEICE Tech. Rep., vol. 109, no. 356, SP2009-77, pp. 7-12, Dec. 2009. |
Paper # |
SP2009-77 |
Date of Issue |
2009-12-14 (NLC, SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
SP NLC |
Conference Date |
2009-12-21 - 2009-12-22 |
Place (in English) |
Univ. of Tokyo |
Topics (in English) |
11th Spoken Language Symposium (SP/NLC/SLP) |
Paper Information |
Registration To |
SP |
Conference Code |
2009-12-SP-NLC |
Language |
Japanese |
Title (in English) |
Experimental study of acoustic modeling using speaker-invariant speech contrast as modeling unit |
Keyword(1) |
structural representation |
Keyword(2) |
invariant features |
Keyword(3) |
speech contrasts |
Keyword(4) |
clustering |
Keyword(5) |
acoustic modeling |
1st Author's Name |
Daisuke Saito |
1st Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
2nd Author's Name |
Ryo Matsuura |
2nd Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
3rd Author's Name |
Nobuaki Minematsu |
3rd Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
4th Author's Name |
Keikichi Hirose |
4th Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
Speaker |
Author-1 |
Date Time |
2009-12-21 10:35:00 |
Presentation Time |
25 minutes |
Registration for |
SP |
Paper # |
NLC2009-13, SP2009-77 |
Volume (vol) |
vol.109 |
Number (no) |
no.355(NLC), no.356(SP) |
Page |
pp.7-12 |
#Pages |
6 |
|