IEICE Technical Committee Submission System
Conference Paper's Information
Online Proceedings
[Sign in]
Tech. Rep. Archives
 Go Top Page Go Previous   [Japanese] / [English] 

Paper Abstract and Keywords
Presentation 2009-12-21 10:35
Experimental study of acoustic modeling using speaker-invariant speech contrast as modeling unit
Daisuke Saito, Ryo Matsuura, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) NLC2009-13 SP2009-77
Abstract (in Japanese) (See Japanese page) 
(in English) Speech acoustics vary due to differences in age, gender, vocal tract length, microphone, and so on. The authors recently proposed a structural and abstract representation of speech, where these variations were effectively removed. This representation captures only dynamics of speech. In our previous studies, using this abstract representation, an ASR framework, which we call the structure-based ASR, was proposed and examined. However, there are two problems for the structure-based ASR; the curse of dimensionality and the large size of modeling unit. As a solution for these problems, this report proposes a new acoustic modeling based on parameter sharing of statistical structure models and efficient reuse of the shared models. In the proposed method, edge vectors, which represent speech contrasts between any two acoustic events, are considered and parameter sharing is carried out based on clustering in the parametric space of edge vectors. To construct an acoustic model for a new word, the most likely edge models are selected and allocated for each edge vector in the structure of that new word. Experiments of recognition using continuous utterances of Japanese vowels show the validity of the proposed method.
Keyword (in Japanese) (See Japanese page) 
(in English) structural representation / invariant features / speech contrasts / clustering / acoustic modeling / / /  
Reference Info. IEICE Tech. Rep., vol. 109, no. 356, SP2009-77, pp. 7-12, Dec. 2009.
Paper # SP2009-77 
Date of Issue 2009-12-14 (NLC, SP) 
ISSN Print edition: ISSN 0913-5685    Online edition: ISSN 2432-6380
Copyright
and
reproduction
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)
Download PDF NLC2009-13 SP2009-77

Conference Information
Committee SP NLC  
Conference Date 2009-12-21 - 2009-12-22 
Place (in Japanese) (See Japanese page) 
Place (in English) Univ. of Tokyo 
Topics (in Japanese) (See Japanese page) 
Topics (in English) 11th Spoken Language Symposium (SP/NLC/SLP) 
Paper Information
Registration To SP 
Conference Code 2009-12-SP-NLC 
Language Japanese 
Title (in Japanese) (See Japanese page) 
Sub Title (in Japanese) (See Japanese page) 
Title (in English) Experimental study of acoustic modeling using speaker-invariant speech contrast as modeling unit 
Sub Title (in English)  
Keyword(1) structural representation  
Keyword(2) invariant features  
Keyword(3) speech contrasts  
Keyword(4) clustering  
Keyword(5) acoustic modeling  
Keyword(6)  
Keyword(7)  
Keyword(8)  
1st Author's Name Daisuke Saito  
1st Author's Affiliation The University of Tokyo (Univ. of Tokyo)
2nd Author's Name Ryo Matsuura  
2nd Author's Affiliation The University of Tokyo (Univ. of Tokyo)
3rd Author's Name Nobuaki Minematsu  
3rd Author's Affiliation The University of Tokyo (Univ. of Tokyo)
4th Author's Name Keikichi Hirose  
4th Author's Affiliation The University of Tokyo (Univ. of Tokyo)
5th Author's Name  
5th Author's Affiliation ()
6th Author's Name  
6th Author's Affiliation ()
7th Author's Name  
7th Author's Affiliation ()
8th Author's Name  
8th Author's Affiliation ()
9th Author's Name  
9th Author's Affiliation ()
10th Author's Name  
10th Author's Affiliation ()
11th Author's Name  
11th Author's Affiliation ()
12th Author's Name  
12th Author's Affiliation ()
13th Author's Name  
13th Author's Affiliation ()
14th Author's Name  
14th Author's Affiliation ()
15th Author's Name  
15th Author's Affiliation ()
16th Author's Name  
16th Author's Affiliation ()
17th Author's Name  
17th Author's Affiliation ()
18th Author's Name  
18th Author's Affiliation ()
19th Author's Name  
19th Author's Affiliation ()
20th Author's Name  
20th Author's Affiliation ()
Speaker Author-1 
Date Time 2009-12-21 10:35:00 
Presentation Time 25 minutes 
Registration for SP 
Paper # NLC2009-13, SP2009-77 
Volume (vol) vol.109 
Number (no) no.355(NLC), no.356(SP) 
Page pp.7-12 
#Pages
Date of Issue 2009-12-14 (NLC, SP) 


[Return to Top Page]

[Return to IEICE Web Page]


The Institute of Electronics, Information and Communication Engineers (IEICE), Japan