Language Modeling for Automatic Speech Recognition of Lectures in Elementary and Secondary Education

Nanjo,Hiroaki; Hisaki,Ippei; Wada,Yuki

IEICE Technical Committee Submission System
Conference Paper's Information

Online Proceedings
[Sign in]
Tech. Rep. Archives

Paper Abstract and Keywords
Presentation		2011-10-06 11:30 Language Modeling for Automatic Speech Recognition of Lectures in Elementary and Secondary Education Hiroaki Nanjo, Ippei Hisaki, Yuki Wada (Ryukoku Univ.) SP2011-54 WIT2011-36
Abstract	(in Japanese)	(See Japanese page)
	(in English)	Automatic speech recognition (ASR) of lectures on elementary and secondary education is addressed. Most of conventional studies of lecture speech recognition target on lectures in universities or oral presentations in technical conferences, in which lecturers make their speech for adult audiences. On the contrary, in elementary school or junior high-school, lecture audience is immature people. Lecturers (teachers) often make utterances in a different way from talks to adult audiences. Specifically, teachers try to select easy words and phrases, some of which are only for kids. For ASR of elementary school lectures, a language model which covers such linguistic phenomena is required. In this paper, suitable vocabulary and language model for elementary school lectures are discussed. Word 3-gram language model trained with texts for adults (Corpus of spontaneous Japanese and one-year newspaper articles) cannot cover a half of 3-grams (about 3000 kinds) appeared in 13 lectures in school. We got higher adjusted testset perplexity about 343. Word 3-gram language model trained with small texts for kids (1.2M words from kids-oriented web sites), we can cover one-third of 3-grams, which are not modeled in the language model for adult. We confirmed that it is significant to collect text corpora for ASR of elementary school lectures.
Keyword	(in Japanese)	(See Japanese page)
	(in English)	Lecture Speech Recognition / Information Support / Elementary and Secondary Education / Language Model / Speaking Style / / /
Reference Info.		IEICE Tech. Rep., vol. 111, no. 225, SP2011-54, pp. 13-18, Oct. 2011.
Paper #		SP2011-54
Date of Issue		2011-09-29 (SP, WIT)
ISSN		Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380
Copyright and reproduction		All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)
Download PDF		SP2011-54 WIT2011-36

Conference Information
Committee	WIT SP
Conference Date	2011-10-06 - 2011-10-07
Place (in Japanese)	(See Japanese page)
Place (in English)	TFT Bldg.
Topics (in Japanese)	(See Japanese page)
Topics (in English)	Welfare and speech processing, etc.
Paper Information
Registration To	SP
Conference Code	2011-10-WIT-SP
Language	Japanese
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Language Modeling for Automatic Speech Recognition of Lectures in Elementary and Secondary Education
Sub Title (in English)
Keyword(1)	Lecture Speech Recognition
Keyword(2)	Information Support
Keyword(3)	Elementary and Secondary Education
Keyword(4)	Language Model
Keyword(5)	Speaking Style
Keyword(6)
Keyword(7)
Keyword(8)
1st Author's Name	Hiroaki Nanjo
1st Author's Affiliation	Ryukoku University (Ryukoku Univ.)
2nd Author's Name	Ippei Hisaki
2nd Author's Affiliation	Ryukoku University (Ryukoku Univ.)
3rd Author's Name	Yuki Wada
3rd Author's Affiliation	Ryukoku University (Ryukoku Univ.)
4th Author's Name
4th Author's Affiliation	()
5th Author's Name
5th Author's Affiliation	()
6th Author's Name
6th Author's Affiliation	()
7th Author's Name
7th Author's Affiliation	()
8th Author's Name
8th Author's Affiliation	()
9th Author's Name
9th Author's Affiliation	()
10th Author's Name
10th Author's Affiliation	()
11th Author's Name
11th Author's Affiliation	()
12th Author's Name
12th Author's Affiliation	()
13th Author's Name
13th Author's Affiliation	()
14th Author's Name
14th Author's Affiliation	()
15th Author's Name
15th Author's Affiliation	()
16th Author's Name
16th Author's Affiliation	()
17th Author's Name
17th Author's Affiliation	()
18th Author's Name
18th Author's Affiliation	()
19th Author's Name
19th Author's Affiliation	()
20th Author's Name
20th Author's Affiliation	()
Speaker	Author-1
Date Time	2011-10-06 11:30:00
Presentation Time	30 minutes
Registration for	SP
Paper #	SP2011-54, WIT2011-36
Volume (vol)	vol.111
Number (no)	no.225(SP), no.226(WIT)
Page	pp.13-18
#Pages	6
Date of Issue	2011-09-29 (SP, WIT)

[Return to Top Page]

[Return to IEICE Web Page]

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan