Paper Abstract and Keywords |
Presentation |
2011-10-06 11:30
Language Modeling for Automatic Speech Recognition of Lectures in Elementary and Secondary Education Hiroaki Nanjo, Ippei Hisaki, Yuki Wada (Ryukoku Univ.) SP2011-54 WIT2011-36 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Automatic speech recognition (ASR) of lectures on elementary and
secondary education is addressed. Most of conventional studies of
lecture speech recognition target on lectures in universities
or oral presentations in technical conferences, in which
lecturers make their speech for adult audiences. On the
contrary, in elementary school or junior high-school, lecture
audience is immature people. Lecturers (teachers) often make
utterances in a different way from talks to adult audiences.
Specifically, teachers try to select easy words and phrases, some
of which are only for kids.
For ASR of elementary school lectures, a language model which
covers such linguistic phenomena is required. In this paper,
suitable vocabulary and language model for elementary school
lectures are discussed.
Word 3-gram language model trained with texts for adults (Corpus
of spontaneous Japanese and one-year newspaper articles) cannot
cover a half of 3-grams (about 3000 kinds) appeared in 13 lectures
in school. We got higher adjusted testset perplexity about 343.
Word 3-gram language model trained with small texts for kids
(1.2M words from kids-oriented web sites), we can cover one-third
of 3-grams, which are not modeled in the language model for adult.
We confirmed that it is significant to collect text corpora for
ASR of elementary school lectures. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Lecture Speech Recognition / Information Support / Elementary and Secondary Education / Language Model / Speaking Style / / / |
Reference Info. |
IEICE Tech. Rep., vol. 111, no. 225, SP2011-54, pp. 13-18, Oct. 2011. |
Paper # |
SP2011-54 |
Date of Issue |
2011-09-29 (SP, WIT) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2011-54 WIT2011-36 |
Conference Information |
Committee |
WIT SP |
Conference Date |
2011-10-06 - 2011-10-07 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
TFT Bldg. |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Welfare and speech processing, etc. |
Paper Information |
Registration To |
SP |
Conference Code |
2011-10-WIT-SP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Language Modeling for Automatic Speech Recognition of Lectures in Elementary and Secondary Education |
Sub Title (in English) |
|
Keyword(1) |
Lecture Speech Recognition |
Keyword(2) |
Information Support |
Keyword(3) |
Elementary and Secondary Education |
Keyword(4) |
Language Model |
Keyword(5) |
Speaking Style |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Hiroaki Nanjo |
1st Author's Affiliation |
Ryukoku University (Ryukoku Univ.) |
2nd Author's Name |
Ippei Hisaki |
2nd Author's Affiliation |
Ryukoku University (Ryukoku Univ.) |
3rd Author's Name |
Yuki Wada |
3rd Author's Affiliation |
Ryukoku University (Ryukoku Univ.) |
4th Author's Name |
|
4th Author's Affiliation |
() |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2011-10-06 11:30:00 |
Presentation Time |
30 minutes |
Registration for |
SP |
Paper # |
SP2011-54, WIT2011-36 |
Volume (vol) |
vol.111 |
Number (no) |
no.225(SP), no.226(WIT) |
Page |
pp.13-18 |
#Pages |
6 |
Date of Issue |
2011-09-29 (SP, WIT) |
|