Paper Abstract and Keywords |
Presentation |
2020-12-02 09:40
Approximation Method for Bayes Optimal Prediction in Phoneme Recognition Problem Taishi Yamaoka, Shota Saito, Toshiyasu Matsushima (Waseda Univ.) IT2020-30 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
In this paper, we propose a method of phoneme recognition. In the previous studies on phoneme recognition using the Hidden Markov Model, the Hidden Markov Model used for prediction is defined as one by a certain criteria. In addition, for the defined Hidden Markov Model, parameters were estimated from the training data, and the phonemes corresponding to the new speech data were predicted using paremters.
In this peper, we assume 0-1 loss as the loss function, and formulate the optimum prediction based on Bayesian criterion. In other words, instead of selecting one Hidden Markov Model and estimating its parameters and making predictions using them, we propose a prediction that directly minimizes the probability of error in the prediction.
Although this prediction is theoretically optimal, its calculation involves two problems: (i) The complexity of the sum calculation of the state transition series is on the exponential order with respect to the length of the voice. (ii) It is difficult to analytically calculate the integral by the posterior distribution of the parameters of the Hidden Markov Model. In order to solve these problems, in this paper, we apply the Viterbi algorithm for problem (i) and the Variational Bayesian method for problem (ii), and propose a Bayesian semi-optimal algorithm. This algorithm makes predictions by weighted averages of approximate posterior distributions of multiple Hidden Markov Models. By conducting numerical experiments using artificial data, it was confirmed that the proposed method has a smaller false recognition rate than the method of selecting and predicting one model as in the previous research. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Phoneme recognition / Hidden Markov model / Bayes criteria / / / / / |
Reference Info. |
IEICE Tech. Rep., vol. 120, no. 268, IT2020-30, pp. 32-37, Dec. 2020. |
Paper # |
IT2020-30 |
Date of Issue |
2020-11-24 (IT) |
ISSN |
Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
IT2020-30 |
Conference Information |
Committee |
IT |
Conference Date |
2020-12-01 - 2020-12-03 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Online |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Lectures for Young Researchers, General |
Paper Information |
Registration To |
IT |
Conference Code |
2020-12-IT |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Approximation Method for Bayes Optimal Prediction in Phoneme Recognition Problem |
Sub Title (in English) |
|
Keyword(1) |
Phoneme recognition |
Keyword(2) |
Hidden Markov model |
Keyword(3) |
Bayes criteria |
Keyword(4) |
|
Keyword(5) |
|
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Taishi Yamaoka |
1st Author's Affiliation |
Waseda University (Waseda Univ.) |
2nd Author's Name |
Shota Saito |
2nd Author's Affiliation |
Waseda University (Waseda Univ.) |
3rd Author's Name |
Toshiyasu Matsushima |
3rd Author's Affiliation |
Waseda University (Waseda Univ.) |
4th Author's Name |
|
4th Author's Affiliation |
() |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2020-12-02 09:40:00 |
Presentation Time |
20 minutes |
Registration for |
IT |
Paper # |
IT2020-30 |
Volume (vol) |
vol.120 |
Number (no) |
no.268 |
Page |
pp.32-37 |
#Pages |
6 |
Date of Issue |
2020-11-24 (IT) |
|