Paper Abstract and Keywords |
Presentation |
2014-12-16 11:00
Noise robust speech recognition by non-negative matrix factorization using GMM clustering in MFCC domain Kentaro Fujigaki, Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) SP2014-113 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Exemplar-based feature enhancement by non-negative matrix factorization (NMF) was proposed for noise-robust speech recognition. When we consider only additive noises, we can decompose a noisy speech spectrum into a linear but sparse combination of speech and noise bases. In the conventional NMF, decomposition is unsupervised. If we can give the phoneme sequence of an input utterance to the NMF processing, it is surely possible to realize much more precise decomposition. However, in the task of speech recognition, the phoneme sequence is unknown and unavailable. In this paper, therefore, we introduce unsupervised GMM clustering and classify each input frame by using GMM indexes. For NMF, speech bases are built separately for each GMM index. Experiments show that our proposed method of combining NMF with GMM clustering gives higher robustness of recognizing noisy speech than the original NMF. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
robust speech recognition / noise surpression / feature enhancement / NMF / GMM clustering / / / |
Reference Info. |
IEICE Tech. Rep., vol. 114, no. 365, SP2014-113, pp. 69-74, Dec. 2014. |
Paper # |
SP2014-113 |
Date of Issue |
2014-12-08 (SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2014-113 |
Conference Information |
Committee |
NLC IPSJ-NL SP IPSJ-SLP JSAI-SLUD |
Conference Date |
2014-12-15 - 2014-12-17 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Tokyo Institute of Technology (Suzukakedai Campus) |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
The 6th Symposium on Collective Knowlege |
Paper Information |
Registration To |
SP |
Conference Code |
2014-12-NLC-NL-SP-SLP-SLUD |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Noise robust speech recognition by non-negative matrix factorization using GMM clustering in MFCC domain |
Sub Title (in English) |
|
Keyword(1) |
robust speech recognition |
Keyword(2) |
noise surpression |
Keyword(3) |
feature enhancement |
Keyword(4) |
NMF |
Keyword(5) |
GMM clustering |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Kentaro Fujigaki |
1st Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
2nd Author's Name |
Yosuke Kashiwagi |
2nd Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
3rd Author's Name |
Daisuke Saito |
3rd Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
4th Author's Name |
Nobuaki Minematsu |
4th Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
5th Author's Name |
Keikichi Hirose |
5th Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2014-12-16 11:00:00 |
Presentation Time |
90 minutes |
Registration for |
SP |
Paper # |
SP2014-113 |
Volume (vol) |
vol.114 |
Number (no) |
no.365 |
Page |
pp.69-74 |
#Pages |
6 |
Date of Issue |
2014-12-08 (SP) |
|