Paper Abstract and Keywords |
Presentation |
2019-03-15 13:30
[Poster Presentation]
An Evaluation of Underdetermined Source Separation Based on Multichannel Variational Autoencoder Shogo Seki (Nagoya Univ.), Hirokazu Kameoka (NTT), Li Li (Univ. Tsukuba), Tomoki Toda, Kazuya Takeda (Nagoya Univ.) EA2018-154 SIP2018-160 SP2018-116 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
This paper deals with a multichannel audio source separation problem under underdetermined conditions. Multichannel Non-negative Matrix Factorization (MNMF) is one of powerful approaches, which adopts the NMF concept for source power spectrogram modeling. This concept is also employed in Independent Low-Rank Matrix Analysis (ILRMA), a special class of the MNMF framework formulated under determined conditions. These methods work reasonably for particular types of sound sources, however, one limitation is that they can fail to work for sources with spectrograms that do not comply with the NMF model. To address this limitation, an extension of ILRMA called the Multichannel Variational Autoencoder (MVAE) method was recently proposed, where a Conditional VAE (CVAE) is used instead of the NMF model for source power spectrogram modeling. This approach has shown to perform impressively in determined source separation tasks thanks to the representation power of DNNs. This paper generalizes MVAE originally formulated under determined mixing conditions so that it can also deal with underdetermined cases. The proposed method was evaluated on an underdetermined source separation task of separating out three sources from two microphone inputs. Experimental results revealed that the generalized MVAE method achieved better performance than the MNMF method. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Underdetermined source separation / Multichannel variational autoencoder / Multichannel non-negative matrix factorization / / / / / |
Reference Info. |
IEICE Tech. Rep., vol. 118, no. 497, SP2018-116, pp. 323-328, March 2019. |
Paper # |
SP2018-116 |
Date of Issue |
2019-03-07 (EA, SIP, SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
EA2018-154 SIP2018-160 SP2018-116 |
Conference Information |
Committee |
EA SIP SP |
Conference Date |
2019-03-14 - 2019-03-15 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
i+Land nagasaki (Nagasaki-shi) |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Engineering/Electro Acoustics, Signal Processing, Speech, and Related Topics |
Paper Information |
Registration To |
SP |
Conference Code |
2019-03-EA-SIP-SP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
An Evaluation of Underdetermined Source Separation Based on Multichannel Variational Autoencoder |
Sub Title (in English) |
|
Keyword(1) |
Underdetermined source separation |
Keyword(2) |
Multichannel variational autoencoder |
Keyword(3) |
Multichannel non-negative matrix factorization |
Keyword(4) |
|
Keyword(5) |
|
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Shogo Seki |
1st Author's Affiliation |
Nagoya University (Nagoya Univ.) |
2nd Author's Name |
Hirokazu Kameoka |
2nd Author's Affiliation |
Nippon Telegraph and Telephone Corporation (NTT) |
3rd Author's Name |
Li Li |
3rd Author's Affiliation |
University of Tsukuba (Univ. Tsukuba) |
4th Author's Name |
Tomoki Toda |
4th Author's Affiliation |
Nagoya University (Nagoya Univ.) |
5th Author's Name |
Kazuya Takeda |
5th Author's Affiliation |
Nagoya University (Nagoya Univ.) |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
21st Author's Name |
|
21st Author's Affiliation |
() |
22nd Author's Name |
|
22nd Author's Affiliation |
() |
23rd Author's Name |
|
23rd Author's Affiliation |
() |
24th Author's Name |
|
24th Author's Affiliation |
() |
25th Author's Name |
|
25th Author's Affiliation |
() |
26th Author's Name |
/ / |
26th Author's Affiliation |
()
() |
27th Author's Name |
/ / |
27th Author's Affiliation |
()
() |
28th Author's Name |
/ / |
28th Author's Affiliation |
()
() |
29th Author's Name |
/ / |
29th Author's Affiliation |
()
() |
30th Author's Name |
/ / |
30th Author's Affiliation |
()
() |
31st Author's Name |
/ / |
31st Author's Affiliation |
()
() |
32nd Author's Name |
/ / |
32nd Author's Affiliation |
()
() |
33rd Author's Name |
/ / |
33rd Author's Affiliation |
()
() |
34th Author's Name |
/ / |
34th Author's Affiliation |
()
() |
35th Author's Name |
/ / |
35th Author's Affiliation |
()
() |
36th Author's Name |
/ / |
36th Author's Affiliation |
()
() |
Speaker |
Author-1 |
Date Time |
2019-03-15 13:30:00 |
Presentation Time |
90 minutes |
Registration for |
SP |
Paper # |
EA2018-154, SIP2018-160, SP2018-116 |
Volume (vol) |
vol.118 |
Number (no) |
no.495(EA), no.496(SIP), no.497(SP) |
Page |
pp.323-328 |
#Pages |
6 |
Date of Issue |
2019-03-07 (EA, SIP, SP) |