Paper Abstract and Keywords |
Presentation |
2013-07-19 14:45
Multimodal blind speech extraction using image tracking in spoken dialogue robot Kitachan Kotaro Yoshie, Yuji Onuma, Hiroshi Saruwatari, Satoshi Nakamura, Kiyohiro Shikano (NAIST) EA2013-50 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
In a robot dialogue system, using a hands-free microphone enables more natural conversation. However, recognition accuracy decreases as interfering noise increases. Therefore, in this study, we aim to improve recognition accuracy by using semi-blind speech extraction with the speaker position obtained from image information via "Kinect", which has a microphone array and a camera. We use the spoken dialogue robot Kitachan, which the authors operate for station guidance. Experimental results confirm that recognition performance is markedly improved compared with the case where image information is not used. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
BSSA / multichannel source separation / spoken dialogue robot / image tracking |
Reference Info. |
IEICE Tech. Rep., vol. 113, no. 134, EA2013-50, pp. 99-104, July 2013. |
Paper # |
EA2013-50 |
Date of Issue |
2013-07-11 (EA) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
EA |
Conference Date |
2013-07-18 - 2013-07-19 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Health Sci. Univ. of Hokkaido |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Engineering/Electro Acoustics, Musical Acoustics, Psychological and Physiological Acoustics, and Related Topics |
Paper Information |
Registration To |
EA |
Conference Code |
2013-07-EA |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Multimodal blind speech extraction using image tracking in spoken dialogue robot Kitachan |
Keyword(1) |
BSSA |
Keyword(2) |
multichannel source separation |
Keyword(3) |
spoken dialogue robot |
Keyword(4) |
image tracking |
1st Author's Name |
Kotaro Yoshie |
1st Author's Affiliation |
Nara Institute of Science and Technology (NAIST) |
2nd Author's Name |
Yuji Onuma |
2nd Author's Affiliation |
Nara Institute of Science and Technology (NAIST) |
3rd Author's Name |
Hiroshi Saruwatari |
3rd Author's Affiliation |
Nara Institute of Science and Technology (NAIST) |
4th Author's Name |
Satoshi Nakamura |
4th Author's Affiliation |
Nara Institute of Science and Technology (NAIST) |
5th Author's Name |
Kiyohiro Shikano |
5th Author's Affiliation |
Nara Institute of Science and Technology (NAIST) |
Speaker |
Author-1 |
Date Time |
2013-07-19 14:45:00 |
Presentation Time |
30 minutes |
Registration for |
EA |
Paper # |
EA2013-50 |
Volume (vol) |
vol.113 |
Number (no) |
no.134 |
Page |
pp.99-104 |
#Pages |
6 |
Date of Issue |
2013-07-11 (EA) |
|