Paper Abstract and Keywords |
Presentation |
2014-05-15 11:55
Cool Implementation of Voice Recognition System for Web Application Yuichi Maki, Noriyoshi Kamado, Shigeru Fujimura, Yushi Aono, Jyouji Nakayama, Sumitaka Sakauchi, Tomohiro Yamada (NTT) MoNA2014-6 |
Abstract |
(in English) |
We propose a browser-based speech recognition system built on HTML5 in the broad sense and report its performance in actual use. The proposed method lets browsers on PCs and mobile devices use speech recognition through client-side JavaScript code. Unlike with traditional Web applications, there is no need to install a specific application or browser plug-in. The processing flow is as follows: the client first captures streaming audio data from its microphone and then transmits that stream to the speech recognition server over the WebSocket protocol. In consideration of the quality of mobile broadband, the audio data is also compressed by client-side JavaScript, while the Voice Activity Detector (VAD) and the speech recognition decoder are implemented on the server to reduce the computational cost on the client. In this paper, we explain the architecture of the proposed system and the support status of browsers and devices. We also report the audio compression performance on the client and the quality in actual use over mobile broadband. The results show that the proposed method provides adequate quality for use as a speech recognition system. |
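The client-side half of the flow described in the abstract (capture, compress, send over WebSocket) might look roughly like the sketch below. The abstract does not name the compression scheme or the frame format, so the averaging downsampler, the 16-bit little-endian PCM encoding, and every identifier here (`BinarySink`, `downsample`, `encodePcm16`, `sendChunk`) are illustrative assumptions, not the authors' implementation.

```typescript
// Illustrative sketch only: the codec, frame layout, and all names are assumptions.

/** Minimal shape of the transport; a browser WebSocket satisfies it. */
interface BinarySink {
  send(data: ArrayBuffer): void;
}

/** Reduce the sample rate by averaging each group of `factor` consecutive samples. */
function downsample(input: Float32Array, factor: number): Float32Array {
  if (!Number.isInteger(factor) || factor < 1) {
    throw new RangeError("factor must be a positive integer");
  }
  const outLength = Math.floor(input.length / factor);
  const output = new Float32Array(outLength);
  for (let i = 0; i < outLength; i++) {
    let sum = 0;
    for (let j = 0; j < factor; j++) {
      sum += input[i * factor + j];
    }
    output[i] = sum / factor;
  }
  return output;
}

/** Encode samples in [-1, 1] as signed 16-bit little-endian PCM (2 bytes per sample). */
function encodePcm16(input: Float32Array): ArrayBuffer {
  const buffer = new ArrayBuffer(input.length * 2);
  const view = new DataView(buffer);
  for (let i = 0; i < input.length; i++) {
    // Clamp first so out-of-range samples cannot wrap around.
    const clamped = Math.max(-1, Math.min(1, input[i]));
    const scaled = clamped < 0 ? clamped * 0x8000 : clamped * 0x7fff;
    view.setInt16(i * 2, Math.round(scaled), true);
  }
  return buffer;
}

/** Compress one captured audio chunk and send it as a single binary frame. */
function sendChunk(sink: BinarySink, samples: Float32Array, factor: number): number {
  const frame = encodePcm16(downsample(samples, factor));
  sink.send(frame);
  return frame.byteLength;
}
```

In a browser, `samples` would come from microphone audio obtained through the Media Stream and Web Audio APIs listed in the keywords, and `sink` could be a `WebSocket` connected to the recognition server. In this arrangement the client only reduces and forwards audio, which matches the abstract's choice of leaving VAD and decoding to the server.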
Keyword |
(in English) |
Voice Recognition / Web Application / Web Browser / WebSocket / Media Stream / Web Audio API |
Reference Info. |
IEICE Tech. Rep., vol. 114, no. 31, MoNA2014-6, pp. 31-36, May 2014. |
Paper # |
MoNA2014-6 |
Date of Issue |
2014-05-08 (MoNA) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
|