Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 10:20 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
A Study on Hybrid RNN-T/Attention-based Streaming ASR with Triggered Chunkwise Attention and Dual Internal Language Model Integration Takafumi Moriya, Takanori Ashihara, Atsushi Ando, Hiroshi Sato, Tomohiro Tanaka, Kohei Matsuura, Ryo Masumura, Marc Delcroix (NTT), Takahiro Shinozaki (Tokyo Tech) EA2021-78 SIP2021-105 SP2021-63 |
In this paper we propose improvements to our recently proposed hybrid RNN-T/Attention architecture that includes a share... [more] |
EA2021-78 SIP2021-105 SP2021-63 pp.90-95 |
SP, SIP, EA |
2017-03-02 09:00 |
Okinawa |
Okinawa Industry Support Center (Okinawa) |
[Poster Presentation]
Hardware Speech Sensor Based on Deep Neural Network Feature Extractor and Template Matching Yi Liu, Boyu Qian, Jian Wang, Takahiro Shinozaki (Titech) EA2016-135 SIP2016-190 SP2016-130 |
We explore the possibility of combination of a DNN-based feature extractor and template based matching for keyword detec... [more] |
EA2016-135 SIP2016-190 SP2016-130 pp.297-300 |
SP |
2016-10-27 15:10 |
Shizuoka |
Shizuoka University. (Shizuoka) |
[Invited Talk]
Constructing speech recognition system using Kaldi toolkit Takahiro Shinozaki (Tokyo Tech) SP2016-46 |
[more] |
SP2016-46 pp.25-29 |
SP |
2016-08-24 16:15 |
Kyoto |
ACCMS, Kyoto Univ. (Kyoto) |
SP2016-33 |
(To be available after the conference date) [more] |
SP2016-33 pp.33-36 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-02 13:55 |
Aichi |
Nagoya Inst of Tech. (Aichi) |
Automation of high performance system building for large vocabulary speech recognition using evolution strategy with pareto optimality Takafumi Moriya, Tomohiro Tanaka, Takahiro Shinozaki (Tokyo Tech), Shinji Watanabe (MERL), Kevin Duh (NAIST) SP2015-75 |
The performance of speech recognition tasks can be significantly improved by the use of deep neural networks (DNN). Howe... [more] |
SP2015-75 pp.31-36 |
EA, EMM |
2015-11-12 15:15 |
Kumamoto |
Kumamoto Univ. (Kumamoto) |
Facial image conversion based on transformation of Animation Units using DNN Yuuki Saito, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Institute of Technology), Akinori Ito (Tohoku Univ.) EA2015-28 EMM2015-49 |
[more] |
EA2015-28 EMM2015-49 pp.23-28 |
SP |
2015-10-15 13:50 |
Hyogo |
Kobe Univ. (Hyogo) |
A Study on Speaker-Independent Voice Conversion Using Spectral Differential Filter Based on Neural Network Harunori Koike, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Tech), Akinori Ito (Tohoku Univ.) SP2015-61 |
In this paper, we propose a novel technique for making the speech individuality of an arbitrary source (input) speaker. ... [more] |
SP2015-61 pp.13-18 |
SP |
2015-10-16 10:50 |
Hyogo |
Kobe Univ. (Hyogo) |
Switch-To-Speech Communication Aid System Using WFST and Low Latency Search Algorithm Fuming Fang, Takahiro Shinozaki (Tokyo Tech) SP2015-68 |
To establish a communication method for the patients who have lost nearly all of their voluntary movement
including spe... [more] |
SP2015-68 pp.51-56 |
EMM, IT |
2015-05-21 15:20 |
Kyoto |
Kyoto International Community House (Kyoto) |
A study on speaker conversion using speech and expression features for video chatting Yuuki Saito, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Institute of Technology), Akinori Ito (Tohoku Univ.) IT2015-9 EMM2015-9 |
In this paper, we suggest two method that the individuality of the face of original speaker convert that of target speak... [more] |
IT2015-9 EMM2015-9 pp.45-50 |