Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 10:20 |
Okinawa |
(Primary: On-site, Secondary: Online) |
A Study on Hybrid RNN-T/Attention-based Streaming ASR with Triggered Chunkwise Attention and Dual Internal Language Model Integration Takafumi Moriya, Takanori Ashihara, Atsushi Ando, Hiroshi Sato, Tomohiro Tanaka, Kohei Matsuura, Ryo Masumura, Marc Delcroix (NTT), Takahiro Shinozaki (Tokyo Tech) EA2021-78 SIP2021-105 SP2021-63 |
In this paper we propose improvements to our recently proposed hybrid RNN-T/Attention architecture that includes a share... [more] |
EA2021-78 SIP2021-105 SP2021-63 pp.90-95 |
SP, SIP, EA |
2017-03-02 09:00 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
Hardware Speech Sensor Based on Deep Neural Network Feature Extractor and Template Matching Yi Liu, Boyu Qian, Jian Wang, Takahiro Shinozaki (Titech) EA2016-135 SIP2016-190 SP2016-130 |
We explore the possibility of combination of a DNN-based feature extractor and template based matching for keyword detec... [more] |
EA2016-135 SIP2016-190 SP2016-130 pp.297-300 |
SP |
2016-10-27 15:10 |
Shizuoka |
Shizuoka University. |
[Invited Talk]
Constructing speech recognition system using Kaldi toolkit Takahiro Shinozaki (Tokyo Tech) SP2016-46 |
[more] |
SP2016-46 pp.25-29 |
SP |
2016-08-24 16:15 |
Kyoto |
ACCMS, Kyoto Univ. |
SP2016-33 |
(To be available after the conference date) [more] |
SP2016-33 pp.33-36 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2015-12-02 13:55 |
Aichi |
Nagoya Inst of Tech. |
Automation of high performance system building for large vocabulary speech recognition using evolution strategy with pareto optimality Takafumi Moriya, Tomohiro Tanaka, Takahiro Shinozaki (Tokyo Tech), Shinji Watanabe (MERL), Kevin Duh (NAIST) SP2015-75 |
The performance of speech recognition tasks can be significantly improved by the use of deep neural networks (DNN). Howe... [more] |
SP2015-75 pp.31-36 |
EA, EMM |
2015-11-12 15:15 |
Kumamoto |
Kumamoto Univ. |
Facial image conversion based on transformation of Animation Units using DNN Yuuki Saito, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Institute of Technology), Akinori Ito (Tohoku Univ.) EA2015-28 EMM2015-49 |
[more] |
EA2015-28 EMM2015-49 pp.23-28 |
SP |
2015-10-15 13:50 |
Hyogo |
Kobe Univ. |
A Study on Speaker-Independent Voice Conversion Using Spectral Differential Filter Based on Neural Network Harunori Koike, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Tech), Akinori Ito (Tohoku Univ.) SP2015-61 |
In this paper, we propose a novel technique for making the speech individuality of an arbitrary source (input) speaker. ... [more] |
SP2015-61 pp.13-18 |
SP |
2015-10-16 10:50 |
Hyogo |
Kobe Univ. |
Switch-To-Speech Communication Aid System Using WFST and Low Latency Search Algorithm Fuming Fang, Takahiro Shinozaki (Tokyo Tech) SP2015-68 |
To establish a communication method for the patients who have lost nearly all of their voluntary movement
including spe... [more] |
SP2015-68 pp.51-56 |
EMM, IT |
2015-05-21 15:20 |
Kyoto |
Kyoto International Community House |
A study on speaker conversion using speech and expression features for video chatting Yuuki Saito, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Institute of Technology), Akinori Ito (Tohoku Univ.) IT2015-9 EMM2015-9 |
In this paper, we suggest two method that the individuality of the face of original speaker convert that of target speak... [more] |
IT2015-9 EMM2015-9 pp.45-50 |
SP, IPSJ-MUS |
2014-05-25 11:30 |
Tokyo |
|
A Kana Protocol Recommendation Method for Switch Input Speech Synthesis Systems Fuming Fang, Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech) SP2014-36 |
Switch-to-speech interface can provide a means of interactive speech communication as a support system
for people with ... [more] |
SP2014-36 pp.355-360 |
SP, IPSJ-SLP |
2013-12-20 10:10 |
Tokyo |
|
Automatic Estimation of Accent Phrase Boundaries Using Language and Acoustic Models Hiroshi Suzuki, Tomoki Koriyama (Tokyo Tech), Takashi Nose (Tohoku Univ.), Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech) SP2013-89 |
This paper proposes a technique for automatically estimating accent phrase boundaries for text-to-speech synthesis syste... [more] |
SP2013-89 pp.97-102 |
MVE, IE, WIT, IMQ, CQ (Joint) [detail] |
2013-03-12 11:10 |
Fukuoka |
Fukuoka Institute of Technology |
Preliminary Study of Captioning Method Considering User Characteristics Yosuke Shirai, Mai Yanagimura, Takahiro Shinozaki, Yasuo Horiuchi, Shingo Kuroiwa (Chiba Univ.), Toshiki Endo, Eiji Utsunomiya (KDDI Labs) IMQ2012-78 IE2012-182 MVE2012-139 WIT2012-88 |
[more] |
IMQ2012-78 IE2012-182 MVE2012-139 WIT2012-88 pp.245-250 |
MVE, IE, WIT, IMQ, CQ (Joint) [detail] |
2013-03-12 11:35 |
Fukuoka |
Fukuoka Institute of Technology |
Sign Language Recognition Using Kinect and Particle Filter Yoshihiro Furuya, Daisuke Imamura, Yasuo Horiuchi, Kazuhiko Kawamoto, Takahiro Shinozaki, Shingo Kuroiwa (Chiba Univ.) IMQ2012-79 IE2012-183 MVE2012-140 WIT2012-89 |
In this paper, we will discuss a sign language recognition method using a Particle Filter and Kinect. We have previously... [more] |
IMQ2012-79 IE2012-183 MVE2012-140 WIT2012-89 pp.251-256 |
WIT |
2013-02-02 15:25 |
Aichi |
Nagoya Institute of Technology |
Eye motion input based speech synthesis interface for communication aids Fuming Fang, Takahiro Shinozaki, Yasuo Horiuchi, Shingo Kuroiwa (Chiba Univ), Sadaoki Furui (Tokyo Tech), Toshimitsu Musha (BFL) WIT2012-38 |
In order to provide an efficient means of communication for those who cannot move muscles of their whole body except eye... [more] |
WIT2012-38 pp.29-34 |
HCGSYMPO (2nd) |
2012-12-10 - 2012-12-12 |
Kumamoto |
Kumamoto-Shintoshin-plaza |
Impression Classification Using Speech Segments in the Utterance Masahiro Uchida, Takahiro Shinozaki, Yasuo Horiuchi, Shingo Kuroiwa (Chiba Univ.) |
Impression that a speaker gives to a receiver plays an important role in human speech conversation. However, the... [more] |
|
PRMU, SP |
2012-02-09 15:15 |
Miyagi |
|
Electrooculogram recognition using hidden Markov model Fuming Fang, Takahiro Shinozaki, Yasuo Horiuchi, Shingo Kuroiwa (Chiba Univ), Sadaoki Furui (Tokyo Tech), Toshimitsu Musha (BFL) PRMU2011-202 SP2011-117 |
In order to provide an efficient means of communication for those who cannot move muscles of their whole body except eye... [more] |
PRMU2011-202 SP2011-117 pp.97-102 |
PRMU, SP |
2012-02-09 16:30 |
Miyagi |
|
[Poster Presentation]
An Analysis of Foreign Student Utterances for Computerized Japanese Speaking Test Web System Risa Kurihara (Wakayama Univ.), Kenkichi Ishizuka (Univ. of Tsukuba), Ryuichi Nisimura (Wakayama Univ.), Takahiro Shinozaki (Chiba Univ.), Takeshi Yamada, Shingo Imai (Univ. of Tsukuba) PRMU2011-219 SP2011-134 |
We are developing a computerized Japanese speaking test system, which are implemented using web protocols of the Interne... [more] |
PRMU2011-219 SP2011-134 pp.141-142 |
PRMU, SP |
2012-02-10 10:30 |
Miyagi |
|
HMM Sign Language Recognition Using Kinect and Particle Filter Yosuke Nishimura, Daisuke Imamura, Yasuo Horiuchi, Kazuhiko Kawamoto, Takahiro Shinozaki, Shingo Kuroiwa (Chiba Univ.) PRMU2011-223 SP2011-138 |
In this paper, we will introduce a sign language recognition method using Kinect which is a motion sensing input device ... [more] |
PRMU2011-223 SP2011-138 pp.161-166 |
WIT |
2012-01-27 13:30 |
Aichi |
Nagoya Institute of Technology |
Comparative Analysis of Turn-taking between Japanese Sign Language and Japanese Speech Yumi Murase, Yasuo Horiuchi, Takahiro Shinozaki, Shingo Kuroiwa (Chiba Univ.) WIT2011-52 |
In this research, we analyzed turn-taking phenomena in spontaneous dialogue comparing Japanese Sign Language (JSL) and J... [more] |
WIT2011-52 pp.7-12 |
SP |
2010-06-17 16:45 |
Fukuoka |
Kyushu University |
Analysis on features and estimators for speech-based age estimation Toshiya Wada, Takahiro Shinozaki, Sadaoki Furui (Tokyo Inst. of Tech.) SP2010-27 |
Age estimation systems can be implemented using either a discrete classifier such as
support vector machine (SVM) or... [more] |
SP2010-27 pp.31-36 |