Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-12-01 15:20 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
Domain and language adaptation of large-scale pretrained model for speech recognition of low-resource language Kak Soky (Kyoto University), Sheng Li (NICT), Chenhui Chu, Tatsuya Kawahara (Kyoto University) NLC2022-17 SP2022-37 |
The self-supervised learning (SSL) models are effective for automatic speech recognition (ASR). Due to the huge paramete... [more] |
NLC2022-17 SP2022-37 pp.45-49 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 17:35 |
Online |
Online (Online) |
[Short Paper]
Comparison of End-to-End Models for Joint Speaker and Speech Recognition Kak Soky (Kyoto Univ.), Sheng Li (NICT), Masato Mimura, Chenhui Chu, Tatsuya Kawahara (Kyoto Univ.) EA2020-78 SIP2020-109 SP2020-43 |
In this paper, we investigate the effectiveness of using speaker information on the performance of speaker-imbalanced au... [more] |
EA2020-78 SIP2020-109 SP2020-43 pp.109-113 |
SP, EA, SIP |
2020-03-03 14:25 |
Okinawa |
Okinawa Industry Support Center (Okinawa) (Cancelled but technical report was issued) |
EA2019-163 SIP2019-165 SP2019-112 |
(To be available after the conference date) [more] |
EA2019-163 SIP2019-165 SP2019-112 pp.361-366 |
MVE |
2019-08-30 09:25 |
Aichi |
(Aichi) |
Improvement on Television Advertisement Analysis by Using Additional Text Information Li Tao, Shunsuke Nakamura, Xueting Wang (UTokyo), Tatsuya Kawahara (Video Research), Toshihiko Yamasaki (UTokyo) MVE2019-15 |
[more] |
MVE2019-15 pp.55-59 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2018-12-10 13:15 |
Tokyo |
Waseda Univ. Nishiwaseda Campus (Tokyo) |
[Invited Talk]
Review of Automatic Speech Recognition Methodology
-- Outlook of Acoustic-to-Word Model -- Tatsuya Kawahara (Kyoto Univ.) SP2018-48 |
The methodology of speech recognition has been changing due to the introduction of deep learning, in particular end-to-e... [more] |
SP2018-48 pp.25-30 |
IBISML |
2018-11-05 15:10 |
Hokkaido |
Hokkaido Citizens Activites Center (Kaderu 2.7) (Hokkaido) |
IBISML2018-49 |
(To be available after the conference date) [more] |
IBISML2018-49 pp.37-44 |
MVE |
2018-09-06 15:40 |
Osaka |
(Osaka) |
Prediction of Television Advertisement Effects Using Multimodal Convolutional Neural Networks Shunsuke Nakamura (UTokyo), Tatsuya Kawahara (VR), Toshihiko Yamasaki (UTokyo) MVE2018-18 |
[more] |
MVE2018-18 pp.31-35 |
SP |
2018-08-27 11:35 |
Kyoto |
Kyoto Univ. (Kyoto) |
SP2018-23 |
(To be available after the conference date) [more] |
SP2018-23 pp.7-8 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 11:15 |
Okinawa |
(Okinawa) |
Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara (Kyoto Univ.) EA2017-107 SIP2017-116 SP2017-90 |
(To be available after the conference date) [more] |
EA2017-107 SIP2017-116 SP2017-90 pp.33-38 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
(Okinawa) |
EA2017-144 SIP2017-153 SP2017-127 |
(To be available after the conference date) [more] |
EA2017-144 SIP2017-153 SP2017-127 pp.235-240 |
MVE |
2017-09-21 15:45 |
Chiba |
Chiba Univ. (Chiba) |
Predictions of Effectiveness of Television Advertising with Convolutional Neural Networks Shunsuke Nakamura (Univ. of Tokyo), Tatsuya Kawahara (VideoResearch), Toshihiko Yamasaki (Univ. of Tokyo) MVE2017-18 |
Predicting the recognition rate of television advertising is a critical issue for advertisers, but factors that contribu... [more] |
MVE2017-18 pp.21-24 |
SP |
2017-08-30 10:00 |
Kyoto |
Kyoto Univ. (Kyoto) |
Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara (Kyoto Univ.) SP2017-20 |
[more] |
SP2017-20 pp.1-6 |
SP |
2017-08-30 11:00 |
Kyoto |
Kyoto Univ. (Kyoto) |
[Poster Presentation]
Semi-blind speech separation and enhancement using recurrent neural network Masaya Wake, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara (Kyoto Univ.) SP2017-22 |
This paper describes a semi-blind speech enhancement method using a neural network.
In a human-robot speech interaction... [more] |
SP2017-22 pp.13-18 |
SP |
2017-08-30 11:00 |
Kyoto |
Kyoto Univ. (Kyoto) |
[Poster Presentation]
Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara (Kyoto Univ.) SP2017-23 |
(To be available after the conference date) [more] |
SP2017-23 pp.19-24 |
IBISML |
2016-11-17 14:00 |
Kyoto |
Kyoto Univ. (Kyoto) |
[Poster Presentation]
Kousuke Itakura, Yoshiaki Bando, Eita Nakamura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara (Kyoto Univ.) IBISML2016-95 |
[more] |
IBISML2016-95 pp.353-359 |
SP |
2016-08-24 16:15 |
Kyoto |
ACCMS, Kyoto Univ. (Kyoto) |
[Poster Presentation]
Kousuke Itakura, Yoshiaki Bando, Eita Nakamura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara (Kyoto Univ.) SP2016-31 |
[more] |
SP2016-31 pp.25-28 |
SP |
2016-08-25 10:45 |
Kyoto |
ACCMS, Kyoto Univ. (Kyoto) |
Yoshiaki Bando, Katsutoshi Itoyama (Kyoto Univ.), Masashi Konyo, Satoshi Tadokoro (Tohoku Univ.), Kazuhiro Nakadai (Tokyo Tech/HRI-JP), Kazuyoshi Yoshii, Tatsuya Kawahara (Kyoto Univ.), Hiroshi G. Okuno (Waseda Univ.) SP2016-36 |
[more] |
SP2016-36 pp.47-52 |
SP |
2016-08-25 13:10 |
Kyoto |
ACCMS, Kyoto Univ. (Kyoto) |
Pronunciation Error Detection using DNN Articulatory Model based on Transfer Learning Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji (Kyoto Univ.) SP2016-39 |
Aiming at detecting pronunciation errors produced by second language learners and providing corrective feedbacks related... [more] |
SP2016-39 pp.65-70 |
SP |
2016-08-25 13:35 |
Kyoto |
ACCMS, Kyoto Univ. (Kyoto) |
Diversity-driven Semi-supervised Ensemble DNN Acoustic Model Training Sheng Li (Kyoto Univ.), Xugang Lu (NICT), Shinsuke Sakai, Tatsuya Kawahara (Kyoto Univ.) SP2016-40 |
We focus on effective training DNN (Deep Neural Network) acoustic models for Chinese spoken lectures with only limited l... [more] |
SP2016-40 pp.71-76 |
PRMU |
2015-12-22 13:10 |
Nagano |
(Nagano) |
[Special Talk]
State of the Speech Recognition Technology Tatsuya Kawahara (Kyoto Univ.) PRMU2015-111 |
Speech recognition technology has made a significant progress over the past decade and has been used in speech transcrip... [more] |
PRMU2015-111 pp.111-116 |