SP, IPSJ-MUS, IPSJ-SLP 2023-06-23
(Primary: On-site, Secondary: Online)
Streaming End-to-End speech recognition using a CTC decoder with substituted linguistic information
Tatsunari Takagi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka, Yukoh Wakabayashi (TUT) SP2023-12
Speech recognition technology has been employed in various fields due to the enhancement of speech recognition model acc...
SP, IPSJ-MUS, IPSJ-SLP 2023-06-24
(Primary: On-site, Secondary: Online)
Domain adaptation of speech recognition models based on self-supervised learning using target domain speech
Takahiro Kinouchi (TUT), Atsunori Ogawa (NTT), Yukoh Wakabayashi, Norihide Kitaoka (TUT) SP2023-19
In this study, we propose a domain adaptation method using only speech data in the target domain without using transcrib...
SP, IPSJ-MUS, IPSJ-SLP 2023-06-24
(Primary: On-site, Secondary: Online)
Automatic speech recognition model simultaneously recognizes linguistic information and verbal/non-verbal phenomena
Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka (TUT) SP2023-22
Although speech recognition technology has advanced in recent years, most systems recognize only linguistic information ...
SP, IPSJ-SLP, EA, SIP 2023-03-01
(Primary: On-site, Secondary: Online)
Construction of Language Model for Low-resource Domain Speech Recognition Based on Sentence Generation
Ryo Maejima, Daiki Mori, Yukoh Wakabayashi, Norihide Kitaoka (TUT)
SP, IPSJ-SLP, EA, SIP 2023-03-01
(Primary: On-site, Secondary: Online)
Automatic Speech Recognition model using data with verbal and non-verbal information tag
Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka (TUT)
NLC, IPSJ-NL, SP, IPSJ-SLP 2022-11-29
(Primary: On-site, Secondary: Online)
Density Ratio Approach-based multiple Encoder-Decoder ASR model integration
Keigo Hojo, Daiki Mori, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) NLC2022-10 SP2022-30
One of the methods to improve the performance of Encoder-Decoder speech recognition is the integration of an ASR models...
NLC, IPSJ-NL, SP, IPSJ-SLP 2022-12-01
(Primary: On-site, Secondary: Online)
ASR model adaptation to target domain with large-scale audio data without transcription
Takahiro Kinouchi, Daiki Mori (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) NLC2022-18 SP2022-38
Nowadays, speech recognition is used in various services and businesses thanks to the advent of high-performance models ...
WIT, SP, IPSJ-SLP 2020-10-22
Online Online Early Dementia Detection based on Speech and Language Information
Maina Umezawa, Yurie Iribe (Aichi Prefectural Univ.), Norihide Kitaoka (Toyohashi Tech) SP2020-12 WIT2020-13
In recent years, research has been conducted to detect people with mild dementia from dialogue voices of the elderly. Bu...
PRMU, SP 2018-06-29
Nagano   Mapping Acoustic Vector Sequence to Document Vector Based on RNN
Ryota Nishimura, Miho Higaki, Norihide Kitaoka (Tokushima Univ.) PRMU2018-32 SP2018-12
In this research, we propose a method of searching between different media (cross media mapping) using deep learning (Ma...
(Joint)
Tokyo Waseda Univ. Green Computing Systems Research Organization [Poster Presentation] Selecting Response from Conversational Spoken Dialogue System Based on Distributed Representation of User Utterances
Kengo Ohta (NIT, Anan College), Ryota Nishimura, Norihide Kitaoka (Tokushima Univ.) SP2017-55
WIT, SP 2017-10-19
Fukuoka Tobata Library of Kyutech (Kitakyushu) User adaptation of examples for example-based reminiscence therapy spoken dialog system using word embedding
Eichi Seto, Ryota Nishimura, Norihide Kitaoka (Tokushima Univ.) SP2017-38 WIT2017-34
We are developing a spoken dialog system for reminiscence therapy. We propose an example-based dialog system featuring a...
SP 2016-08-24
Kyoto ACCMS, Kyoto Univ. Adaptation Methods for Daily Activity Recognition Based on Deep Neural Network
Tomoki Hayashi (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2016-27
Our objective is to build a monitoring system which enables elderly people to live actively, and the key technology to a...
SP 2015-10-16
Hyogo Kobe Univ. Multi-modal speech recognition using deep bottleneck features
Satoshi Tamura (Gifu Univ.), Hiroshi Ninomiya (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Shin Osuga (Aisin Seiki), Yurie Iribe (Aichi Prefectural Univ.), Kazuya Takeda (Nagoya Univ.), Satoru Hayamizu (Gifu Univ.) SP2015-69
In this paper, we propose a novel multi-modal speech recognition method which uses speech and lip images, employing Deep...
SP 2015-08-21
Iwate Iwate Prefectural Univ. Evaluation of speaker engagement using turn-taking behavior entropy
Bohan Chen (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Mihoko Otake (Chiba Univ.), Kazuya Takeda (Nagoya Univ.) SP2015-52
We introduce a framework to evaluate conversational engagement using the entropy of a statistical turn-taking model. We demons...
(Joint)
Kanagawa Tokyo Institute of Technology (Suzukakedai Campus) [Poster Presentation] Relationship between speakers' characteristics and the information transmission quality in Dialog
Bohan Chen (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Kazuya Takeda (Nagoya Univ.) SP2014-124
We investigate the correlation between speakers’ characteristics similarity and their information transmission efficienc...
SP, WIT, ASJ-H 2014-06-20
Ishikawa   Accurate Recognition of Overlapped Speech -- High Speed Speech Separation by Spectral Subtraction and Acoustic Model Training using Separated Speeches --
Yuto Dekiura, Tetsuya Matsumoto, Yoshinori Takeuchi, Hiroaki Kudo, Noboru Ohnishi, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) SP2014-56 WIT2014-11
The purpose of this study is to recognize overlapped speech more accurately. In order to achieve this, it is necessary t...
SP 2013-03-01
Aichi Daido University Classification of speech under stress using physical features based on two-mass model
Xiao Yao (Nagoya Univ.), Takatoshi Jitsuhiro (Aichi Univ. of Tech./Nagoya Univ.), Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) SP2012-128
We propose classification methods for speech under stress based on a physical model, which characterizes the vocal fo...
SP, IPSJ-SLP 2012-12-21
Tokyo TITECH(Ookayama) Reduction of cross spectrum for feature-domain sound source separation
Atsushi Ando (Nagoya Univ.), Kenta Niwa (NTT), Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) SP2012-93
Speech source separation is utilized for recognition of simultaneous speech. Conventional source separation methods, esp...
EA 2012-12-13
Tokyo National Institute of Informatics Reducing Computational Complexity of FDICA Source Separation Based on Source Number Evaluation
Yusuke Mizuno (Nagoya Univ.), Kazunobu Kondo (Yamaha), Takanori Nishino (Mie Univ.), Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) EA2012-110
A faster computation method for performing frequency domain independent component analysis (FDICA) is proposed. Source s...
ITS, IE, ITE-AIT, ITE-HI, ITE-ME 2012-02-21
Hokkaido Hokkaido Univ. Driving data collection using in-vehicle network and analysis of driving behavior on different types of vehicles
Hiroaki Ishikawa, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) ITS2011-50 IE2011-126
A portable driving data recording system is developed using a smartphone with the Android OS and an in-vehicle network f...
