Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Streaming End-to-End speech recognition using a CTC decoder with substituted linguistic information Tatsunari Takagi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka, Yukoh Wakabayashi (TUT) SP2023-12 |
Speech recognition technology has been employed in various fields due to the enhancement of speech recognition model acc... [more] |
SP2023-12 pp.60-64 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Domain adaptation of speech recognition models based on self-supervised learning using target domain speech Takahiro Kinouchi (TUT), Atsunori Ogawa (NTT), Yuko Wakabayashi, Norihide Kitaoka (TUT) SP2023-19 |
In this study, we propose a domain adaptation method using only speech data in the target domain without using transcrib... [more] |
SP2023-19 pp.91-96 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Automatic speech recognition model simultaneously recognizes linguistic information and verbal/non-verbal phenomena Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka (TUT) SP2023-22 |
Although speech recognition technology has advanced in recent years, most of them recognize only linguistic information ... [more] |
SP2023-22 pp.109-113 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 15:05 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Construction of Language Model for Low-resource Domain Speech Recognition Based on Sentence Generation Ryo Maejima, Daiki Mori, Youkoh Wakabayashi, Norihide Kitaoka (TUT) |
[more] |
|
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 15:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Automatic Speech Recognition model using data with verbal and non-verbal information tag Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka (TUT) |
[more] |
|
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-11-29 14:35 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Density Ratio Approach-based multiple Encoder-Decoder ASR model integration Keigo Hojo, Daiki Mori, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) NLC2022-10 SP2022-30 |
One of the methods to improve the performance of Encoder--Decoder speech recognition is the integration of an ASR models... [more] |
NLC2022-10 SP2022-30 pp.5-9 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-12-01 15:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
ASR model adaptation to target domain with large-scale audio data without transcription Takahiro Kinouchi, Daiki Mori (TUT), Ogawa Atsunori (NTT), Norihide Kitaoka (TUT) NLC2022-18 SP2022-38 |
Nowadays, speech recognition is used in various services and businesses thanks to the advent of high-performance models ... [more] |
NLC2022-18 SP2022-38 pp.50-53 |
WIT, SP, IPSJ-SLP [detail] |
2020-10-22 14:10 |
Online |
Online |
Early Dementia Detection based on Speech and Language Information Maina Umezawa, Yurie Iribe (Aichi Prefectural Univ.), Norihide Kitaoka (Toyohashi Tech) SP2020-12 WIT2020-13 |
In recent years, research has been conducted to detect people with mild dementia from dialogue voices of the elderly. Bu... [more] |
SP2020-12 WIT2020-13 pp.21-26 |
PRMU, SP |
2018-06-29 11:30 |
Nagano |
|
Mapping Acoustic Vector Sequence to Document Vector Based on RNN Ryota Nishimura, Miho Higaki, Norihide Kitaoka (Tokushima Univ.) PRMU2018-32 SP2018-12 |
In this research, we propose a method of searching between different media (cross media mapping) using deep learning (Ma... [more] |
PRMU2018-32 SP2018-12 pp.59-64 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-21 12:50 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Poster Presentation]
Selecting Response from Conversational Spoken Dialogue System Based on Distributed Representation of User Utterances Kengo Ohta (NIT, Anan College), Ryota Nishimura, Norihide Kitaoka (Tokushima Univ.) SP2017-55 |
[more] |
SP2017-55 pp.1-5 |
WIT, SP |
2017-10-19 14:20 |
Fukuoka |
Tobata Library of Kyutech (Kitakyushu) |
User adaptation of examples for example-based reminiscence therapy spoken dialog system using word embedding Eichi Seto, Ryota Nishimura, Norihide Kitaoka (Tokushima Univ.) SP2017-38 WIT2017-34 |
We are developing a spoken dialog system for reminiscence therapy. We propose an example-based dialog system featuring a... [more] |
SP2017-38 WIT2017-34 pp.23-28 |
SP |
2016-08-24 13:00 |
Kyoto |
ACCMS, Kyoto Univ. |
Adaptation Methods for Daily Activity Recognition Based on Deep Neural Network Tomoki Hayashi (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Tomoki Toda, Kazuya Takeda (Nagoya Univ.) SP2016-27 |
Our objective is to build a monitoring system which enables elderly people to live actively, and the key technology to a... [more] |
SP2016-27 pp.1-6 |
SP |
2015-10-16 11:15 |
Hyogo |
Kobe Univ. |
Multi-modal speech recognition using deep bottleneck features Satoshi Tamura (Gifu Univ), Hiroshi Ninomiya (Nagoya Univ), Norihide Kitaoka (Tokushima Univ), Shin Osuga (Aisin Seiki), Yurie Iribe (Aichi Prefectural Univ), Kazuya Takeda (Nagoya Univ), Satoru Hayamizu (Gifu Univ) SP2015-69 |
In this paper, we propose a novel multi-modal speech recognition method which uses speech and lip images, employing Deep... [more] |
SP2015-69 pp.57-62 |
SP |
2015-08-21 10:50 |
Iwate |
Iwate Prefectural Univ. |
Evaluation of speaker engagement using turn-taking behavior entropy Bohan Chen (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Mihoko Otake (Chiba Univ.), Kazuya Takeda (Nagoya Univ.) SP2015-52 |
We introduce a framework to evaluate conversational engagement using entropy of statistical turn-taking model. We demons... [more] |
SP2015-52 pp.13-17 |
NLC, IPSJ-NL, SP, IPSJ-SLP, JSAI-SLUD (Joint) [detail] |
2014-12-16 13:30 |
Kanagawa |
Tokyo Institute of Technology (Suzukakedai Campus) |
[Poster Presentation]
relationship between speakers' characteristics and the information transmission quality in Dialog Bohan Chen (Nagoya Univ.), Norihide Kitaoka (Tokushima Univ.), Kazuya Takeda (Nagoya Univ.) SP2014-124 |
We investigate the correlation between speakers’ characteristics similarity and their information transmission efficienc... [more] |
SP2014-124 pp.147-152 |
SP, WIT, ASJ-H |
2014-06-20 10:25 |
Ishikawa |
|
Accurate Recognition of Overlapped Speech
-- High Speed Speech Separation by Spectral Subtraction and Acoustic Model Training using Separated Speeches -- Yuto Dekiura, Tetsuya Matsumoto, Yoshinori Takeuchi, Hiroaki Kudo, Noboru Ohnishi, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) SP2014-56 WIT2014-11 |
The purpose of this study is to recognize overlapped speech more accurately. In order to achieve this, it is necessary t... [more] |
SP2014-56 WIT2014-11 pp.57-62 |
SP |
2013-03-01 11:30 |
Aichi |
Daido University |
Classification of speech under stress using physical features based on two-mass model Xiao Yao (Nagoya Univ.), Takatoshi Jitsuhiro (Aichi Univ. of Tech./Nagoya Univ.), Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) SP2012-128 |
We propose the classification methods of speech under stress based on a physical model, which characterizes the vocal fo... [more] |
SP2012-128 pp.47-52 |
SP, IPSJ-SLP |
2012-12-21 14:40 |
Tokyo |
TITECH(Ookayama) |
Reduction of cross spectrum for feature-domain sound source separation Atsushi Ando (Nagoya Univ.), Kenta Niwa (NTT), Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) SP2012-93 |
Speech source separation is utilized for recognition of simultaneous speech. Conventional source separation methods, esp... [more] |
SP2012-93 pp.107-112 |
EA |
2012-12-13 14:10 |
Tokyo |
National Institute of Informatics |
Reducing Computational Complexity of FDICA Source Separation Based on Source Number Evaluation Yusuke Mizuno (Nagoya Univ.), Kazunobu Kondo (Yamaha), Takanori Nishino (Mie Univ.), Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) EA2012-110 |
A faster computation method for performing frequency domain independent component analysis (FDICA) is proposed. Source s... [more] |
EA2012-110 pp.5-10 |
ITS, IE, ITE-AIT, ITE-HI, ITE-ME [detail] |
2012-02-21 09:40 |
Hokkaido |
Hokkaido Univ. |
Driving data collection using in-vehicle network and analysis of driving behavior on different types of vehicles Hiroaki Ishikawa, Chiyomi Miyajima, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) ITS2011-50 IE2011-126 |
A portable driving data recording system is developed using a smartphone with the Android OS and an in-vehicle network f... [more] |
ITS2011-50 IE2011-126 pp.257-262 |