Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SIP, SP, IPSJ-SLP [detail] |
2025-03-03 11:40 |
Okinawa |
(Okinawa) |
Traffic Volume and Speed Estimation Using Pre-trained Audio Model Tomohiro Takahashi (TMU), Natsuki Ueno (TMU/Kumamoto Univ.), Yuma Kinoshita (Tokai Univ.), Yukoh Wakabayashi (TUT), Nobutaka Ono (TMU), Makiho Sukekawa, Seishi Fukuma, Hiroshi Nakagawa (NEE) |
[more] |
|
EA, SIP, SP, IPSJ-SLP [detail] |
2025-03-04 11:05 |
Okinawa |
(Okinawa) |
[Poster Presentation]
Construction of a ASR model based on self-supervised learning using intermediate layer outputs Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) |
[more] |
|
EA, SIP, SP, IPSJ-SLP [detail] |
2025-03-04 11:05 |
Okinawa |
(Okinawa) |
[Poster Presentation]
Improvement and Evaluation of Utterance End Time Estimation Method for Spoken Dialog Systems Takanori Kanai, Yukoh Wakabayashi (TUT), Ryota Nishimura (Tokushima Univ.), Norihide Kitaoka (TUT) |
(To be available after the conference date) [more] |
|
EA, SIP, SP, IPSJ-SLP [detail] |
2025-03-04 11:05 |
Okinawa |
(Okinawa) |
[Poster Presentation]
Improvement of Speech Recognition Performance for Elderly Speech by Alternating Learning of Acoustic and Linguistic information Kaito Takahashi, Yukoh Wakabayashi (TUT), Kengo Ohta (NIT, Anan College), Norihide Kitaoka (TUT) |
(To be available after the conference date) [more] |
|
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2024-06-15 13:50 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Improving CTC-based ASR model by weighting encoder layers using attention mechanisms Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) SP2024-9 |
[more] |
SP2024-9 pp.43-48 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 10:10 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Analysis of Overlapped Utterances in Everyday Conversation and Source Separation by Online Independent Vector Analysis for Asynchronous Distributed Recordings Haruki Nammoku, Taishi Nakashima, Kouei Yamaoka, Yukoh Wakabayashi, Nobutaka Ono (TMU) EA2023-67 SIP2023-114 SP2023-49 |
In this study, we investigate the effects of overlapped utterances on transcription in everyday conversation and propose... [more] |
EA2023-67 SIP2023-114 SP2023-49 pp.37-42 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Domain adaptation of speech recognition model based on multilingual SSL model with only nonparallel corpus. Takahiro Kinouchi (TUT), Atsunori Ogawa (NTT), Yukoh Wakabayashi (TUT), Kengo Ohta (NITA), Norihide Kitaoka (TUT) EA2023-100 SIP2023-147 SP2023-82 |
Automatic speech recognition (ASR) models are used in various services and businesses, and each domain’s recognition acc... [more] |
EA2023-100 SIP2023-147 SP2023-82 pp.232-237 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Improving speech recognition system consisting of multiple speech recognition models Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) EA2023-101 SIP2023-148 SP2023-83 |
[more] |
EA2023-101 SIP2023-148 SP2023-83 pp.238-243 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Evaluation of Automatic Speech Recognition for Deaf and Hard-of-Hearing People by Speaker Adaptation. Kaito Takahashi, Takahiro Kinouchi, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Akio Kobayashi (Yamato Univ.), Norihide Kitaoka (TUT) EA2023-102 SIP2023-149 SP2023-84 |
Communication between normal-hearing people and the deaf is generally used sign language, written communication, and spe... [more] |
EA2023-102 SIP2023-149 SP2023-84 pp.244-249 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Intermediate speaker speech synthesis between two speakers using x-vector speaker space Sota Hosoi, Takahiro Kinouchi, Yukoh Wakabayashi, Norihide Kitaoka (TUT) EA2023-103 SIP2023-150 SP2023-85 |
Recent advancements in speech synthesis technologies have enabled the synthesis of speeches of speakers not in the train... [more] |
EA2023-103 SIP2023-150 SP2023-85 pp.250-255 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Substitution of Implicit Linguistic Information in Beam Search Decoding Using CTC-based Speech Recognition Models Tatsunari Takagi, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) EA2023-106 SIP2023-153 SP2023-88 |
The rise of neural networks in the field of automatic speech recognition has notably improved the accuracy of speech rec... [more] |
EA2023-106 SIP2023-153 SP2023-88 pp.268-273 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
Streaming End-to-End speech recognition using a CTC decoder with substituted linguistic information Tatsunari Takagi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka, Yukoh Wakabayashi (TUT) SP2023-12 |
Speech recognition technology has been employed in various fields due to the enhancement of speech recognition model acc... [more] |
SP2023-12 pp.60-64 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
Automatic speech recognition model simultaneously recognizes linguistic information and verbal/non-verbal phenomena Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka (TUT) SP2023-22 |
Although speech recognition technology has advanced in recent years, most of them recognize only linguistic information ... [more] |
SP2023-22 pp.109-113 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 15:10 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Automatic Speech Recognition model using data with verbal and non-verbal information tag Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka (TUT) |
[more] |
|
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-11-29 14:35 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
Density Ratio Approach-based multiple Encoder-Decoder ASR model integration Keigo Hojo, Daiki Mori, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) NLC2022-10 SP2022-30 |
One of the methods to improve the performance of Encoder--Decoder speech recognition is the integration of an ASR models... [more] |
NLC2022-10 SP2022-30 pp.5-9 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 15:00 |
Online |
Online (Online) |
Source Separation for Asynchronous Recordings of Conversation Using Time-Frequency Masking and Independent Vector Analysis Haruki Nammoku, Kouei Yamaoka, Yukoh Wakabayashi, Nobutaka Ono (TMU) SP2021-22 |
In this study, we investigate the source separation for conversational speech recorded by multiple voice recorders that ... [more] |
SP2021-22 pp.101-106 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) (Nagasaki) |
[Poster Presentation]
Study of acoustic scene analysis using sound-to-light conversion devices "blinky'' Yuto Oishi, Jin-cheng Zhang, Yutaka Yamamoto, Fumikazu Saze, Hiroyuki Moriyama, Robin Scheibler, Yukoh Wakabayashi, Nobutaka Ono (TMU) EA2018-142 SIP2018-148 SP2018-104 |
This paper presents a preliminary study of acoustic scene analysis with sound-to-light conversion devices ``Blinky'', wh... [more] |
EA2018-142 SIP2018-148 SP2018-104 pp.251-256 |
EA, ASJ-H, EMM, IPSJ-MUS [detail] |
2018-11-22 10:00 |
Ishikawa |
Hotel Koshuen (Ishikawa) |
[Invited Talk]
Phase reconstruction for speech enhancement and its effect on array processing Yukoh Wakabayashi (TMU) EA2018-80 EMM2018-80 |
Phase spectrum processing for speech enhancement, so called ``phase reconstruction,'' has been particularly received att... [more] |
EA2018-80 EMM2018-80 pp.163-168 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
(Okinawa) |
[Poster Presentation]
Performance evaluation of unknown sound clustering for indoor-environmental sound classification based on self-generated acoustic model Sakiko Mishima, Yukoh Wakabayashi, Takahiro Fukumori, Keisuke Imoto, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2017-152 SIP2017-161 SP2017-135 |
Indoor-environmental sound classification is useful for surveillance systems which monitor the situations in the dark an... [more] |
EA2017-152 SIP2017-161 SP2017-135 pp.277-280 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
(Okinawa) |
[Poster Presentation]
Phase-aware spectral gain estimation using phase reconstruction based on phase distortion averaging Yukoh Wakabayashi, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2017-158 SIP2017-167 SP2017-141 |
[more] |
EA2017-158 SIP2017-167 SP2017-141 pp.305-310 |