Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-11-29 10:00 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Link Prediction from Text Content by NLP Graph Embedding
-- A Study on Chinese Journal Articles -- Tzu-Ying Yang, Hsuan Lei Shao, Chih-Chuan Fan, Wei-Hsin Wang (NTNU) NLC2022-9 SP2022-29 |
Abstract This paper is an extended research of the project “The Knowledge Database/ Graph of China-studies”. The main re... [more] |
NLC2022-9 SP2022-29 pp.1-4 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-11-29 14:35 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Density Ratio Approach-based multiple Encoder-Decoder ASR model integration Keigo Hojo, Daiki Mori, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) NLC2022-10 SP2022-30 |
One of the methods to improve the performance of Encoder--Decoder speech recognition is the integration of an ASR models... [more] |
NLC2022-10 SP2022-30 pp.5-9 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-11-30 09:30 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Detecting Persona Information in Chat using Subject Recovery with Machine Translation Shinji Muraji, Toshihiko Ito, Kenji Araki (Hokkaido Univ.) NLC2022-11 SP2022-31 |
The state-of-the-art systems for chatting have a major problem of not being able to predict and utilize the user’s pers... [more] |
NLC2022-11 SP2022-31 pp.10-15 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-11-30 10:00 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Handling time information for matching broadcast metadata with user-generated text Takeshi S. Kobayakawa (NHK), Takeshi Sakaki, Masanao Ochi, Sakata Ichiro (Univ. Tokyo) NLC2022-12 SP2022-32 |
[more] |
NLC2022-12 SP2022-32 pp.16-20 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-11-30 10:30 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Dialogue disfluency detection using context Hiroto Nakashima, Kazutaka Shimada (KIT) NLC2022-13 SP2022-33 |
Recent automatic speech recognition (ASR) techniques have been improved by a large amount of training data and machine l... [more] |
NLC2022-13 SP2022-33 pp.21-26 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-11-30 15:30 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Semi-supervised joint training of text to speech and automatic speech recognition using unpaired text data Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura (NTT) NLC2022-14 SP2022-34 |
This paper presents a novel joint training of text to speech (TTS) and automatic speech recognition (ASR) with small amo... [more] |
NLC2022-14 SP2022-34 pp.27-32 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-11-30 16:00 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Representing how it is said with what is said
-- Creation and analysis of an English corpus of focused speech and text reflecting paralinguistically expressed implications -- Naoaki Suzuki, Satoshi Nakamura (NAIST) NLC2022-15 SP2022-35 |
In speech communication, people convey intentions through what is said (linguistic information) and how it is said (para... [more] |
NLC2022-15 SP2022-35 pp.33-38 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-12-01 14:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
A Japanese Automatic Speech Recognition System on the Next-Gen Kaldi Framework Wen Shen Teo, Yasuhiro Minami (UEC) NLC2022-16 SP2022-36 |
2021 saw the introduction of the cutting-edge successor to the Kaldi speech processing toolkit, known as Next-Gen Kaldi.... [more] |
NLC2022-16 SP2022-36 pp.39-44 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-12-01 15:20 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Domain and language adaptation of large-scale pretrained model for speech recognition of low-resource language Kak Soky (Kyoto University), Sheng Li (NICT), Chenhui Chu, Tatsuya Kawahara (Kyoto University) NLC2022-17 SP2022-37 |
The self-supervised learning (SSL) models are effective for automatic speech recognition (ASR). Due to the huge paramete... [more] |
NLC2022-17 SP2022-37 pp.45-49 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-12-01 15:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
ASR model adaptation to target domain with large-scale audio data without transcription Takahiro Kinouchi, Daiki Mori (TUT), Ogawa Atsunori (NTT), Norihide Kitaoka (TUT) NLC2022-18 SP2022-38 |
Nowadays, speech recognition is used in various services and businesses thanks to the advent of high-performance models ... [more] |
NLC2022-18 SP2022-38 pp.50-53 |