IEICE Technical Committee Submission System
All Technical Committee Conferences  (Searched in: Recent 10 Years)

Search Results: Conference Papers
 Conference Papers (Available on Advance Programs)  (Sort by: Date Descending)
 Results 1 - 20 of 24  /  [Next]  
Committee Date Time Place Paper Title / Authors Abstract Paper #
EA, SIP, SP, IPSJ-SLP [detail] 2025-03-04
11:05
Okinawa (Okinawa) [Poster Presentation] Construction of an ASR model based on self-supervised learning using intermediate layer outputs
Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT)
EA, SIP, SP, IPSJ-SLP [detail] 2025-03-04
11:05
Okinawa (Okinawa) [Poster Presentation] Improvement and Evaluation of Utterance End Time Estimation Method for Spoken Dialog Systems
Takanori Kanai, Yukoh Wakabayashi (TUT), Ryota Nishimura (Tokushima Univ.), Norihide Kitaoka (TUT)
(To be available after the conference date) [more]
EA, SIP, SP, IPSJ-SLP [detail] 2025-03-04
11:05
Okinawa (Okinawa) [Poster Presentation] Improvement of Speech Recognition Performance for Elderly Speech by Alternating Learning of Acoustic and Linguistic information
Kaito Takahashi, Yukoh Wakabayashi (TUT), Kengo Ohta (NIT, Anan College), Norihide Kitaoka (TUT)
(To be available after the conference date) [more]
SP, IPSJ-MUS, IPSJ-SLP [detail] 2024-06-15
13:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
[Poster Presentation] Improving CTC-based ASR model by weighting encoder layers using attention mechanisms
Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) SP2024-9
pp.43-48
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
09:30
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Constructing and Evaluating a Batch Voice Input System for Electronic Medical Records Using Large Language Models
Ryo Maejima, Norihide Kitaoka (TUT) EA2023-99 SIP2023-146 SP2023-81
This study aims to develop an electronic medical record with a voice input interface that lets users input several items... [more] EA2023-99 SIP2023-146 SP2023-81
pp.226-231
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
09:30
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Domain adaptation of speech recognition model based on multilingual SSL model with only nonparallel corpus.
Takahiro Kinouchi (TUT), Atsunori Ogawa (NTT), Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Norihide Kitaoka (TUT) EA2023-100 SIP2023-147 SP2023-82
Automatic speech recognition (ASR) models are used in various services and businesses, and each domain’s recognition acc... [more] EA2023-100 SIP2023-147 SP2023-82
pp.232-237
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
09:30
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Improving speech recognition system consisting of multiple speech recognition models
Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) EA2023-101 SIP2023-148 SP2023-83
pp.238-243
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
09:30
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Evaluation of Automatic Speech Recognition for Deaf and Hard-of-Hearing People by Speaker Adaptation.
Kaito Takahashi, Takahiro Kinouchi, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Akio Kobayashi (Yamato Univ.), Norihide Kitaoka (TUT) EA2023-102 SIP2023-149 SP2023-84
Communication between normal-hearing people and the deaf generally uses sign language, written communication, and spe... [more] EA2023-102 SIP2023-149 SP2023-84
pp.244-249
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
10:40
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Intermediate speaker speech synthesis between two speakers using x-vector speaker space
Sota Hosoi, Takahiro Kinouchi, Yukoh Wakabayashi, Norihide Kitaoka (TUT) EA2023-103 SIP2023-150 SP2023-85
Recent advancements in speech synthesis technologies have enabled the synthesis of speeches of speakers not in the train... [more] EA2023-103 SIP2023-150 SP2023-85
pp.250-255
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
10:40
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Substitution of Implicit Linguistic Information in Beam Search Decoding Using CTC-based Speech Recognition Models
Tatsunari Takagi, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) EA2023-106 SIP2023-153 SP2023-88
The rise of neural networks in the field of automatic speech recognition has notably improved the accuracy of speech rec... [more] EA2023-106 SIP2023-153 SP2023-88
pp.268-273
SP, IPSJ-MUS, IPSJ-SLP [detail] 2023-06-23
13:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
Streaming End-to-End speech recognition using a CTC decoder with substituted linguistic information
Tatsunari Takagi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka, Yukoh Wakabayashi (TUT) SP2023-12
Speech recognition technology has been employed in various fields due to the enhancement of speech recognition model acc... [more] SP2023-12
pp.60-64
SP, IPSJ-MUS, IPSJ-SLP [detail] 2023-06-24
13:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
Domain adaptation of speech recognition models based on self-supervised learning using target domain speech
Takahiro Kinouchi (TUT), Atsunori Ogawa (NTT), Yukoh Wakabayashi, Norihide Kitaoka (TUT) SP2023-19
In this study, we propose a domain adaptation method using only speech data in the target domain without using transcrib... [more] SP2023-19
pp.91-96
SP, IPSJ-MUS, IPSJ-SLP [detail] 2023-06-24
13:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
Automatic speech recognition model simultaneously recognizes linguistic information and verbal/non-verbal phenomena
Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka (TUT) SP2023-22
Although speech recognition technology has advanced in recent years, most of them recognize only linguistic information ... [more] SP2023-22
pp.109-113
SP, IPSJ-SLP, EA, SIP [detail] 2023-03-01
15:05
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Construction of Language Model for Low-resource Domain Speech Recognition Based on Sentence Generation
Ryo Maejima, Daiki Mori, Yukoh Wakabayashi, Norihide Kitaoka (TUT)
SP, IPSJ-SLP, EA, SIP [detail] 2023-03-01
15:10
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Automatic Speech Recognition model using data with verbal and non-verbal information tag
Nagito Shione, Yukoh Wakabayashi, Norihide Kitaoka (TUT)
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] 2022-11-29
14:35
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
Density Ratio Approach-based multiple Encoder-Decoder ASR model integration
Keigo Hojo, Daiki Mori, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) NLC2022-10 SP2022-30
One of the methods to improve the performance of Encoder--Decoder speech recognition is the integration of an ASR models... [more] NLC2022-10 SP2022-30
pp.5-9
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] 2022-12-01
15:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
ASR model adaptation to target domain with large-scale audio data without transcription
Takahiro Kinouchi, Daiki Mori (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) NLC2022-18 SP2022-38
Nowadays, speech recognition is used in various services and businesses thanks to the advent of high-performance models ... [more] NLC2022-18 SP2022-38
pp.50-53
WIT, SP, IPSJ-SLP [detail] 2020-10-22
14:10
Online (Online) Early Dementia Detection based on Speech and Language Information
Maina Umezawa, Yurie Iribe (Aichi Prefectural Univ.), Norihide Kitaoka (Toyohashi Tech) SP2020-12 WIT2020-13
In recent years, research has been conducted to detect people with mild dementia from dialogue voices of the elderly. Bu... [more] SP2020-12 WIT2020-13
pp.21-26
PRMU, SP 2018-06-29
11:30
Nagano (Nagano) Mapping Acoustic Vector Sequence to Document Vector Based on RNN
Ryota Nishimura, Miho Higaki, Norihide Kitaoka (Tokushima Univ.) PRMU2018-32 SP2018-12
In this research, we propose a method of searching between different media (cross media mapping) using deep learning (Ma... [more] PRMU2018-32 SP2018-12
pp.59-64
NLC, IPSJ-NL, SP, IPSJ-SLP
(Joint) [detail]
2017-12-21
12:50
Tokyo Waseda Univ. Green Computing Systems Research Organization (Tokyo) [Poster Presentation] Selecting Response from Conversational Spoken Dialogue System Based on Distributed Representation of User Utterances
Kengo Ohta (NIT, Anan College), Ryota Nishimura, Norihide Kitaoka (Tokushima Univ.) SP2017-55
pp.1-5
Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)




The Institute of Electronics, Information and Communication Engineers (IEICE), Japan