Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
HCS |
2024-03-03 09:20 |
Shizuoka |
Tokoha University (Shizuoka-Kusanagi Campus) |
Environment Recognition System for Visually Impaired People Using LIDAR Mounted on User's Head
-- Improvements of Distance Measurement/Acoustic Signal Conversion and Experimental Evaluation -- Liqi Zhu, Tetsuo Tsujioka, Shigeyoshi Nakajima, Hitoshi Watanabe, Ikuo Oka (Osaka Metropolitan Univ.) HCS2023-103 |
LIDAR technology is often used to detect the environment around users. For visually impaired people, a system th... [more] |
HCS2023-103 pp.86-91 |
AI |
2024-03-01 15:20 |
Aichi |
Room0221, Bldg.2-C, Nagoya Institute of Technology |
On Using Existing Facial Expression Recognition Model for Student Behavior Tracking Yuna Kaneko, Masato Kikuchi, Tadachika Ozono (NIT) AI2023-43 |
It is challenging for teachers to understand students' reactions during online lectures. Estimating student behaviors by... [more] |
AI2023-43 pp.37-40 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 10:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Multi-task learning with age information model for highly accurate elderly speech recognition. Shine Takumi, Kinouchi Takahiro, Wakabayashi Yukoh, Kitaoka Norihide (TUT) EA2023-64 SIP2023-111 SP2023-46 |
The speech recognition of the elderly is less accurate, especially in smart speaker speech recognition, due to aging-rel... [more] |
EA2023-64 SIP2023-111 SP2023-46 pp.19-24 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
|
We have developed automatic speech recognition and dialect identification techniques by using COJADS, a corpus of Japane... [more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Constructing and Evaluating a Batch Voice Input System for Electronic Medical Records Using Large Language Models Ryo Maejima, Norihide Kitaoka (TUT) EA2023-99 SIP2023-146 SP2023-81 |
This study aims to develop an electronic medical record with a voice input interface that lets users input several items... [more] |
EA2023-99 SIP2023-146 SP2023-81 pp.226-231 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Domain adaptation of speech recognition model based on multilingual SSL model with only nonparallel corpus. Takahiro Kinouchi (TUT), Atsunori Ogawa (NTT), Yukoh Wakabayashi (TUT), Kengo Ohta (NITA), Norihide Kitaoka (TUT) EA2023-100 SIP2023-147 SP2023-82 |
Automatic speech recognition (ASR) models are used in various services and businesses, and each domain’s recognition acc... [more] |
EA2023-100 SIP2023-147 SP2023-82 pp.232-237 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluation of Automatic Speech Recognition for Deaf and Hard-of-Hearing People by Speaker Adaptation. Kaito Takahashi, Takahiro Kinouchi, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Akio Kobayashi (Yamato Univ.), Norihide Kitaoka (TUT) EA2023-102 SIP2023-149 SP2023-84 |
Communication between normal-hearing people and the deaf generally uses sign language, written communication, and spe... [more] |
EA2023-102 SIP2023-149 SP2023-84 pp.244-249 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
An Investigation into Weighting Strategies for Model Averaging in Continual Learning for Automatic Speech Recognition Kentaro Shinayama, Hiroshi Sato, Tomoharu Iwata, Takeshi Mori, Taichi Asami (NTT) EA2023-105 SIP2023-152 SP2023-87 |
In recent years, the application scope of speech recognition AI has expanded, enabling the acquisition of diverse data d... [more] |
EA2023-105 SIP2023-152 SP2023-87 pp.262-267 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Substitution of Implicit Linguistic Information in Beam Search Decoding Using CTC-based Speech Recognition Models Tatsunari Takagi, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) EA2023-106 SIP2023-153 SP2023-88 |
The rise of neural networks in the field of automatic speech recognition has notably improved the accuracy of speech rec... [more] |
EA2023-106 SIP2023-153 SP2023-88 pp.268-273 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 16:35 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Evaluations of Multi-channel Blind Source Separation for Speech Recognition in Car Environments Yutsuki Takeuchi, Natsuki Ueno, Nobutaka Ono (Tokyo Metropolitan Univ.), Takashi Takazawa, Shuhei Shimanoe, Tomoki Tanemura (MIRISE Technologies) EA2023-127 SIP2023-174 SP2023-109 |
In car environments, speech recognition is difficult due to various types of noise. For this issue, speech enhancement b... [more] |
EA2023-127 SIP2023-174 SP2023-109 pp.388-393 |
NS, IN (Joint) |
2024-02-29 10:10 |
Okinawa |
Okinawa Convention Center |
Multimodal Object Recognition Method Using Bayesian Attractor Model For 3D Point Clouds and RGB Images Haruhito Ando, Daichi Kominami, Ryoga Seki, Masayuki Murata, Hideyuki Shimonishi (Osaka Univ.) NS2023-192 |
Beyond 5G/6G technology is driving the development of digital twins. In recent years, the amount of information that ca... [more] |
NS2023-192 pp.119-124 |
HIP, ITE-HI, VRPSY, ASJ-H [detail] |
2024-02-23 16:00 |
Okinawa |
|
A Piano-Fingering Recognition Method for Overlapped Fingers based on Machine Learning of Sequential Hand Images Takuto Yagawa, Ryo Miyoshi, Shuichi Akizuki, Manabu Hashimoto (Chukyo Univ.) HIP2023-111 |
The purpose of this study is to propose a piano-fingering recognition method for educating beginners and analyzing exper... [more] |
HIP2023-111 pp.90-94 |
PRMU, MVE, VRSJ-SIG-MR, IPSJ-CVIM |
2024-01-25 10:03 |
Kanagawa |
Keio Univ. (Hiyoshi Campus) |
Estimation of 3D Coordinates of Fingertips using Contrastive Embeddings from Hand Images Tatsuya Abe, Takeshi Umezawa, Noritaka Osawa (Chiba Univ.) PRMU2023-40 |
This study evaluated a method for estimating the 3D coordinates of fingertips from hand images when manipulating objects... [more] |
PRMU2023-40 pp.7-12 |
PRMU, MVE, VRSJ-SIG-MR, IPSJ-CVIM |
2024-01-26 10:30 |
Kanagawa |
Keio Univ. (Hiyoshi Campus) |
Body Parts Motion Guided Deformable Attention for Action Recognition Yuji Sato, Umihiro Kamoto, Takeo Ueta (Panasonic Connect), Yasunori Ishii (Panasonic Holdings), Takayoshi Yamashita (Chubu Univ.) PRMU2023-44 |
In recent years, video transformers for action recognition have been proposed and achieve high performance. Most of the... [more] |
PRMU2023-44 pp.26-31 |
PRMU, MVE, VRSJ-SIG-MR, IPSJ-CVIM |
2024-01-26 15:34 |
Kanagawa |
Keio Univ. (Hiyoshi Campus) |
Comparison of Imbalanced Data Handling Techniques in Emotion Estimation of Expressway Service Area Workers using Stacking Ensemble Learners for Complex Decision Boundaries Akihiro Sato, Satoki Ogiso, Ryosuke Ichikari, Takeshi Kurata (AIST) PRMU2023-47 |
Estimating emotions of workers is promising to promote health and productivity management, while it has difficulty in c... [more] |
PRMU2023-47 pp.40-45 |
ICTSSL, CAS |
2024-01-25 11:45 |
Kanagawa |
(Primary: On-site, Secondary: Online) |
Comparison of transfer learning and fine tuning Ohata Shunsuke, Okazaki Hideaki (SIT) CAS2023-88 ICTSSL2023-41 |
This report examines the principal image recognition methods. First, we show the experimental results of image recogniti... [more] |
CAS2023-88 ICTSSL2023-41 pp.31-33 |
SIP, IT, RCS |
2024-01-19 14:55 |
Miyagi |
(Primary: On-site, Secondary: Online) |
Dataset Generation System for Hand Gesture Recognition Using FMCW-MIMO Radar Katsuhisa Kashiwagi (Murata Manufacturing/Yokohama National Univ.), Koichi Ichige (Yokohama National Univ.) IT2023-72 SIP2023-105 RCS2023-247 |
In this paper, we propose a dataset generation system for hand gesture recognition using Frequency Modulated Continuous ... [more] |
IT2023-72 SIP2023-105 RCS2023-247 pp.229-234 |
SeMI |
2024-01-18 15:35 |
Yamanashi |
Raki House Kaiji |
[Poster Presentation]
Towards Activity Recognition Using Wi-Fi CSI from Backscatter Tags in Indoor Environment Kazuki Miyao, Erdelyi Viktor, Akira Uchiyama (Osaka Univ.), Tomoki Murakami (NTT) SeMI2023-60 |
Recently, activity recognition using Wi-Fi CSI has received significant attention due to its low deployment cost, such a... [more] |
SeMI2023-60 pp.56-57 |
EMM |
2024-01-17 11:20 |
Miyagi |
Tohoku Univ. (Primary: On-site, Secondary: Online) |
A study on 3D model adaptation for generating patch-based adversarial perturbations Hiroto Takiwaki (Okayama Univ.), Minoru Kuribayashi (Tohoku Univ.), Nobuo Funabiki (Okayama Univ.) EMM2023-88 |
The development of facial recognition technology using machine learning has made it possible to recognize individuals fr... [more] |
EMM2023-88 pp.44-49 |
ET |
2023-12-16 13:15 |
Okayama |
Tsushima Campus, Okayama University |
Recognition of student facial expressions and movements during class
-- Effects of recognition through generated images and frontalization -- Kazuo Ohzeki, Aki Saito Shiojiri, Koichi Kamijo, Masami Suzuki (IPUT) ET2023-39 |
Using cameras and electroencephalograms, we are conducting research to identify teacher actions to improve student conce... [more] |
ET2023-39 pp.19-23 |