Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2024-12-14 16:30 |
Aichi |
Nagoya Univ. (Aichi, Online) (Primary: On-site, Secondary: Online) |
Research of attack performance of adversarial example based on difference of speaker embeddings against speaker verification models Yu Kitamura, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) NLC2024-26 SP2024-17 |
[more] |
NLC2024-26 SP2024-17 pp.41-46 |
WIT |
2024-10-19 13:55 |
Tochigi |
Teikyo University Utsunomiya Campus (Tochigi) |
Development of an English Reading-aloud System to Support Japanese Children with Dyslexia Rina Katsumata (UTokyo), Ayumi Narita (mojikojuku), Daisuke Saito, Nobuaki Minematsu (UTokyo) WIT2024-6 |
Dyslexia is a learning disability that causes difficulties in reading and writing despite having no issues with intellec... [more] |
WIT2024-6 pp.7-12 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
An experimental survey on speaker embedding spaces for controlling speaker identity in speech synthesis system Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) EA2023-93 SIP2023-140 SP2023-75 |
This study investigated the influence of the discriminability of speaker encoders on speech synthesis models that can co... [more] |
EA2023-93 SIP2023-140 SP2023-75 pp.190-195 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
A Study on Environmental Sound Synthesis in the Case of Pausing in Virtual Walking Applications Hiroshi Nishijima, Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2023-96 SIP2023-143 SP2023-78 |
In this study, we investigated the synthesis of environmental sounds for pausing states in virtual walking applications ... [more] |
EA2023-96 SIP2023-143 SP2023-78 pp.208-213 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Analysis of speech synthesis of text-free audio using a self-supervised learning model
-- focusing on multilingual applications -- Joonyong Park, Daisuke Saito, Nobuaki Minematsu (The Univ. of Tokyo) EA2023-97 SIP2023-144 SP2023-79 |
[more] |
EA2023-97 SIP2023-144 SP2023-79 pp.214-219 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
modal-to-falsetto singing voice conversion focused on the shape of glottal sound wave and parameter control of the glottal wave Shota Okada, Yu Kitamura, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) EA2023-109 SIP2023-156 SP2023-91 |
When singing, falsetto is important as a high-pitched singing and expressive technique. However, while vocal
synthesize... [more] |
EA2023-109 SIP2023-156 SP2023-91 pp.283-288 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 15:45 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Prediction of Voice Processing Intensity Matching the Impression of a Voice Agent Ren Miyamoto, Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) EA2023-132 SIP2023-179 SP2023-114 |
When a voice agent such as a robot interacts with a human, it is important in terms of familiarity with the agent to sel... [more] |
EA2023-132 SIP2023-179 SP2023-114 pp.415-420 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-02 16:00 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Tokyo, Online) (Primary: On-site, Secondary: Online) |
Development and effects of English speech training drills to improve perception and production skills seamlessly with interactive gamification Nobuaki Minematsu, Yingxiang Gao (UTokyo), Noriko Nakanishi (KGU), Yusuke Inoue, Hiroaki Mizuno (Carriage) NLC2023-15 SP2023-35 |
To improve aural/oral proficiency in English, various skills have to be acquired such as 1) spoken word perception, 2) m... [more] |
NLC2023-15 SP2023-35 pp.7-12 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 10:00 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Tokyo, Online) (Primary: On-site, Secondary: Online) |
Improvement of Tacotron2 text-to-speech model based on masking operation and positional attention mechanism Tong Ma, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) NLC2023-17 SP2023-37 |
[more] |
NLC2023-17 SP2023-37 pp.19-24 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 11:05 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Tokyo, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Enhancing Multi-Accent Automated Speech Recognition with Accent-Activated Adapters Yuqin Lin, Longbiao Wang, Jianwu Dang (Tianjin Univ. & Univ. of Tokyo), Nobuaki Minematsu (Univ. of Tokyo) NLC2023-18 SP2023-38 |
This paper proposes the Accent-Activated adapter (AccentAct) approach to address the challenge of speech variations in m... [more] |
NLC2023-18 SP2023-38 pp.25-30 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 11:05 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Tokyo, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Enhancing Dysarthric Speech Recognition with Auxiliary Feature Fusion Module: Exploring Articulatory-related Features from Foundation Models Yuqin Lin, Longbiao Wang, Jianwu Dang (Tianjin Univ. & Univ. of Tokyo), Nobuaki Minematsu (Univ. of Tokyo) NLC2023-19 SP2023-39 |
Addressing dysarthric speech variability in Automatic Speech Recognition (ASR) is crucial for improving human-computer i... [more] |
NLC2023-19 SP2023-39 pp.31-36 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 11:05 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Tokyo, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Integration of Throat Microphone Recording and Bandwidth Extension for Robust Assessment of L2 Listening Yu Xu, Nobuaki Minematsu, Daisuke Saito (Univ. of Tokyo) NLC2023-20 SP2023-40 |
In an active classroom, L2 assessment is often a challenging issue, since everyone in the crowded classroom can be a noi... [more] |
NLC2023-20 SP2023-40 pp.37-42 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 11:05 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Tokyo, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Self-supervised learning model based emotion transfer and intensity control technology for expressive speech synthesis Wei Li, Nobuaki Minematsu, Daisuke Saito (Univ. of Tokyo) NLC2023-21 SP2023-41 |
Emotion transfer techniques, which transfersba the speaking style from the reference speech to the target speech, are wi... [more] |
NLC2023-21 SP2023-41 pp.43-48 |
WIT, SP, IPSJ-SLP [detail] |
2023-10-14 16:15 |
Fukuoka |
Kyushu Institute of Technology (Fukuoka, Online) (Primary: On-site, Secondary: Online) |
Comparative study on different speaker embedding spaces focusing on the relation to perceptual inter-speaker similarity Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) SP2023-31 WIT2023-22 |
This study examines the correspondence between inter-speaker similarity based on speaker embeddings and perceptual speak... [more] |
SP2023-31 WIT2023-22 pp.21-26 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:40 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Choral Singing Voice Synthesis with Modulation Acoustic Features Sora Miyazawa, Anan Kikuchi, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-110 SIP2022-154 SP2022-74 |
In this paper, we analyzed the sense of multipule singing focused on unison and implemented it for a singing voice
synt... [more] |
EA2022-110 SIP2022-154 SP2022-74 pp.209-214 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:40 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Predominant Instrument Recognition in Polyphonic Music Based on Transfer Learning with Vanilla ResNet-50 Lifan Zhong, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-114 SIP2022-158 SP2022-78 |
Instrument recognition is an active research field in MIR (Music Information Retrieval) and has great potential for real... [more] |
EA2022-114 SIP2022-158 SP2022-78 pp.232-237 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 16:50 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Effects of Voice Artificiality on the Degree of Compatibility between Voice and Appearance of Voice Agents Kota Iura, Naotake Masuda, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-121 SIP2022-165 SP2022-85 |
For a spoken agent such as interactive robots, it is important to use a voice that fits the image of the agent in terms ... [more] |
EA2022-121 SIP2022-165 SP2022-85 pp.264-269 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 17:10 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Quantification of Voice Register Information including Mixed Voice based on Class Posterior Probabilities Yu Kitamura, Anan Kikuchi, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-122 SIP2022-166 SP2022-86 |
Methods to distinguish between modal and falsetto have been proposed so far,
but there are few studies analyzing mixed ... [more] |
EA2022-122 SIP2022-166 SP2022-86 pp.270-275 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-02 11:30 |
Online |
Online (Online) |
Multi-faceted assessment of language learners' ability of perception and production of English speech based on shadowing Takuya Kunihara, Chuanbo Zhu, Daisuke Saito, Nobuaki Minematsu (UTokyo), Noriko Nakanishi (KGU) NLC2021-19 SP2021-40 |
(To be available after the conference date) [more] |
NLC2021-19 SP2021-40 pp.7-12 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 14:05 |
Online |
Online (Online) |
[Poster Presentation]
Investigation of DNN-based speech synthesis utilizing oral reading skills obtained from large scale subjective evaluation Shun Akui (UTokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2020-71 SIP2020-102 SP2020-36 |
So far, we have been suggested the value of `oral reading skill' based on a listening evaluation experiment as a quantit... [more] |
EA2020-71 SIP2020-102 SP2020-36 pp.68-73 |