Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2024-12-14 16:30 |
Aichi |
Nagoya Univ. (Primary: On-site, Secondary: Online) |
Research of attack performance of adversarial example based on difference of speaker embeddings against speaker verification models Yu Kitamura, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) NLC2024-26 SP2024-17 |
[more] |
NLC2024-26 SP2024-17 pp.41-46 |
WIT |
2024-10-19 13:55 |
Tochigi |
Teikyo University Utsunomiya Campus |
Development of an English Reading-aloud System to Support Japanese Children with Dyslexia Rina Katsumata (UTokyo), Ayumi Narita (mojikojuku), Daisuke Saito, Nobuaki Minematsu (UTokyo) WIT2024-6 |
Dyslexia is a learning disability that causes difficulties in reading and writing despite having no issues with intellec... [more] |
WIT2024-6 pp.7-12 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
An experimental survey on speaker embedding spaces for controlling speaker identity in speech synthesis system Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) EA2023-93 SIP2023-140 SP2023-75 |
This study investigated the influence of the discriminability of speaker encoders on speech synthesis models that can co... [more] |
EA2023-93 SIP2023-140 SP2023-75 pp.190-195 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
A Study on Environmental Sound Synthesis in the Case of Pausing in Virtual Walking Applications Hiroshi Nishijima, Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2023-96 SIP2023-143 SP2023-78 |
In this study, we investigated the synthesis of environmental sounds for pausing states in virtual walking applications ... [more] |
EA2023-96 SIP2023-143 SP2023-78 pp.208-213 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 09:30 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Analysis of speech synthesis of text-free audio using a self-supervised learning model
-- focusing on multilingual applications -- Joonyong Park, Daisuke Saito, Nobuaki Minematsu (The Univ. of Tokyo) EA2023-97 SIP2023-144 SP2023-79 |
[more] |
EA2023-97 SIP2023-144 SP2023-79 pp.214-219 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
modal-to-falsetto singing voice conversion focused on the shape of glottal sound wave and parameter control of the glottal wave Shota Okada, Yu Kitamura, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) EA2023-109 SIP2023-156 SP2023-91 |
When singing, falsetto is important as a high-pitched singing and expressive technique. However, while vocal
synthesize... [more] |
EA2023-109 SIP2023-156 SP2023-91 pp.283-288 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 15:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Prediction of Voice Processing Intensity Matching the Impression of a Voice Agent Ren Miyamoto, Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Tokyo Univ.) EA2023-132 SIP2023-179 SP2023-114 |
When a voice agent such as a robot interacts with a human, it is important in terms of familiarity with the agent to sel... [more] |
EA2023-132 SIP2023-179 SP2023-114 pp.415-420 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 10:00 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online) |
Improvement of Tacotron2 text-to-speech model based on masking operation and positional attention mechanism Tong Ma, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) NLC2023-17 SP2023-37 |
[more] |
NLC2023-17 SP2023-37 pp.19-24 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 11:05 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Integration of Throat Microphone Recording and Bandwidth Extension for Robust Assessment of L2 Listening Yu Xu, Nobuaki Minematsu, Daisuke Saito (Univ. of Tokyo) NLC2023-20 SP2023-40 |
In an active classroom, L2 assessment is often a challenging issue, since everyone in the crowded classroom can be a noi... [more] |
NLC2023-20 SP2023-40 pp.37-42 |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-03 11:05 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Self-supervised learning model based emotion transfer and intensity control technology for expressive speech synthesis Wei Li, Nobuaki Minematsu, Daisuke Saito (Univ. of Tokyo) NLC2023-21 SP2023-41 |
Emotion transfer techniques, which transfersba the speaking style from the reference speech to the target speech, are wi... [more] |
NLC2023-21 SP2023-41 pp.43-48 |
WIT, SP, IPSJ-SLP [detail] |
2023-10-14 16:15 |
Fukuoka |
Kyushu Institute of Technology (Primary: On-site, Secondary: Online) |
Comparative study on different speaker embedding spaces focusing on the relation to perceptual inter-speaker similarity Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) SP2023-31 WIT2023-22 |
This study examines the correspondence between inter-speaker similarity based on speaker embeddings and perceptual speak... [more] |
SP2023-31 WIT2023-22 pp.21-26 |
PN |
2023-08-29 16:40 |
Hokkaido |
(Primary: On-site, Secondary: Online) |
Capacity Enhancement of Resilient Optical Networks with Multi-band Virtual Bypass Links Daisuke Saito, Yojiro Mori (Nagoya Univ), Kohei Hosokawa, Shigeyuki Yanagimachi (NEC), Hiroshi Hasegawa (Nagoya Univ) PN2023-26 |
We propose a cost-effective capacity enhancement for resilient networks that adopt dedicated path protection. This enhan... [more] |
PN2023-26 pp.55-58 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Choral Singing Voice Synthesis with Modulation Acoustic Features Sora Miyazawa, Anan Kikuchi, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-110 SIP2022-154 SP2022-74 |
In this paper, we analyzed the sense of multipule singing focused on unison and implemented it for a singing voice
synt... [more] |
EA2022-110 SIP2022-154 SP2022-74 pp.209-214 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Predominant Instrument Recognition in Polyphonic Music Based on Transfer Learning with Vanilla ResNet-50 Lifan Zhong, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-114 SIP2022-158 SP2022-78 |
Instrument recognition is an active research field in MIR (Music Information Retrieval) and has great potential for real... [more] |
EA2022-114 SIP2022-158 SP2022-78 pp.232-237 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 16:50 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Effects of Voice Artificiality on the Degree of Compatibility between Voice and Appearance of Voice Agents Kota Iura, Naotake Masuda, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-121 SIP2022-165 SP2022-85 |
For a spoken agent such as interactive robots, it is important to use a voice that fits the image of the agent in terms ... [more] |
EA2022-121 SIP2022-165 SP2022-85 pp.264-269 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 17:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Quantification of Voice Register Information including Mixed Voice based on Class Posterior Probabilities Yu Kitamura, Anan Kikuchi, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-122 SIP2022-166 SP2022-86 |
Methods to distinguish between modal and falsetto have been proposed so far,
but there are few studies analyzing mixed ... [more] |
EA2022-122 SIP2022-166 SP2022-86 pp.270-275 |
MWPTHz, PN, EMT, IEE-EMT [detail] |
2023-01-24 09:40 |
Osaka |
(Primary: On-site, Secondary: Online) |
Cost-effective Network Capacity Expansion by Supplemental Multi-band Transmission on Congested Links Daisuke Saito, Yojiro Mori (Nagoya Univ.), Kohei Hosokawa, Shigeyuki Yanagimachi (NEC), Hiroshi Hasegawa (Nagoya Univ.) PN2022-36 EMT2022-74 MWPTHz2022-62 |
A cost-effective capacity enhancement for photonic networks is proposed, which adopts multi-band transmission only on li... [more] |
PN2022-36 EMT2022-74 MWPTHz2022-62 pp.30-34 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-02 11:30 |
Online |
Online |
Multi-faceted assessment of language learners' ability of perception and production of English speech based on shadowing Takuya Kunihara, Chuanbo Zhu, Daisuke Saito, Nobuaki Minematsu (UTokyo), Noriko Nakanishi (KGU) NLC2021-19 SP2021-40 |
(To be available after the conference date) [more] |
NLC2021-19 SP2021-40 pp.7-12 |
SDM, ICD, ITE-IST [detail] |
2021-08-18 09:30 |
Online |
Online |
[Invited Talk]
Analog in-memory computing in FeFET based 1T1R array for low-power edge AI applications Daisuke Saito, Toshiyuki Kobayashi, Hiroki Koga (SONY), Yusuke Shuto, Jun Okuno, Kenta Konishi (SSS), Masanori Tsukamoto, Kazunobu Ohkuri (SONY), Taku Umebayashi (SSS), Takayuki Ezaki (SONY) SDM2021-36 ICD2021-7 |
Deep neural network (DNN) inference for edge AI requires low-power operation, which can be achieved by implementing mass... [more] |
SDM2021-36 ICD2021-7 pp.33-37 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 14:05 |
Online |
Online |
[Poster Presentation]
Investigation of DNN-based speech synthesis utilizing oral reading skills obtained from large scale subjective evaluation Shun Akui (UTokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2020-71 SIP2020-102 SP2020-36 |
So far, we have been suggested the value of `oral reading skill' based on a listening evaluation experiment as a quantit... [more] |
EA2020-71 SIP2020-102 SP2020-36 pp.68-73 |