Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
WIT, SP, IPSJ-SLP [detail] |
2023-10-14 16:15 |
Fukuoka |
Kyushu Institute of Technology (Primary: On-site, Secondary: Online) |
Comparative study on different speaker embedding spaces focusing on the relation to perceptual inter-speaker similarity Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) |
(To be available after the conference date) [more] |
|
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Choral Singing Voice Synthesis with Modulation Acoustic Features Sora Miyazawa, Anan Kikuchi, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-110 SIP2022-154 SP2022-74 |
In this paper, we analyzed the sense of multipule singing focused on unison and implemented it for a singing voice
synt... [more] |
EA2022-110 SIP2022-154 SP2022-74 pp.209-214 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Predominant Instrument Recognition in Polyphonic Music Based on Transfer Learning with Vanilla ResNet-50 Lifan Zhong, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-114 SIP2022-158 SP2022-78 |
Instrument recognition is an active research field in MIR (Music Information Retrieval) and has great potential for real... [more] |
EA2022-114 SIP2022-158 SP2022-78 pp.232-237 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 16:50 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Effects of Voice Artificiality on the Degree of Compatibility between Voice and Appearance of Voice Agents Kota Iura, Naotake Masuda, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-121 SIP2022-165 SP2022-85 |
For a spoken agent such as interactive robots, it is important to use a voice that fits the image of the agent in terms ... [more] |
EA2022-121 SIP2022-165 SP2022-85 pp.264-269 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 17:10 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Quantification of Voice Register Information including Mixed Voice based on Class Posterior Probabilities Yu Kitamura, Anan Kikuchi, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2022-122 SIP2022-166 SP2022-86 |
Methods to distinguish between modal and falsetto have been proposed so far,
but there are few studies analyzing mixed ... [more] |
EA2022-122 SIP2022-166 SP2022-86 pp.270-275 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-02 11:30 |
Online |
Online |
Multi-faceted assessment of language learners' ability of perception and production of English speech based on shadowing Takuya Kunihara, Chuanbo Zhu, Daisuke Saito, Nobuaki Minematsu (UTokyo), Noriko Nakanishi (KGU) NLC2021-19 SP2021-40 |
(To be available after the conference date) [more] |
NLC2021-19 SP2021-40 pp.7-12 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 14:05 |
Online |
Online |
[Poster Presentation]
Investigation of DNN-based speech synthesis utilizing oral reading skills obtained from large scale subjective evaluation Shun Akui (UTokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2020-71 SIP2020-102 SP2020-36 |
So far, we have been suggested the value of `oral reading skill' based on a listening evaluation experiment as a quantit... [more] |
EA2020-71 SIP2020-102 SP2020-36 pp.68-73 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-04 10:15 |
Online |
Online |
A quantitative measure of discriminability between NMF dictionaries Eisuke Konno, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2020-82 SIP2020-113 SP2020-47 |
Supervised nonnegative matrix factorization (NMF) is a popular approach for monaural audio source separation. It realize... [more] |
EA2020-82 SIP2020-113 SP2020-47 pp.134-139 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
[Poster Presentation]
Implementation of a high-accuracy method for automatic fluency scoring of spontaneous English utterances by Japanese learners Ayano Yasukagawa, Shintaro Ando, Eisuke Konno, Zhenchao Lin, Yusuke Inoue, Daisuke Saito, Nobuaki Minematsu (UTokyo), Kazuya Saito (UCL) EA2019-134 SIP2019-136 SP2019-83 |
These days, many teachers claim importance of not native-likeness-based but intelligibility-based assessment of pronunci... [more] |
EA2019-134 SIP2019-136 SP2019-83 pp.189-194 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
[Poster Presentation]
Initial analysis of oral reading skills obtained from large scale subjective evaluation Takuya Ozuru (Univ. of Tokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) EA2019-135 SIP2019-137 SP2019-84 |
Speech of professional newscasters easily suggest us his/her occupation, that is newscaster. So far, we have analyzed pr... [more] |
EA2019-135 SIP2019-137 SP2019-84 pp.195-200 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
[Poster Presentation]
Automatic estimation of prosodic control made in English utterances using DNN-based acoustic models trained with prosodic features and labels Yang Shen, Shintarou Ando, Nobuaki Minematsu, Daisuke Saito (UTokyo), Satoshi Kobashikawa (NTT) EA2019-136 SIP2019-138 SP2019-85 |
This paper investigate how to utilize DNN acoustic models trained with prosodic features and labels to detect prosodic e... [more] |
EA2019-136 SIP2019-138 SP2019-85 pp.201-206 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
[Poster Presentation]
An Educational Study on Prosodic Symbols and Their Acoustic Realization Using Japanese End-to-end Speech Synthesis Fuki Yoshizawa (UTokyo), Tadashi Kumano (NHK), Nobuaki Minematsu (UTokyo), Kiyoshi Kurihara (NHK) EA2019-137 SIP2019-139 SP2019-86 |
In order to examine the educational effect of presenting prosodic symbols to learners of Japanese, a method was proposed... [more] |
EA2019-137 SIP2019-139 SP2019-86 pp.207-212 |
SP |
2019-08-28 14:40 |
Kyoto |
Kyoto Univ. |
[Poster Presentation]
Analysis of prosodic differences between a newscaster and amateur speakers using partial-substituted synthetic speech Takuya Ozuru (Univ. of Tokyo), Yusuke Ijima (NTT), Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) SP2019-11 |
This paper analyzes prosodic differences between a professional newscaster and amateur speakers which affects listeners’... [more] |
SP2019-11 pp.13-18 |
SP |
2019-06-13 14:20 |
Kanagawa |
Tokyo Institute of Technology |
A large collection of sentences read aloud by Vietnamese learners of Japanese and native speakers' reverse shadowings Shintaro Ando, Tasavat Trisitichoke, Yusuke Inoue, Fuki Yoshizawa, Daisuke Saito, Nobuaki Minematsu (UTokyo) SP2019-3 |
The main objective of language learning is to acquire good communication skills in the target language.
From that viewp... [more] |
SP2019-3 pp.13-17 |
SP |
2019-06-13 14:45 |
Kanagawa |
Tokyo Institute of Technology |
Evaluation of Comprehensibility of L2 Speech Based on Native Listeners’ Reverse Shadowing and Their Facial Expressions Tasavat Trisitichoke, Shintaro Ando, Daisuke Saito, Nobuaki Minematsu (UTokyo) SP2019-4 |
Recently, researchers' attention has been paid to pronunciation assessment not based on comparison between L2 utterances... [more] |
SP2019-4 pp.19-24 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
An experimental study of influence of classroom babble noise on automatic assessment of learners' shadowing speech Suguru Kabashima, Daisuke Saito, Nobuaki Minematsu (UTokyo), Yutaka Yamauchi (Soka Univ.), Kayoko Ito (Koyasan Univ.) EA2018-118 SIP2018-124 SP2018-80 |
(To be available after the conference date) [more] |
EA2018-118 SIP2018-124 SP2018-80 pp.113-118 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 13:00 |
Okinawa |
|
[Poster Presentation]
Quantitative and corpus-based analysis of pronunciation diversity observed in Japanese English Suguru Kabashima, Haoyu Zhang, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo), Satoshi Kobashikawa, Ryo Masumura (NTT) EA2017-113 SIP2017-122 SP2017-96 |
In foreign language teaching, corrective feedback to learners' pronunciation is regarded
as highly important and automa... [more] |
EA2017-113 SIP2017-122 SP2017-96 pp.69-74 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 13:00 |
Okinawa |
|
[Poster Presentation]
An Experimental Study on Segmental and Prosodic Comparison of Utterances for Automatic Assessment of Dubbing Speech Takuya Ozuru, Nobuaki Minematsu, Daisuke Saito (Univ. of Tokyo) EA2017-114 SIP2017-123 SP2017-97 |
In Japanese language education, especially in its speech training, dubbing-based training has gained a
huge popularity.... [more] |
EA2017-114 SIP2017-123 SP2017-97 pp.75-80 |
EA, SP, SIP |
2016-03-29 09:00 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
Amplitude limiters based on phase optimization Akira Kakitani, Daisuke Saito, Yasuhiro Kosugi, Nobuaki Minematsu (UTokyo) EA2015-111 SIP2015-160 SP2015-139 |
In order to reduce the peak value of source waveforms without quality degradation, a novel method is proposed. In this m... [more] |
EA2015-111 SIP2015-160 SP2015-139 pp.249-254 |
EA, SP, SIP |
2016-03-29 09:00 |
Oita |
Beppu International Convention Center B-ConPlaza |
[Poster Presentation]
An experimental study of designing context labels for infant-directed storytelling speech synthesis Kyota Hyakutake, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2015-112 SIP2015-161 SP2015-140 |
Context labels for infant-directed storytelling speech synthesis are investigated. After collecting one-hour storytellin... [more] |
EA2015-112 SIP2015-161 SP2015-140 pp.255-260 |