Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SIP, SP, IPSJ-SLP [detail] |
2025-03-03 10:45 |
Okinawa |
(Okinawa) |
Measurement of time delay tolerance for third-person game live audio commentary Ryosuke Matsushita, Ryosuke Sakai, Koki Fukuda (Keio Univ.), Shinnosuke Takamichi (Keio Univ./UTokyo), Kota Iura, Yuki Saito (UTokyo), Graham Neubig (CMU), Katsuhito Sudoh (NWU), Hiroya Takamura, Tatsuya Ishigaki (AIST) |
[more] |
|
EA, SIP, SP, IPSJ-SLP [detail] |
2025-03-03 14:08 |
Okinawa |
(Okinawa) |
[No paper] Construction of subjective evaluation dataset for automatic evaluation of input-output relevance in text-to-audio Yusuke Kanamori, Yuki Okamoto, Taisei Takano (UTokyo), Shinnosuke Takamichi (Keio Univ./UTokyo), Yuki Saito, Hiroshi Saruwatari (UTokyo) |
[more] |
|
EA |
2024-05-22 14:55 |
Online |
Online (Online) |
Environmental sound synthesis and creation of dataset using vocal imitations Yuki Okamoto (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Shinnosuke Takamichi (The Univ. of Tokyo/Keio Univ.), Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) EA2024-5 |
One way to represent the characteristics of environmental sounds is to imitate the environmental sounds by human voice c... [more] |
EA2024-5 p.22 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 10:10 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Noise-Robust Voice Conversion by Denoising Training Conditioned with Latent Variables of Speech Quality and Recording Environment Takuto Igarashi, Yuki Saito, Kentaro Seki, Shinnosuke Takamichi (UT), Ryuichi Yamamoto, Kentaro Tachibana (LY), Hiroshi Saruwatari (UT) EA2023-63 SIP2023-110 SP2023-45 |
In this paper, we propose noise-robust voice conversion by conditioning latent variables representing speech quality and... [more] |
EA2023-63 SIP2023-110 SP2023-45 pp.13-18 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:35 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
SRC4VC: Smartphone-Recorded Corpus for Benchmarking Multi-Speaker Voice Conversion Models Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi (UT), Ryuichi Yamamoto, Kentaro Tachibana (LY), Hiroshi Saruwatari (UT) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:40 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Preliminary Evaluation of Japanese Speech Corpus J-SpAW for Speaker Verification and Spoofing Detection Kota Kanno (Tokyo Metropolitan Univ.), Shinnosuke Takamichi (UTokyo), Sayaka Shiota (Tokyo Metropolitan Univ.) |
[more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 16:05 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Evaluating speech generation based on objective measures for text generation Takaaki Saeki (UTokyo), Soumi Maiti (CMU), Shinnosuke Takamichi (UTokyo), Shinji Watanabe (CMU), Hiroshi Saruwatari (UTokyo) EA2023-133 SIP2023-180 SP2023-115 |
In the evaluation of speech generation, while subjective judgments have long been the gold standard, objective metrics s... [more] |
EA2023-133 SIP2023-180 SP2023-115 pp.421-426 |
NLC, IPSJ-NL |
2023-03-18 16:40 |
Okinawa |
OIST (Okinawa, Online) (Primary: On-site, Secondary: Online) |
Collection of Textual Expressions in the Wild Toward Voice-quality Control from Free Description Aya Watanabe, Shinnosuke Takamichi, Yuki Saito, Hiroshi Saruwatari (UTokyo) NLC2022-29 |
[more] |
NLC2022-29 pp.55-60 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-02-28 16:15 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Visual onoma-to-wave: environmental sound synthesis from visual onomatopoeias and sound-source images Hien Ohnaka (NITTC), Shinnosuke Takamichi (UT), Keisuke Imoto (DU), Yuki Okamoto (Rits), Kazuki Fujii, Hiroshi Saruwatari (UT) EA2022-90 SIP2022-134 SP2022-54 |
(To be available after the conference date) [more] |
EA2022-90 SIP2022-134 SP2022-54 pp.83-88 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 11:00 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Representation and Prediction of Accent Phrase Prosodic Features in Japanese Text-to-Speech Masaki Sato, Shinnosuke Takamichi, Hiroshi Saruwatari (The Univ. of Tokyo) EA2022-108 SIP2022-152 SP2022-72 |
In order to use speech synthesis in a variety of situations such as dialogue systems and emotional expression in audiobo... [more] |
EA2022-108 SIP2022-152 SP2022-72 pp.197-202 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-03-01 14:50 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Corpus construction toward multi-domain empathetic dialogue speech synthesis Yuki Saito, Eiji Iimori, Shinnosuke Takamichi (UT), Kentaro Tachibana (LINE), Hiroshi Saruwatari (UT) |
(To be available after the conference date) [more] |
|
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 10:45 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Evaluation of sentence-level generation in Japanese dialect speech synthesis using accent latent variables Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2021-79 SIP2021-106 SP2021-64 |
Japanese dialect speech synthesis is useful for personalized speech synthesis systems. However, inability to prepare acc... [more] |
EA2021-79 SIP2021-106 SP2021-64 pp.96-101 |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 12:00 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Evaluating the robustness of signal processing-based pseudonymization using parameter optimization against inversion attack. Hiroto Kai (Tokyo Metro. Univ.), Shinnosuke Takamichi (The Univ. of Tokyo), Sayaka Shiota, Hitoshi Kiya (Tokyo Metro. Univ.) EA2021-82 SIP2021-109 SP2021-67 |
[more] |
EA2021-82 SIP2021-109 SP2021-67 pp.114-119 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-03 11:00 |
Online |
Online (Online) |
Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47 |
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... [more] |
NLC2021-26 SP2021-47 pp.42-47 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 14:05 |
Online |
Online (Online) |
[Poster Presentation]
End-to-end incremental TTS with lookahead generation with large pretrained language model Takaaki Saeki, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2020-74 SIP2020-105 SP2020-39 |
(To be available after the conference date) [more] |
EA2020-74 SIP2020-105 SP2020-39 pp.85-90 |
SP, IPSJ-MUS, IPSJ-SLP |
2020-06-07 15:45 |
Online |
Online (Online) |
HumanGAN: generative adversarial network with human-based discriminator and its naturalness evaluation in synthesized voice Kazuki Fujii (NITTC), Yuki Saito, Shinnosuke Takamichi (UTokyo), Yukino Baba (UTsukuba), Hiroshi Saruwatari (UTokyo) SP2020-6 |
[more] |
SP2020-6 pp.15-20 |
SP, EA, SIP |
2020-03-02 13:00 |
Okinawa |
Okinawa Industry Support Center (Okinawa) (Cancelled but technical report was issued) |
The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi (UTokyo), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UTokyo) EA2019-112 SIP2019-114 SP2019-61 |
In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of... [more] |
EA2019-112 SIP2019-114 SP2019-61 pp.65-70 |
SP |
2019-06-13 15:25 |
Kanagawa |
Tokyo Institute of Technology (Kanagawa) |
[Invited Talk]
Constructing voice corpus for next-generation speech research Shinnosuke Takamichi (UTokyo) SP2019-5 |
Thanks to developments of machine learning techniques including deep learning, solving more diverse issues is required i... [more] |
SP2019-5 p.25 |
EA, ASJ-H, EMM, IPSJ-MUS [detail] |
2018-11-21 13:30 |
Ishikawa |
Hotel Koshuen (Ishikawa) |
Evaluation of DNN-based Low-Musical-Noise Speech Enhancement Using Kurtosis Matching Satoshi Mizoguchi, Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2018-66 EMM2018-66 |
This paper proposes DNN-based speech enhancement with low musical noise by kurtosis matching. Musical noise, artifacts g... [more] |
EA2018-66 EMM2018-66 pp.19-24 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-19 10:00 |
Okinawa |
(Okinawa) |
Experimental Evaluation of Multichannel Audio Source Separation Based on IDLMA Daichi Kitamura, Hayato Sumino, Norihiro Takamune, Shinnosuke Takamichi, Hiroshi Saruwatari (Univ. of Tokyo), Nobutaka Ono (Tokyo Metropolitan Univ.) EA2017-104 SIP2017-113 SP2017-87 |
In this paper, we propose a new informed multichannel audio source separation called independent deeply learned matrix a... [more] |
EA2017-104 SIP2017-113 SP2017-87 pp.13-20 |