IEICE Technical Committee Submission System

All Technical Committee Conferences  (Searched in: Recent 10 Years)

Search Results: Conference Papers
 Conference Papers (Available on Advance Programs)  (Sort by: Date Descending)
 Results 1 - 20 of 135
Committee Date Time Place Paper Title / Authors Abstract Paper #
EA, SIP, SP, IPSJ-SLP [detail] 2025-03-02
13:00
Okinawa (Okinawa) Zero-Shot Speech Synthesis Directly Referring Target Speech Through Attention Mechanisms
Kyohei Nakatsuka, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.)
(To be available after the conference date) [more]
EA, SIP, SP, IPSJ-SLP [detail] 2025-03-04
13:25
Okinawa (Okinawa) [Poster Presentation] Speech Synthesis from Electrocorticogram During Imagined Speech Using a Transformer-Based Decoder
Shuji Komeiji, Kai Shigemi (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendou Univ.), Koichi Shinoda (Science Tokyo), Kohei Yatabe, Toshihisa Tanaka (TUAT)
(To be available after the conference date) [more]
EMM 2025-01-21
16:15
Miyagi Tohoku Univ. (Miyagi) Study on signal modification method for protecting voice personae against speech synthesis system
Nopparut Li, Candy Olivia Mawalim, Masashi Unoki (JAIST) EMM2024-109
This paper explores the audibility and effectiveness of various signal modification methods in protecting voice personae... [more] EMM2024-109
pp.37-42
HCGSYMPO
(2nd)
2024-12-11
- 2024-12-13
Ishikawa The Kanazawa Theatre (Ishikawa) Recognition of Auditory Distance Across Speakers and Languages: Is Physical "Sense of Distance" Universal?
Haruka Murakami (Tamagawa Univ.)
(To be available after the conference date) [more]
CS
(2nd)
2024-11-07
14:05
Osaka I-site Namba, Osaka Metropolitan Univ. (Osaka) [Invited Talk]
Takuma Okamoto (NICT)
NICT has successfully developed a 21-language, fast and high-fidelity neural text-to-speech technology. The development ... [more]
WIT 2024-10-19
14:20
Tochigi Teikyo University Utsunomiya Campus (Tochigi) Development of an SSL (Social Skills Learning) Metaverse for Safe and Secure Interactions -- Implementation of AI avatars in the non-attendance support metaverse --
Mariko Oda, Ryosuke Yasaka, Taiga Haruta, Hiroshi Kono (Kurume IT) WIT2024-7
In recent years, school refusal has become a serious problem, and the number of truant students in primary and secondary... [more] WIT2024-7
pp.13-16
EA, ASJ-H, ASJ-MA, ASJ-SP 2024-07-06
14:50
Hokkaido (Hokkaido) A technique of generating source signals for speech synthesis taking account of fluctuation phenomena
Naofumi Aoki (Hokkaido Univ.) EA2024-13
This study has investigated a technique that reproduces waveform fluctuations for synthesized speech by limiting Q of it... [more] EA2024-13
pp.13-18
SP, IPSJ-MUS, IPSJ-SLP [detail] 2024-06-15
13:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
[Poster Presentation] A voice synthesizer operated by fingers to control its vocal-tract area function.
Amane Koriki, Masashi Ito (Tohtech) SP2024-7
Several studies have been made on real-time speech synthesizers in which users’ arm or hand motions are immediately co... [more] SP2024-7
pp.33-36
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
09:30
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
An experimental survey on speaker embedding spaces for controlling speaker identity in speech synthesis system
Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) EA2023-93 SIP2023-140 SP2023-75
This study investigated the influence of the discriminability of speaker encoders on speech synthesis models that can co... [more] EA2023-93 SIP2023-140 SP2023-75
pp.190-195
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
09:30
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Multi-Dialect Speech Synthesis with Interpretable Accent Latent Variable based on VQ-VAE
Kazuki Yamauchi, Yuki Saito, Hiroshi Saruwatari (UTokyo) EA2023-98 SIP2023-145 SP2023-80
In this paper, we address two tasks: "Intra-dialect Text-to-Speech (TTS)," aiming to synthesize speech in the same diale... [more] EA2023-98 SIP2023-145 SP2023-80
pp.220-225
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
10:40
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Intermediate speaker speech synthesis between two speakers using x-vector speaker space
Sota Hosoi, Takahiro Kinouchi, Yukoh Wakabayashi, Norihide Kitaoka (TUT) EA2023-103 SIP2023-150 SP2023-85
Recent advancements in speech synthesis technologies have enabled the synthesis of speeches of speakers not in the train... [more] EA2023-103 SIP2023-150 SP2023-85
pp.250-255
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
10:40
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
An Investigation on the Speech Recovery from EEG Signals Using Transformer
Tomoaki Mizuno (The Univ. of Electro-Communications), Takuya Kishida (Aichi Shukutoku Univ.), Natsue Yoshimura (Tokyo Tech), Toru Nakashika (The Univ. of Electro-Communications) EA2023-108 SIP2023-155 SP2023-90
Synthesizing full speech from ElectroEncephaloGraphy (EEG) signals is a challenging task. In this paper, speech reconstru... [more] EA2023-108 SIP2023-155 SP2023-90
pp.277-282
SIP, SP, EA, IPSJ-SLP [detail] 2024-03-01
16:35
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
Discrimination of rotation direction of virtual sound source in binaural synthesis using sound source radiation characteristics
Orie Nishiyama (Chiba Institute of Technology), Toshiharu Horiuchi, Shota Okubo (KDDI Research, Inc.), Yoshifumi Chisaki (Chiba Institute of Technology) EA2023-125 SIP2023-172 SP2023-107
In order to provide the sensation of being there, research has been conducted on realistic communication that acquires, ... [more] EA2023-125 SIP2023-172 SP2023-107
pp.376-381
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] 2023-12-03
10:00
Tokyo Kikai-Shinko-Kaikan Bldg. (Tokyo, Online)
(Primary: On-site, Secondary: Online)
Improvement of Tacotron2 text-to-speech model based on masking operation and positional attention mechanism
Tong Ma, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) NLC2023-17 SP2023-37
 [more] NLC2023-17 SP2023-37
pp.19-24
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] 2023-12-03
11:05
Tokyo Kikai-Shinko-Kaikan Bldg. (Tokyo, Online)
(Primary: On-site, Secondary: Online)
[Poster Presentation] Self-supervised learning model based emotion transfer and intensity control technology for expressive speech synthesis
Wei Li, Nobuaki Minematsu, Daisuke Saito (Univ. of Tokyo) NLC2023-21 SP2023-41
Emotion transfer techniques, which transfer the speaking style from the reference speech to the target speech, are wi... [more] NLC2023-21 SP2023-41
pp.43-48
PRMU, IPSJ-CVIM, IPSJ-DCC, IPSJ-CGVI 2023-11-17
09:20
Tottori (Tottori, Online)
(Primary: On-site, Secondary: Online)
Co-speech Gesture Generation with Variational Auto Encoder
Shihichi Ka, Koichi Shinoda (Tokyo Tech) PRMU2023-29
Co-speech gesture generation is the study of generating gestures from speech. In prior works, deterministic methods lear... [more] PRMU2023-29
pp.74-79
SP, IPSJ-MUS, IPSJ-SLP [detail] 2023-06-23
13:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
[Poster Presentation] MS-Harmonic-Net++ vs SiFi-GAN: Comparison of fundamental frequency controllable fast neural waveform generative models.
Sota Shimizu (Kobe Univ./NICT), Takuma Okamoto (NICT), Ryoichi Takashima (Kobe Univ.), Yamato Ohtani (NICT), Tetsuya Takiguchi (Kobe Univ.), Tomoki Toda (Nagoya Univ./NICT), Hisashi Kawai (NICT) SP2023-5
Although Harmonic-Net+ has been proposed as a fundamental frequency (fo) and speech rate (SR) controllable fast neural v... [more] SP2023-5
pp.20-25
SP, IPSJ-MUS, IPSJ-SLP [detail] 2023-06-24
13:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
Fast Neural Waveform Generation Model With Fully Connected Upsampling
Haruki Yamashita (Kobe Univ/NICT), Takuma Okamoto (NICT), Ryoichi Takashima (Kobe Univ), Yamato Ohtani (NICT), Tetsuya Takiguchi (Kobe Univ), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-15
In recent years, in text-to-speech synthesis, it is required to improve the inference speed while keeping the quality. ... [more] SP2023-15
pp.73-78
SP, IPSJ-MUS, IPSJ-SLP [detail] 2023-06-24
13:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
Effect of pause length ratio in speech length on the perception of speech rate induced by speech length
Maho Tamakawa, Shuichi Sakamoto (Tohoku Univ.) SP2023-23
The goal of this study is to investigate the mechanism of the perception of speech rate. In this preliminary study, we i... [more] SP2023-23
pp.114-118
SP, IPSJ-MUS, IPSJ-SLP [detail] 2023-06-24
13:50
Tokyo (Tokyo, Online)
(Primary: On-site, Secondary: Online)
Evaluation of multi-speaker text-to-speech synthesis using a corpus for speech recognition with x-vectors for various speech styles
Koki Hida (Wakayama Univ/NICT), Takuma Okamoto (NICT), Ryuichi Nisimura (Wakayama Univ), Yamato Ohtani (NICT), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-25
We have implemented multi-speaker end-to-end text-to-speech synthesis based on JETS using x-vectors as speaker embedding... [more] SP2023-25
pp.125-130
Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)




The Institute of Electronics, Information and Communication Engineers (IEICE), Japan