Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SIP, SP, IPSJ-SLP [detail] |
2025-03-04 13:25 |
Okinawa |
(Okinawa) |
[Poster Presentation]
A dynamic data augmentation method using diffusion models for classification of intensive care EEG Takuma Bingo, Hajime Yano, Taichiro Ashizaki, Kazuma Koda, Masaya Togo (Kobe Univ.), Riki Matsumoto (Kobe Univ./Kyoto Univ.), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ.) |
[more] |
|
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2024-06-14 13:50 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Opera-singing voice synthesis from inexperienced voice considering vowel and tempo change. Aoto Sugahara (Kobe Univ.), Soma Kishimoto, Yuji Adachi, Kiyoto Tai (MEC), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ.) SP2024-4 |
Singing voice synthesis technology is widely used in the entertainment field, and in the medical field, it has attracted... [more] |
SP2024-4 pp.17-22 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
MS-Harmonic-Net++ vs SiFi-GAN: Comparison of fundamental frequency controllable fast neural waveform generative models. Sota Shimizu (Kobe Univ./NICT), Takuma Okamoto (NICT), Ryoichi Takashima (Kobe Univ.), Yamato Ohtani (NICT), Tetsuya Takiguchi (Kobe Univ.), Tomoki Toda (Nagoya Univ./NICT), Hisashi Kawai (NICT) SP2023-5 |
Although Harmonic-Net+ has been proposed as a fundamental frequency (fo) and speech rate (SR) controllable fast neural v... [more] |
SP2023-5 pp.20-25 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Opera-singing voice synthesis using Diff-SVC Aoto Sugahara (Kobe Univ.), Soma Kishimoto, Yuji Adachi, Kiyoto Tai (MEC Company Ltd.), Ryoichi Takashima, Testuya Takiguchi (Kobe Univ.) SP2023-7 |
Singing voice synthesis technology is widely used in the entertainment field, it has attracted attention as a method to ... [more] |
SP2023-7 pp.30-35 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Generation of colored subtitle images based on emotional information of speech utterances Fumiya Nakamura (Kobe Univ.), Ryo Aihara (Mitsubishi Electric), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ.), Yusuke Itani (Mitsubishi Electric) SP2023-11 |
Conventional automatic subtitle generation systems based on speech recognition do not take into account paralinguistic i... [more] |
SP2023-11 pp.54-59 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
Fast Neural Waveform Generation Model With Fully Connected Upsampling Haruki Yamashita (Kobe cniv/NICT), Takuma Okamoto (NICT), Ryoichi Takashima (Kobe Univ), Yamato Ohtani (NICT), Tetsuya Takiguchi (Kobe Univ), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) SP2023-15 |
In recent years, in text-to-speech synthesis, it is required to improve the inference speed while keeping the quality.
... [more] |
SP2023-15 pp.73-78 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-02-28 09:10 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Comparison of fundamental frequency controllable fast neural waveform generative models. Sota Shimizu (Kobe Univ./NICT), Takuma Okamoto (NICT), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ.), Tomoki Toda (Nagoya Univ./NICT), Hisashi Kawai (NICT) EA2022-75 SIP2022-119 SP2022-39 |
Neural vocoders, which reconstruct speech waveforms from acoustic features with deep neural networks, have significantly... [more] |
EA2022-75 SIP2022-119 SP2022-39 pp.1-6 |
SP, IPSJ-SLP, EA, SIP [detail] |
2023-02-28 09:30 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
MS-FC-HiFiGAN : Fast Neural Waveform Generation Model With Learnable Lightweight Upsampling Haruki Yamashita (Kobe Univ/NICT), Takuma Okamoto (NICT), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ), Tomoki Toda (Nagoya Univ/NICT), Hisashi Kawai (NICT) EA2022-76 SIP2022-120 SP2022-40 |
In recent years, in text-to-speech synthesis, it is required to improve the inference speed while keeping the quality.
... [more] |
EA2022-76 SIP2022-120 SP2022-40 pp.7-12 |
SP, EA, SIP |
2020-03-02 16:10 |
Okinawa |
Okinawa Industry Support Center (Okinawa) (Cancelled but technical report was issued) |
Dysarthric Speech Recognition Based on Deep Metric Learning Yuki Takashima, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) EA2019-132 SIP2019-134 SP2019-81 |
[more] |
EA2019-132 SIP2019-134 SP2019-81 pp.181-186 |
WIT, SP |
2019-10-26 16:40 |
Kagoshima |
Daiichi Institute of Technology (Kagoshima) |
Transfer Learning Using the Speech Data of Persons with Dysarthria Speaking Different Languages for Dysarthric Speech Recognition Yuki Takashima, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2019-25 WIT2019-24 |
[more] |
SP2019-25 WIT2019-24 pp.45-50 |
SP |
2019-08-28 14:40 |
Kyoto |
Kyoto Univ. (Kyoto) |
[Poster Presentation]
Improvement of generalization performance of non-task-oriented Dialogue System by use of WordNet Taisei Aso, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2019-12 |
(To be available after the conference date) [more] |
SP2019-12 pp.19-24 |
SP, IPSJ-SLP |
2012-12-21 10:15 |
Tokyo |
TITECH(Ookayama) (Tokyo) |
Interpolation of unlearned position based on local regression for single-channel talker localization using acoustic transfer function Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2012-92 |
This paper presents a sound source (talker) localization method using only a single microphone. In our previous work, we... [more] |
SP2012-92 pp.75-80 |
SP, IPSJ-SLP |
2012-12-21 15:25 |
Tokyo |
TITECH(Ookayama) (Tokyo) |
Sparse Coding-Based Voice Conversion from Lip Information Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2012-95 |
A technology to recognize speech content from lip motion is called visual speech recognition (VSR). VSRis an important c... [more] |
SP2012-95 pp.119-124 |
SP |
2011-07-23 13:00 |
Hokkaido |
Jozankei Grand Hotel (Hokkaido) |
Estimation of Head Orientation Based on Discrimination of Cross-power Spectrum Phase Coefficients Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2011-51 |
This paper presents a talker's head orientation estimation method using 2ch microphones. In recent researches, some app... [more] |
SP2011-51 pp.57-62 |
EA, SIP, SP |
2011-05-13 13:50 |
Osaka |
Ritsumeikan Univ. (Osaka) |
Estimation of Head Orientation Based on Discrimination of Acoustic Transfer Functions Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) EA2011-29 SIP2011-29 SP2011-29 |
This paper presents a talker's head orientation estimation method using only a single microphone, where phoneme HMMs (Hi... [more] |
EA2011-29 SIP2011-29 SP2011-29 pp.167-172 |
SP |
2011-01-28 10:15 |
Kyoto |
NICT (Kyoto) |
Feature Selection for Single-Channel Sound Source Localization Using the Acoustic Transfer Function Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2010-111 |
This paper presents a sound source (talker) localization method using only a single microphone. In our previous work, w... [more] |
SP2010-111 pp.49-54 |