Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
PRMU, IPSJ-CVIM, IPSJ-CGVI, IPSJ-DCC |
2024-11-30 11:00 |
Fukui |
(Fukui, Online) (Primary: On-site, Secondary: Online) |
[Invited Talk] Deep image synthesis with physical and geometrical constraints Takuhiro Kaneko (NTT) PRMU2024-26 |
Since the third AI boom began in the 2000s, deep learning has been making tremendous progress in various fields. One of ... |
PRMU2024-26 p.99 |
SIP, SP, EA, IPSJ-SLP |
2024-03-01 09:30 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
SELECTING N-LOWEST SCORES FOR TRAINING MOS PREDICTION MODELS Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko (NTT) EA2023-94 SIP2023-141 SP2023-76 |
Automatic speech quality assessment (SQA) is a task to evaluate the quality of speech samples without resorting to time-... |
EA2023-94 SIP2023-141 SP2023-76 pp.196-201 |
SP |
2019-08-28 13:30 |
Kyoto |
Kyoto Univ. (Kyoto) |
WaveCycleGAN2: Neural Waveform Post-Filter For High-Quality Speech Generation Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo (NTT) SP2019-9 |
|
SP2019-9 pp.1-6 |
SP |
2019-08-28 13:55 |
Kyoto |
Kyoto Univ. (Kyoto) |
Sequence-to-Sequence Voice Conversion Using Context Preservation Mechanism Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo (NTT) SP2019-10 |
|
SP2019-10 pp.7-12 |
PRMU, SP |
2017-06-22 14:45 |
Miyagi |
(Miyagi) |
Postfiltering of STFT Spectrograms Based on Generative Adversarial Networks Takuhiro Kaneko (NTT), Shinji Takaki (NII), Hirokazu Kameoka (NTT), Junichi Yamagishi (NII) PRMU2017-28 SP2017-4 |
This paper presents postfiltering of short-term Fourier transform (STFT) spectrograms based on Generative Adversarial Ne... |
PRMU2017-28 SP2017-4 pp.17-22 |
SP, SIP, EA |
2017-03-02 12:45 |
Okinawa |
Okinawa Industry Support Center (Okinawa) |
Non-native speech conversion with consistency-aware recursive network and generative adversarial network Keisuke Oyamada (Univ. of Tsukuba), Hirokazu Kameoka, Takuhiro Kaneko (NTT), Hiroyasu Ando (Univ. of Tsukuba), Kaoru Hiramatsu, Kunio Kashino (NTT) EA2016-139 SIP2016-194 SP2016-134 |
This paper deals with the problem of automatically modifying the pronunciation of non-native speech. Since the pronunci... |
EA2016-139 SIP2016-194 SP2016-134 pp.315-320 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) |
2016-12-20 16:40 |
Tokyo |
NTT Musashino R&D (Tokyo) |
Generative Adversarial Network-based Postfiltering for Statistical Parametric Speech Synthesis Takuhiro Kaneko, Hirokazu Kameoka, Nobukatsu Hojo, Yusuke Ijima, Kaoru Hiramatsu, Kunio Kashino (NTT) SP2016-61 |
In the field of speech synthesis, statistical parametric speech synthesis has been widely used due to the flexibility an... |
SP2016-61 pp.89-94 |
PRMU, IBISML, IPSJ-CVIM (Joint) |
2012-09-03 11:00 |
Tokyo |
(Tokyo) |
Multiple-person activity recognition with group relationships Shigeyuki Odashima, Masamichi Shimosaka, Takuhiro Kaneko, Rui Fukui, Tomomasa Sato (The Univ. of Tokyo) PRMU2012-44 IBISML2012-27 |
In this paper, we propose an activity localization method with contextual information of multiple person relationships. ... |
PRMU2012-44 IBISML2012-27 pp.127-132 |
PRMU, IBISML, IPSJ-CVIM (Joint) |
2012-09-03 15:30 |
Tokyo |
(Tokyo) |
Group Activity Recognition by CRFs with Multiscale Dependency Takuhiro Kaneko, Masamichi Shimosaka, Shigeyuki Odashima, Rui Fukui, Tomomasa Sato (The Univ. of Tokyo) PRMU2012-49 IBISML2012-32 |
Group activity recognition has gained attention in the computer vision community. Examples of group activities are queueing ... |
PRMU2012-49 IBISML2012-32 pp.185-190 |