Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Modeling learners’ pronunciation variations and its application to automatic phoneme error detection Zhang Haoyu, Saito Daisuke, Minematsu Nobuaki (UTokyo), Kobashikawa Satoshi, Masumura Ryo (NTT) EA2018-119 SIP2018-125 SP2018-81 |
|
EA2018-119 SIP2018-125 SP2018-81 pp.119-124 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Initial analysis of emotional speech acted in noise Yi Zhao (NII), Atsushi Ando (NTT), Shinji Takaki, Junichi Yamagishi (NII), Satoshi Kobashikawa (NTT) EA2018-120 SIP2018-126 SP2018-82 |
Speakers usually adjust their way of talking in noisy environments involuntarily for effective communication; this adapt... |
EA2018-120 SIP2018-126 SP2018-82 pp.125-130 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
CWT spectral loss for training a DNN-based speech waveform model Shinji Takaki (NII), Hirokazu Kameoka (NTT), Junichi Yamagishi (NII) EA2018-121 SIP2018-127 SP2018-83 |
|
EA2018-121 SIP2018-127 SP2018-83 pp.131-135 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
A robust algorithm of phase recovery for speech enhancement Dongxiao Wang, Koichi Shinoda (TokyoTech), Hirokazu Kameoka (NTT) EA2018-122 SIP2018-128 SP2018-84 |
|
EA2018-122 SIP2018-128 SP2018-84 pp.137-142 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Adaptive beamformer for desired source extraction with neural network based direction of arrival estimation Yu Nakagome (Waseda Univ.), Masahito Togami (LINE) EA2018-123 SIP2018-129 SP2018-85 |
|
EA2018-123 SIP2018-129 SP2018-85 pp.143-147 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
MVDR beamformer based on time-frequency-bin-wise switching technique for underdetermined speech enhancement Kouei Yamaoka (Univ. of Tsukuba), Nobutaka Ono (Tokyo Metropolitan Univ.), Shoji Makino, Takeshi Yamada (Univ. of Tsukuba) EA2018-124 SIP2018-130 SP2018-86 |
In this paper, we present an underdetermined speech enhancement method called the time-frequency-bin-wise switching beam... |
EA2018-124 SIP2018-130 SP2018-86 pp.149-154 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Diffuse noise reduction using adversarial denoising autoencoder Hikari Tanabe, Naohiro Tawara, Tetsunori Kobayashi (Waseda Univ.), Masaru Fujieda, Katagiri Kazuhiro, Takashi Yazu (OKI), Tetsuji Ogawa (Waseda Univ.) EA2018-125 SIP2018-131 SP2018-87 |
In this study, we attempted to remove diffuse noise by a model combining a prefilter and an adversarial denoising autoen... |
EA2018-125 SIP2018-131 SP2018-87 pp.155-160 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Use and evaluation of Tacotron and context features in rakugo speech synthesis Shuhei Kato (SOKENDAI/NII), Shinji Takaki, Junichi Yamagishi (NII), Yusuke Yasuda (SOKENDAI/NII), Xin Wang (NII) EA2018-126 SIP2018-132 SP2018-88 |
We have been working on constructing rakugo (a traditional Japanese verbal entertainment) speech synthesis toward speech... |
EA2018-126 SIP2018-132 SP2018-88 pp.161-166 |
EA, SIP, SP |
2019-03-14 15:15 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
Convergence-guaranteed independent positive semidefinite tensor analysis for blind source separation Kanta Fukushige, Norihiro Takamune (UTokyo), Daichi Kitamura (Kagawa-NICT), Hiroshi Saruwatari (UTokyo), Rintaro Ikeshita, Tomohiro Nakatani (NTT) EA2018-127 SIP2018-133 SP2018-89 |
This paper focuses on independent positive semidefinite tensor analysis (IPSDTA), which is a technique for over-determin... |
EA2018-127 SIP2018-133 SP2018-89 pp.167-172 |
EA, SIP, SP |
2019-03-14 15:40 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
Estimation of rank-constrained spatial covariance model based on multivariate complex Student's t distribution for blind source separation Yuki Kubo, Norihiro Takamune (UTokyo), Daichi Kitamura (Kagawa NCIT), Hiroshi Saruwatari (UTokyo) EA2018-128 SIP2018-134 SP2018-90 |
In this paper, we generalize a generative model in estimation of rank-constrained spatial covariance model that separate... |
EA2018-128 SIP2018-134 SP2018-90 pp.173-178 |
EA, SIP, SP |
2019-03-14 16:05 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
A Study on Speech Synthesis Based on Deep Gaussian Processes and Latent Variable Representation of Accent Tomoki Koriyama, Takao Kobayashi (Tokyo Tech) EA2018-129 SIP2018-135 SP2018-91 |
|
EA2018-129 SIP2018-135 SP2018-91 pp.179-184 |
EA, SIP, SP |
2019-03-14 16:40 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Invited Talk]
Encouragement of Participation in Competition
-- Looking Back on Hitachi's Participation in DCASE 2018 Challenge -- Yohei Kawaguchi, Ryo Tanabe, Takashi Endo, Yuki Nikaido, Kenji Ichige, Phong Nguyen, Koichi Hamada (Hitachi) |
Hitachi participated in Task 5 of the DCASE 2018 Challenge, which is a flagship competition of acoustic scene classification... |
|
EA, SIP, SP |
2019-03-15 10:00 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
Consideration on Effectiveness of Relative Phase from Residual Speech for Speaker Recognition Seiichi Nakagawa, Kazumasa Yamamoto (Chubu Univ.) EA2018-130 SIP2018-136 SP2018-92 |
We have focused on the phase spectrum for speaker recognition and proposed relative phase as a feature parameter for spea... |
EA2018-130 SIP2018-136 SP2018-92 pp.185-190 |
EA, SIP, SP |
2019-03-15 10:25 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
Neural Language Models based on Conditional Hierarchical Recurrent Encoder-Decoder for Multi-Party Conversational Speech Recognition Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Takanobu Oba, Yushi Aono (NTT) EA2018-131 SIP2018-137 SP2018-93 |
This paper presents fully neural network based language models (LMs) that can leverage long-range conversational context... |
EA2018-131 SIP2018-137 SP2018-93 pp.191-196 |
EA, SIP, SP |
2019-03-15 10:50 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
Likability Estimation Model Training of Call-center Agents Based on Annotators' Skills Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono (NTT) EA2018-132 SIP2018-138 SP2018-94 |
This paper proposes a new technique for estimating the likability of call-center agents. Most techniques of likability ... |
EA2018-132 SIP2018-138 SP2018-94 pp.197-202 |
EA, SIP, SP |
2019-03-15 11:25 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Invited Talk]
Realization of real-time blind source separation with auxiliary-function-based algorithms Nobutaka Ono (TMU) EA2018-133 SIP2018-139 SP2018-95 |
Blind source separation is a signal processing technique to estimate sound source signals only from the observation of m... |
EA2018-133 SIP2018-139 SP2018-95 p.203 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
A Design of Reduced Phoneme Set Based on a Language Model Shuji Komeiji, Toshihisa Tanaka (Tokyo Univ. of Agriculture and Tech.) EA2018-134 SIP2018-140 SP2018-96 |
A design of reduced phoneme set based on a language model is proposed. The reduction of the phoneme set improves discrim... |
EA2018-134 SIP2018-140 SP2018-96 pp.205-210 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
Pseudo-Multidimensional Processing of Geomagnetic Field Data Measured using HTS-SQUID Magnetometers for Removing Flux Trapping Noise Kai Yokoyama, Kiyoshi Nishikawa (Tokyo Metro Univ) EA2018-135 SIP2018-141 SP2018-97 |
(To be available after the conference date) |
EA2018-135 SIP2018-141 SP2018-97 pp.211-216 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
EA2018-136 SIP2018-142 SP2018-98 |
Epilepsy is a chronic brain disorder that affects 50 million people in the world. To diagnose epilepsy, specialists manual... |
EA2018-136 SIP2018-142 SP2018-98 pp.217-222 |
EA, SIP, SP |
2019-03-15 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) |
[Poster Presentation]
A compressed sensing approach to hyperspectral pansharpening Saori Takeyama, Shunsuke Ono, Itsuo Kumazawa (Tokyo Tech) EA2018-137 SIP2018-143 SP2018-99 |
(To be available after the conference date) |
EA2018-137 SIP2018-143 SP2018-99 pp.223-227 |