Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
PRMU, SP |
2018-06-28 13:00 |
Nagano |
|
[Invited Talk] Active perception and object handling by robots with deep learning Tetsuya Ogata (Waseda University/AIST) PRMU2018-21 SP2018-1 |
|
PRMU2018-21 SP2018-1 pp.1-2 |
PRMU, SP |
2018-06-28 14:10 |
Nagano |
|
Language model utilizing image features for automatic speech recognition Aiko Hagiwara, Hitoshi Ito, Manon Ichiki, Takeshi Mishima, Shoei Sato (NHK) PRMU2018-22 SP2018-2 |
NHK is pursuing the development of a system using speech recognition for the closed caption production of live broadcast... |
PRMU2018-22 SP2018-2 pp.3-6 |
PRMU, SP |
2018-06-28 14:40 |
Nagano |
|
Study of improving speech intelligibility for glossectomy patients via voice conversion with sound and lip movement. Seiya Ogino, Hiroki Murakami, Sunao Hara, Masanobu Abe (Okayama Univ.) PRMU2018-23 SP2018-3 |
In this paper, we propose multimodal voice conversion based on a deep neural network using audio and lip movement info... |
PRMU2018-23 SP2018-3 pp.7-12 |
PRMU, SP |
2018-06-28 15:10 |
Nagano |
|
Multimodal voice conversion using deep bottleneck features and deep canonical correlation analysis Satoshi Tamura, Kento Horio, Hajime Endo, Satoru Hayamizu (Gifu Univ.), Tomoki Toda (Nagoya Univ.) PRMU2018-24 SP2018-4 |
In this paper, we aim at improving the speech quality in voice conversion and propose a novel multi-modal voice conversi... |
PRMU2018-24 SP2018-4 pp.13-18 |
PRMU, SP |
2018-06-28 15:40 |
Nagano |
|
Sound recovery using vibration mode of an object in video Yohei Fuse, Yusuke Yasumi, Tetsuya Takiguchi (Kobe Univ.) PRMU2018-25 SP2018-5 |
When a sound hits an object, it causes the surface of the object to vibrate. Some research has been carried out on the r... |
PRMU2018-25 SP2018-5 pp.19-24 |
PRMU, SP |
2018-06-28 16:30 |
Nagano |
|
Analysis of solution diversity about topic model Toshio Uchiyama (HIU) PRMU2018-26 SP2018-6 |
A topic model is known as a useful statistical model for analyzing text data and images. Many different parameters (s... |
PRMU2018-26 SP2018-6 pp.25-30 |
PRMU, SP |
2018-06-28 16:45 |
Nagano |
|
Saemi Choi (UT), Gloria Zen, Nicu Sebe (UniTrento), Kiyoharu Aizawa (UT) PRMU2018-27 SP2018-7 |
|
PRMU2018-27 SP2018-7 pp.31-33 |
PRMU, SP |
2018-06-28 17:00 |
Nagano |
|
PRMU2018-28 SP2018-8 |
(To be available after the conference date) |
PRMU2018-28 SP2018-8 pp.35-39 |
PRMU, SP |
2018-06-29 10:00 |
Nagano |
|
Revisiting interference-free power spectral representations of periodic signals Hideki Kawahara (Wakayama Univ.), Masanori Morise (Univ. Yamanashi), Kanru Hua (Univ. Illinois) PRMU2018-29 SP2018-9 |
We propose two algorithms to calculate interference-free power spectra of periodic signals. This set of algorithms is ou... |
PRMU2018-29 SP2018-9 pp.41-46 |
PRMU, SP |
2018-06-29 10:30 |
Nagano |
|
Analysis of speech-to-texture sentiment association characteristics Win Thuzar Kyaw, Yoshinori Sagisaka (Waseda Univ.) PRMU2018-30 SP2018-10 |
Aiming at speech visualization using textures or finding texture generation scheme from sentiment information embedded i... |
PRMU2018-30 SP2018-10 pp.47-52 |
PRMU, SP |
2018-06-29 11:00 |
Nagano |
|
Speaker adaptation in speech synthesis based on neural networks including temporal structure modeling Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT) PRMU2018-31 SP2018-11 |
This paper proposes a speaker adaptation technique for speech synthesis based on deep neural networks (DNNs) using a str... |
PRMU2018-31 SP2018-11 pp.53-58 |
PRMU, SP |
2018-06-29 11:30 |
Nagano |
|
Mapping Acoustic Vector Sequence to Document Vector Based on RNN Ryota Nishimura, Miho Higaki, Norihide Kitaoka (Tokushima Univ.) PRMU2018-32 SP2018-12 |
In this research, we propose a method of searching between different media (cross media mapping) using deep learning (Ma... |
PRMU2018-32 SP2018-12 pp.59-64 |
PRMU, SP |
2018-06-29 13:00 |
Nagano |
|
[Invited Talk] Koichi Shinoda (TokyoTech) PRMU2018-33 SP2018-13 |
(To be available after the conference date) |
PRMU2018-33 SP2018-13 p.65 |
PRMU, SP |
2018-06-29 14:10 |
Nagano |
|
Discovery of Corresponding Dimensions Between Multiple Multidimensional Sequences -- Applications and Accelerations of Equivalence Structure Extraction -- Seiya Satoh (AIST), Yoshinobu Takahashi (UEC), Hiroshi Yamakawa (Dwango) PRMU2018-34 SP2018-14 |
|
PRMU2018-34 SP2018-14 pp.67-71 |
PRMU, SP |
2018-06-29 14:25 |
Nagano |
|
Symbol Classification and Pitch Recognition in Offline Handwritten Musical Score Yuki Hayakawa, Tetsushi Wakabayashi, Yasuji Miyake (Mie Univ.), Wataru Ohyama (Kyushu Univ.) PRMU2018-35 SP2018-15 |
To realize automatic recognition of handwritten music scores in Optical Music Recognition (OMR), there are many problems ... |
PRMU2018-35 SP2018-15 pp.73-77 |