Online edition: ISSN 2432-6380
[TOP] | [2015] | [2016] | [2017] | [2018] | [2019] | [2020] | [2021] | [Japanese] / [English]
SP2018-1
[Invited Talk]
Active perception and object handling by robots with deep learning
Tetsuya Ogata (Waseda University/AIST)
pp. 1 - 2
SP2018-2
Language model utilizing image features for automatic speech recognition
Aiko Hagiwara, Hitoshi Ito, Manon Ichiki, Takeshi Mishima, Shoei Sato (NHK)
pp. 3 - 6
SP2018-3
Study of improving speech intelligibility for glossectomy patients via voice conversion with sound and lip movement.
Seiya Ogino, Hiroki Murakami, Sunao Hara, Masanobu Abe (Okayama Univ.)
pp. 7 - 12
SP2018-4
Multimodal voice conversion using deep bottleneck features and deep canonical correlation analysis
Satoshi Tamura, Kento Horio, Hajime Endo, Satoru Hayamizu (Gifu Univ.), Tomoki Toda (Nagoya Univ.)
pp. 13 - 18
SP2018-5
Sound recovery using vibration mode of an object in video
Yohei Fuse, Yusuke Yasumi, Tetsuya Takiguchi (Kobe Univ.)
pp. 19 - 24
SP2018-6
Analysis of solution diversity about topic model
Toshio Uchiyama (HIU)
pp. 25 - 30
SP2018-7
Saemi Choi (UT), Gloria Zen, Nicu Sebe (UniTrento), Kiyoharu Aizawa (UT)
pp. 31 - 33
SP2018-8
(See Japanese page.)
pp. 35 - 39
SP2018-9
Revisiting interference-free power spectral representations of periodic signals
Hideki Kawahara (Wakayama Univ.), Masanori Morise (Univ. Yamanashi), Kanru Hua (Univ. Illinois)
pp. 41 - 46
SP2018-10
Analysis of speech-to-texture sentiment association characteristics
Win Thuzar Kyaw, Yoshinori Sagisaka (Waseda Univ.)
pp. 47 - 52
SP2018-11
Speaker adaptation in speech synthesis based on neural networks including temporal structure modeling
Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT)
pp. 53 - 58
SP2018-12
Mapping Acoustic Vector Sequence to Document Vector Based on RNN
Ryota Nishimura, Miho Higaki, Norihide Kitaoka (Tokushima Univ.)
pp. 59 - 64
SP2018-13
[Invited Talk]
Koichi Shinoda (TokyoTech)
p. 65
SP2018-14
Discovery of Corresponding Dimensions Between Multiple Multidimensional Sequences
-- Applications and Accelerations of Equivalence Structure Extraction --
Seiya Satoh (AIST), Yoshinobu Takahashi (UEC), Hiroshi Yamakawa (Dwango)
pp. 67 - 71
SP2018-15
Symbol Classification and Pitch Recognition in Offline Handwritten Musical Score
Yuki Hayakawa, Tetsushi Wakabayashi, Yasuji Miyake (Mie Univ.), Wataru Ohyama (Kyushu Univ.)
pp. 73 - 77
Note: Each article is a technical report without peer review, and its polished version will be published elsewhere.