Online edition: ISSN 2432-6380
[TOP] | [2018] | [2019] | [2020] | [2021] | [2022] | [2023] | [2024] | [Japanese] / [English]
EA2024-1
Trends in Sound Source Separation and Enhancement at ICASSP2024
Yoshiki Masuyama (TMU)
pp. 1 - 6
EA2024-2
(See Japanese page.)
pp. 7 - 13
EA2024-3
Determined BSS based on the proximal average of IVA and DNNs
Kazuki Matsumoto (Waseda Univ.), Koki Yamada, Kohei Yatabe (TUAT)
pp. 14 - 19
EA2024-4
Why speech enhancement degrades speech recognition performance?
-- Analysis of effect of speech enhancement errors on speech recognition performance --
Tsubasa Ochiai (NTT), Kazuma Iwamoto (Doshisha Univ.), Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki (NTT), Shigeru Katagiri (Doshisha Univ.)
pp. 20 - 21
EA2024-5
Environmental sound synthesis and creation of dataset using vocal imitations
Yuki Okamoto (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Shinnosuke Takamichi (The Univ. of Tokyo/Keio Univ.), Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.)
p. 22
EA2024-6
An anomalous sound detection for industrial machines using acoustical features related to timbral-based metrics
Yasuji Ota, Ryoya Ogura, Masashi Unoki (JAIST)
pp. 23 - 28
EA2024-7
Audio-change Captioning to Explain Machine-sound Anomalies
Shunsuke Tsubaki (Doshisha Univ./Hitachi), Yohei Kawaguchi, Tomoya Nishida (Hitachi), Keisuke Imoto (Doshisha Univ.), Yuki Okamoto (Ritsumeikan Univ./Hitachi), Kota Dohi, Takashi Endo (Hitachi)
pp. 29 - 33
EA2024-8
Incremental Learning for Joint analysis of Acoustic Scenes and Sound Events
Kaori Inoue, Yuka Fukumoto, Naoki Koga, Keisuke Imoto (Doshisha Univ.)
pp. 34 - 37
EA2024-9
[Invited Talk]
Fundamentals of Diffusion Models and their Application to Speech Enhancement and Separation
Robin Scheibler (LY Corp.)
p. 38
Note: Each article is a technical report without peer review, and its polished version will be published elsewhere.