Wed, May 22 PM 13:00 - 14:40 |
(1) |
13:00-13:25 |
Trends in Sound Source Separation and Enhancement at ICASSP2024 |
Yoshiki Masuyama (TMU) |
(2) |
13:25-13:50 |
|
|
(3) |
13:50-14:15 |
Determined BSS based on the proximal average of IVA and DNNs |
Kazuki Matsumoto (Waseda Univ.), Koki Yamada, Kohei Yatabe (TUAT) |
(4) |
14:15-14:40 |
未定
-- 未定 -- |
Tsubasa Ochiai (NTT), Kazuma Iwamoto (Doshisha Univ.), Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki (NTT), Shigeru Katagiri (Doshisha Univ.) |
|
14:40-14:55 |
Break ( 15 min. ) |
Wed, May 22 PM 14:55 - 16:35 |
(5) |
14:55-15:20 |
Environmental sound synthesis and creation of dataset using vocal imitations |
Yuki Okamoto (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Shinnosuke Takamichi (The Univ. of Tokyo/Keio Univ.), Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) |
(6) |
15:20-15:45 |
Anomaly sound detection of industrial equipment using acoustical features related to timbral attribute |
Yasuji Ota, Ryoya Ogura, Masashi Unoki (JAIST) |
(7) |
15:45-16:10 |
Audio-change Captioning to Explain Machine-sound Anomalies |
Shunsuke Tsubaki (Doshisha Univ./Hitachi), Yohei Kawaguchi, Tomoya Nishida (Hitachi), Keisuke Imoto (Doshisha Univ.), Yuki Okamoto (Ritsumeikan Univ./Hitachi), Kota Dohi, Takashi Endo (Hitachi) |
(8) |
16:10-16:35 |
Incremental Learning for Joint analysis of Acoustic Scenes and Sound Events |
Kaori Inoue, Yuka Fukumoto, Naoki Koga, Keisuke Imoto (Doshisha Univ.) |
|
16:35-16:50 |
Break ( 15 min. ) |
Wed, May 22 PM 16:50 - 17:40 |
(9) |
16:50-17:40 |
[Invited Talk]
Fundamentals of Diffusion-based Generative Models and their Application to Speech Enhancement and Separation |
Scheibler Robin (LY Corp.) |