Sat, Dec 2 AM 10:15 - 10:30 |
|
- |
|
Sat, Dec 2 AM 10:30 - 12:00 |
(1) |
10:30-11:00 |
|
(2) |
11:00-11:30 |
|
(3) |
11:30-12:00 |
|
|
12:00-13:30 |
Break ( 90 min. ) |
Sat, Dec 2 PM 13:30 - 14:30 |
(4) SP |
13:30-14:00 |
Effectiveness of Signal Compression in Speech Enhancement with Diffusion Models |
Yuki Nishi (Titech), Koji Iwano (Tokyo City Univ.), Koichi Shinoda (Titech) |
(5) |
14:00-14:30 |
|
|
14:30-14:45 |
Break ( 15 min. ) |
Sat, Dec 2 PM 14:45 - 15:45 |
(6) |
14:45-15:45 |
|
|
15:45-16:00 |
Break ( 15 min. ) |
Sat, Dec 2 PM 16:00 - 17:30 |
(7) SP |
16:00-16:30 |
Development and effects of English speech training drills to improve perception and production skills seamlessly with interactive gamification |
Nobuaki Minematsu, Yingxiang Gao (UTokyo), Noriko Nakanishi (KGU), Yusuke Inoue, Hiroaki Mizuno (Carriage) |
(8) |
16:30-17:00 |
|
(9) |
17:00-17:30 |
|
Sun, Dec 3 AM 09:30 - 11:00 |
(10) SP |
09:30-10:00 |
Enhancing Recognition of Rare Words in ASR through Error Detection and Context-Aware Error Correction |
Jiajun He, Zekun Yang, Tomoki Toda (Nagoya Univ.) |
(11) SP |
10:00-10:30 |
Improvement of Tacotron2 text-to-speech model based on masking operation and positional attention mechanism |
Tong Ma, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) |
(12) |
10:30-11:00 |
|
|
11:00-11:05 |
Break ( 5 min. ) |
Sun, Dec 3 AM 11:05 - 12:00 |
(13) SP |
11:05-12:00 |
[Poster Presentation]
Enhancing Multi-Accent Automated Speech Recognition with Accent-Activated Adapters |
Yuqin Lin, Longbiao Wang, Jianwu Dang (Tianjin Univ. & Univ. of Tokyo), Nobuaki Minematsu (Univ. of Tokyo) |
(14) SP |
11:05-12:00 |
[Poster Presentation]
Enhancing Dysarthric Speech Recognition with Auxiliary Feature Fusion Module: Exploring Articulatory-related Features from Foundation Models |
Yuqin Lin, Longbiao Wang, Jianwu Dang (Tianjin Univ. & Univ. of Tokyo), Nobuaki Minematsu (Univ. of Tokyo) |
(15) SP |
11:05-12:00 |
[Poster Presentation]
Integration of Throat Microphone Recording and Bandwidth Extension for Robust Assessment of L2 Listening |
Yu Xu, Nobuaki Minematsu, Daisuke Saito (Univ. of Tokyo) |
(16) SP |
11:05-12:00 |
[Poster Presentation]
Self-supervised learning model based emotion transfer and intensity control technology for expressive speech synthesis |
Wei Li, Nobuaki Minematsu, Daisuke Saito (Univ. of Tokyo) |
|
12:00-13:00 |
Break ( 60 min. ) |
Sun, Dec 3 PM 13:00 - 14:00 |
(17) |
13:00-14:00 |
|
|
14:00-14:15 |
Break ( 15 min. ) |
Sun, Dec 3 PM 14:15 - 15:45 |
(18) |
14:15-14:45 |
|
(19) |
14:45-15:15 |
|
(20) |
15:15-15:45 |
|
|
15:45-16:00 |
Break ( 15 min. ) |
Sun, Dec 3 PM 16:00 - 18:00 |
(21) |
16:00-16:30 |
|
(22) |
16:30-17:00 |
|
(23) |
17:00-17:30 |
|
(24) |
17:30-18:00 |
|
Mon, Dec 4 AM 09:00 - 10:00 |
(25) |
09:00-09:30 |
|
(26) |
09:30-10:00 |
|
|
10:00-10:10 |
Break ( 10 min. ) |
Mon, Dec 4 AM 10:10 - 11:30 |
(27) |
10:10-10:30 |
|
(28) |
10:30-11:00 |
|
(29) |
11:00-11:30 |
|
|
11:30-11:40 |
Break ( 10 min. ) |
Mon, Dec 4 AM 11:40 - 12:10 |
(30) |
11:40-12:10 |
|
|
12:10-13:00 |
Break ( 50 min. ) |
Mon, Dec 4 PM 13:00 - 13:45 |
(31) SP |
13:00-13:45 |
Report on Participation in Interspeech2023 |
Kentaro Mitsui (rinna), Kohei Matsuura (NTT) |
|
13:45-14:00 |
Break ( 15 min. ) |
Mon, Dec 4 PM 14:00 - 14:45 |
(32) |
14:00-14:25 |
|
(33) |
14:25-14:45 |
|
Mon, Dec 4 PM 14:45 - 15:15 |
|
- |
|