===============================================
Special Interest Group on Music and Computer (IPSJ-MUS)
===============================================
Technical Committee on Speech (SP)
Chair: Takahiro Shinozaki (Tokyo Inst. of Tech)
Secretary: Atsushi Ando (NTT), Kei Hashimoto (Nagoya Inst. of Tech.)
Assistant: Motoi Omachi (LY Corp.), Yuuki Saito (Univ. of Tokyo)
===============================================
Special Interest Group on Spoken Language Processing (IPSJ-SLP)
DATE:
Fri, Jun 14, 2024 09:20 - 17:40
Sat, Jun 15, 2024 09:30 - 18:00
PLACE:
TOPICS:
----------------------------------------
Fri, Jun 14 AM (09:20 - 09:30)
----------------------------------------
(1) 09:20 - 09:30
----------------------------------------
Fri, Jun 14 AM (09:30 - 10:40)
----------------------------------------
(2) 09:30 - 10:40
----- Break ( 10 min. ) -----
----------------------------------------
Fri, Jun 14 AM (10:50 - 12:00)
----------------------------------------
(3) 10:50 - 12:00
----- Break ( 10 min. ) -----
----------------------------------------
Fri, Jun 14 PM (12:10 - 12:50)
----------------------------------------
----- Break ( 60 min. ) -----
----------------------------------------
Fri, Jun 14 PM (13:50 - 16:10)
----------------------------------------
(4) 13:50 - 16:10
(5) 13:50 - 16:10
(6) 13:50 - 16:10
(7) 13:50 - 16:10
(8) 13:50 - 16:10
(9) 13:50 - 16:10
(10) 13:50 - 16:10
(11) 13:50 - 16:10
(12) 13:50 - 16:10
(13) 13:50 - 16:10
(14) 13:50 - 16:10
(15) 13:50 - 16:10
(16) 13:50 - 16:10
(17) 13:50 - 16:10
(18) 13:50 - 16:10
(19) 13:50 - 16:10
(20) 13:50 - 16:10
(21) 13:50 - 16:10
(22) 13:50 - 16:10
(23) 13:50 - 16:10
(24) 13:50 - 16:10
(25) 13:50 - 16:10
(26) 13:50 - 16:10
(27) 13:50 - 16:10
(28) 13:50 - 16:10
(29) 13:50 - 16:10
(30) 13:50 - 16:10
(31) 13:50 - 16:10
(32) 13:50 - 16:10
(33)/SP 13:50 - 16:10
[Poster Presentation]
Computation of acoustic field in the vicinity of the wedge-shaped cut imitating the lips by using FDTD method
Chiune Sato, Kunitoshi Motoki (Hokkai-Gakuen Univ.)
(34)/SP 13:50 - 16:10
Real-time target speaker extraction using direction and voice features
Kengo Ohara, Kosuke Osumi, Erika Yamamoto, Masanori Morobishi, Takayuki Arakawa (Kyocera Corp.)
(35)/SP 13:50 - 16:10
Evaluation of nonlinear distortion for hearing loss simulators and its relationship to algorithms
Shintaro Doan, Toshio Irino, Minami Ishikawa (Wakayama Univ.)
(36)/SP 13:50 - 16:10
[Poster Presentation]
Opera-singing voice synthesis from inexperienced voice considering vowel and tempo change.
Aoto Sugahara (Kobe Univ.), Soma Kishimoto, Yuji Adachi, Kiyoto Tai (MEC), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ.)
(37)/SP 13:50 - 16:10
Beyond Word Count: Exploring Approximated Target Lengths for CIF-RNNT
Wen Shen Teo, Yasuhiro Minami (UEC)
----- Break ( 20 min. ) -----
----------------------------------------
Fri, Jun 14 PM (16:30 - 17:40)
----------------------------------------
(38) 16:30 - 17:40
----------------------------------------
Sat, Jun 15 AM (09:30 - 10:40)
----------------------------------------
(39) 09:30 - 09:50
(40) 09:50 - 10:40
----- Break ( 10 min. ) -----
----------------------------------------
Sat, Jun 15 AM (10:50 - 12:00)
----------------------------------------
(41) 10:50 - 12:00
----- Break ( 10 min. ) -----
----------------------------------------
Sat, Jun 15 AM (12:10 - 12:50)
----------------------------------------
----- Break ( 60 min. ) -----
----------------------------------------
Sat, Jun 15 PM (13:50 - 16:10)
----------------------------------------
(42) 13:50 - 16:10
(43) 13:50 - 16:10
(44) 13:50 - 16:10
(45) 13:50 - 16:10
(46) 13:50 - 16:10
(47) 13:50 - 16:10
(48) 13:50 - 16:10
(49) 13:50 - 16:10
(50) 13:50 - 16:10
(51) 13:50 - 16:10
(52) 13:50 - 16:10
(53) 13:50 - 16:10
(54) 13:50 - 16:10
(55) 13:50 - 16:10
(56) 13:50 - 16:10
(57) 13:50 - 16:10
(58) 13:50 - 16:10
(59) 13:50 - 16:10
(60) 13:50 - 16:10
(61) 13:50 - 16:10
(62) 13:50 - 16:10
(63) 13:50 - 16:10
(64) 13:50 - 16:10
(65) 13:50 - 16:10
(66) 13:50 - 16:10
(67) 13:50 - 16:10
(68) 13:50 - 16:10
(69) 13:50 - 16:10
(70)/SP 13:50 - 16:10
Acoustic-to-articulatory Inversion using real-time MRI for Pronunciation Practice
Anna Oura, Hideaki Kikuchi, Tetsunori Kobayashi (Waseda Univ.)
(71)/SP 13:50 - 16:10
[Poster Presentation]
A voice synthesizer operated by fingers to control its vocal-tract area function.
Amane Koriki, Masashi Ito (Tohtech)
(72)/SP 13:50 - 16:10
Discussion toward Accurate Speech Signal Analysis
Shigeki Sagayama (UT/UEC)
(73)/SP 13:50 - 16:10
[Poster Presentation]
Improving CTC-based ASR model by weighting encoder layers using attention mechanisms
Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT)
----- Break ( 20 min. ) -----
----------------------------------------
Sat, Jun 15 PM (16:30 - 17:40)
----------------------------------------
(74) 16:30 - 17:40
----------------------------------------
Sat, Jun 15 PM (17:40 - 18:00)
----------------------------------------
=== Special Interest Group on Music and Computer (IPSJ-MUS) ===
=== Technical Committee on Speech (SP) ===
=== Special Interest Group on Spoken Language Processing (IPSJ-SLP) ===
Last modified: 2024-06-07 16:55:31
|
Notification: Mail addresses are partially hidden against SPAM.
|