Fri, Jun 17 AM 09:20 - 09:30 |
(1) |
09:20-09:30 |
|
Fri, Jun 17 AM 09:30 - 10:40 |
(2) |
09:30-10:40 |
|
|
10:40-10:50 |
Break ( 10 min. ) |
Fri, Jun 17 AM 10:50 - 12:00 |
(3) SP |
10:50-12:00 |
[Invited Talk]
Hearing and vocalization in birds |
Kazuo Okanoya (Teikyo Univ.) |
|
12:00-13:00 |
Break ( 60 min. ) |
Fri, Jun 17 PM 13:00 - 15:00 |
(4) |
13:00-15:00 |
|
(5) |
13:00-15:00 |
|
(6) |
13:00-15:00 |
|
(7) |
13:00-15:00 |
|
(8) |
13:00-15:00 |
|
(9) |
13:00-15:00 |
|
(10) |
13:00-15:00 |
|
(11) |
13:00-15:00 |
|
(12) |
13:00-15:00 |
|
(13) |
13:00-15:00 |
|
(14) SP |
13:00-15:00 |
Issues emerged from implementation of GUIs for WORLD vocoder |
Hideki Kawahara (Wakayama Univ.), Masanori Morise (Meiji Univ.) |
(15) SP |
13:00-15:00 |
Study and Comparison of Direction Estimation Methods for Instrumental Sound Sources |
Kaho Yamamoto, Akio Ogihara (Kindai Univ.), Harumi Murata (Chukyo Univ.) |
(16) SP |
13:00-15:00 |
A Study of Speech Recognition Result Correction Using BERT for Speech Translation |
Tadashi Ogura, Masakiyo Fujimoto, Peng Shen, Xugang Lu, Hisashi Kawai (NICT) |
(17) SP |
13:00-15:00 |
Characterization of Audio-Vocal Mirror Neurons in The Songbird Basal Ganglia |
Yuka Suzuki (The Univ. of Tokyo), Shin Yanagihara, Kazuo Okanoya (Teikyo Univ.) |
(18) SP |
13:00-15:00 |
|
|
Fri, Jun 17 PM 15:00 - 17:00 |
(19) SP |
15:00-17:00 |
Effects of sequential grouping on rhythm perception |
Jun Nitta, Sotaro Kondoh, Ryosuke O. Tachibana (UT), Kazuo Okanoya (Teikyo Univ.) |
(20) SP |
15:00-17:00 |
Blind Source Separation based on Independent Low-Rank Matrix Analysis using Restricted Boltzmann Machines |
Shotaro Furuta, Takuya Kishida, Toru Nakashika (UEC) |
(21) |
15:00-17:00 |
|
(22) |
15:00-17:00 |
|
(23) |
15:00-17:00 |
|
(24) |
15:00-17:00 |
|
(25) |
15:00-17:00 |
|
(26) |
15:00-17:00 |
|
(27) SP |
15:00-17:00 |
Examination of "sasae-naosu" technique in opera singing using real-time MRI |
Natsuki Toda, Hironori Takemoto (CIT), Jun Takahashi (OUA), Seiji Adachi (TGU) |
(28) SP |
15:00-17:00 |
Neural beamformer with automatic detection of notable sounds for acoustic scene classification |
Sota Ichikawa, Takeshi Yamada (Univ. of Tsukuba), Shoji Makino (Waseda Univ./Univ. of Tsukuba) |
(29) SP |
15:00-17:00 |
Representation and analytical normalization for vocal-tract-length transformation by group theory |
Atsushi Miyashita, Tomoki Toda (Nagoya Univ.) |
(30) SP |
15:00-17:00 |
Conditions for octave equivalence: based on verification in rat |
Riseru Koshiishi (Tokyo Univ.), Kazuo Okanoya (Teikyo Univ.) |
(31) SP |
15:00-17:00 |
|
|
(32) SP |
15:00-17:00 |
Study of End-to-End Text-to-Speech that can seamlessly control speaker's individuality by Manipulating Speaker features |
Naoki Aotani, Sunao Hara, Masanobu Abe (Okayama Univ.) |
Fri, Jun 17 PM 17:00 - 18:10 |
(33) |
17:00-18:10 |
|
Sat, Jun 18 AM 09:30 - 10:40 |
(34) |
09:30-10:40 |
|
|
10:40-10:50 |
Break ( 10 min. ) |
Sat, Jun 18 AM 10:50 - 12:00 |
(35) SP |
10:50-12:00 |
[Invited Talk]
Crazy vocoder is unbreakable
-- But let's talk about an informal vision of the future -- |
Masanori Morise (Meiji Univ.) |
|
12:00-13:00 |
Break ( 60 min. ) |
Sat, Jun 18 PM 13:00 - 15:00 |
(36) |
13:00-15:00 |
|
(37) |
13:00-15:00 |
|
(38) |
13:00-15:00 |
|
(39) |
13:00-15:00 |
|
(40) |
13:00-15:00 |
|
(41) |
13:00-15:00 |
|
(42) |
13:00-15:00 |
|
(43) |
13:00-15:00 |
|
(44) |
13:00-15:00 |
|
(45) |
13:00-15:00 |
|
(46) |
13:00-15:00 |
|
(47) SP |
13:00-15:00 |
[Poster Presentation]
Recording of children's speech and lip movements in the Corona disaster |
Tatsuya Kitamura (Konan Univ.), Ayako Shirose (Tokyo Gakugei Univ.) |
(48) SP |
13:00-15:00 |
Speech intelligibility prediction of simulated hearing loss sounds using the Gammachirp Envelope Similarity Index (GESI)
-- Subjective data from laboratory and crowdsourced remote experiments -- |
Toshio Irino, Honoka Tamaru, Ayako Yamamoto (Wakayama Univ.) |
(49) SP |
13:00-15:00 |
Anomalous sound detection using multi-class classifier and reconstructor of its intermediate layer output |
Keita Matsumoto, Takeshi Yamada (Univ. of Tsukuba), Shoji Makino (Waseda Univ./Univ. of Tsukuba) |
(50) SP |
13:00-15:00 |
[Poster Presentation]
Proposal of Speech Content Conversion and the Initial Trial: Conversion of Linguistic Information Depending on Situations |
Kohei Takita, Saizo Aoyagi, Tatsunori Hirai (Komazawa Univ.) |
Sat, Jun 18 PM 15:00 - 17:00 |
(51) SP |
15:00-17:00 |
[Poster Presentation]
Subjective intensity of musical beats: a psychophysical quantification |
Sotaro Kondoh (UTokyo), Kazuo Okanoya (Teikyo Univ./UTokyo), Ryosuke O. Tachibana (UTokyo) |
(52) SP |
15:00-17:00 |
Improved speech analysis using F0-adaptive lag window |
Michiki Koshimori, Shigeki Sagayama, Takuya Kishida, Toru Nakashika (UEC) |
(53) |
15:00-17:00 |
|
(54) |
15:00-17:00 |
|
(55) |
15:00-17:00 |
|
(56) |
15:00-17:00 |
|
(57) |
15:00-17:00 |
|
(58) |
15:00-17:00 |
|
(59) |
15:00-17:00 |
|
(60) |
15:00-17:00 |
|
(61) |
15:00-17:00 |
|
(62) SP |
15:00-17:00 |
[Poster Presentation]
The current situation and problems of comics transliteration from the production site |
Sumire Mori (Seika Univ.) |
(63) SP |
15:00-17:00 |
VAE-VC based on cross-entropy error minimization of LSP frequency intervals |
Yoshihiro Hiramoto, Shigeki Sagayama, Takuya Kishida, Toru Nakashika (UEC) |
(64) SP |
15:00-17:00 |
[Poster Presentation]
Worker Filtering Criteria for Subjective Evaluation of Synthesized Voice Sound Quality Using Crowdsourcing |
Moe Yaegashi (Waseda Univ.), Susumu Saito, Teppei Nakano (Waseda Univ./ifLab.), Tetsuji Ogawa (Waseda Univ.) |
(65) SP |
15:00-17:00 |
Unsupervised Training of Sequential Neural Beamformer Using Blindly-separated and Non-separated Signals |
Kohei Saijo, Tetsuji Ogawa (Waseda Univ.) |
|
Sat, Jun 18 PM 17:00 - 18:10 |
(66) |
17:00-18:10 |
|
Sat, Jun 18 PM 18:10 - 18:30 |
(67) |
18:10-18:30 |
|