Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 10:50 |
Online |
Online |
[Invited Talk]
Hearing and vocalization in birds Kazuo Okanoya (Teikyo U) SP2022-1 |
Songbirds learn their territorial and mating songs through auditory feedback. Their songs have complex and precise acous... [more] |
SP2022-1 pp.1-2 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 13:00 |
Online |
Online |
Issues emerged from implementation of GUIs for WORLD vocoder Hideki Kawahara (Wakayama Univ.), Masanori Morise (Meiji Univ.) SP2022-2 |
[more] |
SP2022-2 pp.3-6 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 13:00 |
Online |
Online |
Study and Comparison of Direction Estimation Methods for Instrumental Sound Sources Kaho Yamamoto, Akio Ogihara (Kindai Univ.), Harumi Murata (Chukyo Univ.) SP2022-3 |
Surround sound is becoming more and more familiar as digital contents such as TV broadcasting support surround sound. Ho... [more] |
SP2022-3 pp.7-9 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 13:00 |
Online |
Online |
A Study of Speech Recognition Result Correction Using BERT for Speech Translation Tadashi Ogura, Masakiyo Fujimoto, Peng Shen, Xugang Lu, Hisashi Kawai (NICT) SP2022-4 |
Speech translation (ST) technology consists of automatic speech recognition (ASR) and machine translation technologies. ... [more] |
SP2022-4 pp.10-13 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 13:00 |
Online |
Online |
Characterization of Audio-Vocal Mirror Neurons in The Songbird Basal Ganglia Yuka Suzuki (The Univ. of Tokyo), Shin Yanagihara, Kazuo Okanoya (Teikyo Univ.) SP2022-5 |
Integration of auditory and vocal-motor information is essential for songbirds to learn a song. Neurons in the premotor ... [more] |
SP2022-5 pp.14-17 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 13:00 |
Online |
Online |
SP2022-6 |
Rank-constrained spatial covariance matrix estimation (RCSCME) is a method for blind speech extraction. In RCSCME, we de... [more] |
SP2022-6 pp.18-23 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
Effects of sequential grouping on rhythm perception Jun Nitta, Sotaro Kondoh, Ryosuke O. Tachibana (UT), Kazuo Okanoya (Teikyo Univ.) SP2022-7 |
(To be available after the conference date) [more] |
SP2022-7 pp.24-25 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
Blind Source Separation based on Independent Low-Rank Matrix Analysis using Restricted Boltzmann Machines Shotaro Furuta, Takuya Kishida, Toru Nakashika (UEC) SP2022-8 |
In this paper, we propose a new blind source separation method that combines independent low-rank source separation (ILR... [more] |
SP2022-8 pp.26-29 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
Examination of "sasae-naosu" technique in opera singing using real-time MRI Natsuki Toda, Hironori Takemoto (CIT), Jun Takahashi (OUA), Seiji Adachi (TGU) SP2022-9 |
Amazing rich voice of opera singers could be acquired by developing sophisticated control of the diaphragm and chest. T... [more] |
SP2022-9 pp.30-34 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
Neural beamformer with automatic detection of notable sounds for acoustic scene classification Sota Ichikawa, Takeshi Yamada (Univ. of Tsukuba), Shoji Makino (Waseda Univ./Univ. of Tsukuba) SP2022-10 |
Recently, acoustic scene classification using a beamformer with multi-channel signals as input has been proposed. Genera... [more] |
SP2022-10 pp.35-40 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
Representation and analytical normalization for vocal-tract-length transformation by group theory Atsushi Miyashita, Tomoki Toda (Nagoya Univ) SP2022-11 |
In automatic speech recognition, a recognition result should be invariant with respect to acoustic changes caused by dif... [more] |
SP2022-11 pp.41-46 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
Conditions for octave equivalence: based on verification in rat. Riseru Koshiishi (Tokyo Univ.), Kazuo Okanoya (Teikyo Univ.) SP2022-12 |
[more] |
SP2022-12 pp.47-48 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
SP2022-13 |
We investigate the method for unsupervised learning of artifacts correction networks used for post-processing of Multi B... [more] |
SP2022-13 pp.49-54 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-17 15:00 |
Online |
Online |
Study of End-to-End Text-to-Speech that can seamlessly control speaker's individuality by Manipulating Speaker features Naoki Aotani, Sunao Hara, Masanobu Abe (Okayama Univ) SP2022-14 |
In this paper, we investigate an End-to-End speech synthesis scheme that enables to seamlessly control speaker individua... [more] |
SP2022-14 pp.55-60 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 10:50 |
Online |
Online |
[Invited Talk]
Crazy vocoder is unbreakable
-- But let's talk about an informal vision of the future -- Masanori Morise (Meiji Univ.) SP2022-15 |
When current speech synthesis researchers refer to Vocoder in their papers, they are most likely referring to Neural voc... [more] |
SP2022-15 pp.61-66 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 13:00 |
Online |
Online |
[Poster Presentation]
Recording of children's speech and lip movements in the Corona disaster Tatsuya Kitamura (Konan Univ.), Ayako Shirose (Tokyo Gakugei Univ.) SP2022-16 |
Children's lip movements and speech were recorded by a head-mounted, downward facing action camera and voice recorder du... [more] |
SP2022-16 pp.67-70 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 13:00 |
Online |
Online |
Speech intelligibility prediction of simulated hearing loss sounds using the Gammachirp Envelope Similarity Index (GESI)
-- Subjective data from laboratory and crowdsourced remote experiments -- Toshio Irino, Honoka Tamaru, Ayako Yamamoto (Wakayama Univ.) SP2022-17 |
We aim at developing an objective intelligibility measure (OIM) to predict speech intelligibility (SI) for individual el... [more] |
SP2022-17 pp.71-76 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 13:00 |
Online |
Online |
Anomalous sound detection using multi-class classifier and reconstructor of its intermediate layer output Keita Matsumoto, Takeshi Yamada (Univ. of Tsukuba), Shoji Makino (Waseda Univ./Univ. of Tsukuba) SP2022-18 |
In recent years, there has been a growing demand for techniques to detect unknown anomalous sounds by unsupervised learn... [more] |
SP2022-18 pp.77-81 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 13:00 |
Online |
Online |
[Poster Presentation]
Proposal of Speech Content Conversion and the Initial Trial: Conversion of Linguistic Information Depending on Situations Kohei Takita, Saizo Aoyagi, Tatsunori Hirai (Komazawa Univ.) SP2022-19 |
It is important to speak dialects, honorifics, and simple words for listeners and the environment in order to smooth com... [more] |
SP2022-19 pp.82-87 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2022-06-18 15:00 |
Online |
Online |
[Poster Presentation]
Subjective intensity of musical beats: a psychophysical quantification Sotaro Kondoh (UTokyo), Kazuo Okanoya (Teikyo Univ./UTokyo), Ryosuke O. Tachibana (UTokyo) SP2022-20 |
Meter is the core of the musical structure. We perceive meter as one strong beat and several weak beats. The intensity o... [more] |
SP2022-20 pp.88-89 |