Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA, SIP, SP, IPSJ-SLP [detail] |
2025-03-03 14:36 |
Okinawa |
|
[No paper] Impression Caption Dataset for Environmental Sounds Yuki Okamoto (UTokyo), Ryotaro Nagase (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Junichi Yamagishi (NII), Yuki Saito (UTokyo), Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) |
[more] |
|
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2024-12-12 14:50 |
Aichi |
Nagoya Univ. (Primary: On-site, Secondary: Online) |
Study on Data Creation and Model Construction for Speech Emotion Captioning Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) NLC2024-21 SP2024-12 |
In previous studies on speech emotion recognition, the results of the prediction are represented by categorical or dimen... [more] |
NLC2024-21 SP2024-12 pp.12-17 |
EA |
2024-05-22 14:55 |
Online |
Online |
Environmental sound synthesis and creation of dataset using vocal imitations Yuki Okamoto (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Shinnosuke Takamichi (The Univ. of Tokyo/Keio Univ.), Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) EA2024-5 |
One way to represent the characteristics of environmental sounds is to imitate the environmental sounds by human voice c... [more] |
EA2024-5 p.22 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Speech Emotion Recognition based on Emotional Label Sequence Estimation Considering Phoneme Class Attribute Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) SP2023-9 |
Recently, many researchers have tackled speech emotion recognition (SER), which predicts emotion conveyed by speech. In ... [more] |
SP2023-9 pp.42-47 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Environmental Sound Separation Considering Separation Distortion and Remixing Error Kanta Shimonishi, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) SP2023-24 |
This report aims to improve the performance of environmental sound separation by considering not only the separated soun... [more] |
SP2023-24 pp.119-124 |
EA, SIP, SP, IPSJ-SLP [detail] |
2022-03-02 15:35 |
Okinawa |
(Primary: On-site, Secondary: Online) |
[Poster Presentation]
A study of shout detection for clipped speech Taito Ishida, Kazuhiro Matsuda, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) EA2021-97 SIP2021-124 SP2021-82 |
Recently, several audio surveillance systems using shouted speech have been proposed for safety in daily life.
Although... [more] |
EA2021-97 SIP2021-124 SP2021-82 pp.207-212 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-18 15:00 |
Online |
Online |
[Poster Presentation]
Scream detection based on deep learning using time-sequential spectral and cepstral features Takahiro Fukumori (Ritsumeikan Univ.) SP2021-6 |
Discrimination between normal speech and scream is crucial in audio surveillance and monitoring. Although deep neural ne... [more] |
SP2021-6 pp.31-36 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2020-12-02 13:50 |
Online |
Online |
Multi-Modal Emotion Recognition by Integrating of Acoustic and Linguistic Features Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) NLC2020-14 SP2020-17 |
In recent years, the advanced techique of deep learning has improved the performance of Speech Emotional Recognition as ... [more] |
NLC2020-14 SP2020-17 pp.7-12 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
A study on suitable modulation parameters for spectral peak noise reduction based on frequency modulated carrier in parametric loudspeaker Kairi Mori, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2017-151 SIP2017-160 SP2017-134 |
In the conventional modulation method using a parametric loudspeaker, a resonance frequency of an ultrasonic element is ... [more] |
EA2017-151 SIP2017-160 SP2017-134 pp.275-276 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 09:00 |
Okinawa |
|
[Poster Presentation]
Performance evaluation of unknown sound clustering for indoor-environmental sound classification based on self-generated acoustic model Sakiko Mishima, Yukoh Wakabayashi, Takahiro Fukumori, Keisuke Imoto, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2017-152 SIP2017-161 SP2017-135 |
Indoor-environmental sound classification is useful for surveillance systems which monitor the situations in the dark an... [more] |
EA2017-152 SIP2017-161 SP2017-135 pp.277-280 |
EA, ASJ-H |
2017-12-01 15:20 |
Overseas |
University of Auckland (New Zealand) |
Range Control of Maximum Demodulation Distance in Parametric Loudspeaker Using Flexible Acoustic-lens Kirara Ariyoshi, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2017-79 |
In this paper, we propose the control method of the maximum demodulation distance in the parametric loudspeaker using fl... [more] |
EA2017-79 pp.109-114 |
EA, ASJ-H |
2017-12-01 16:00 |
Overseas |
University of Auckland (New Zealand) |
HRTF Personalization Based on Pinna Shape Estimation with Handy 3D Scanner Zhuan Zuo, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2017-81 |
A binaural system using only headphones with head-related transfer functions (HRTFs) has been studied for reproducing a ... [more] |
EA2017-81 pp.121-126 |
SP |
2017-08-30 15:50 |
Kyoto |
Kyoto Univ. |
Detection of noisy-and-reverberant shouted speech using rahmonic and mel-frequency cepstrum coefficients Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.), Hiroaki Nanjo (Kyoto Univ.) SP2017-31 |
This paper describes a method based on new combined features with mel-frequency cepstrum coefficients (MFCCs) and rahmon... [more] |
SP2017-31 pp.49-54 |
EA, ASJ-H |
2017-08-10 14:00 |
Miyagi |
Tohoku Univ., R. I. E. C. |
Evaluation on Performance of Spatial Sound-image Projection on Acoustic Hologram with Multiple Parametric Loudspeakers Array Yoshinori Ogami, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2017-40 |
A parametric loudspeaker can transmit an audible sound to a particular area called "audio spot". By utilizing the parame... [more] |
EA2017-40 pp.79-84 |
EA, ASJ-H |
2017-08-10 14:25 |
Miyagi |
Tohoku Univ., R. I. E. C. |
A Study on Sound-quality Improvement Based on Multiplexed Double Sideband Modulation in Parametric Loudspeaker Yusei Nakano, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2017-41 |
A parametric loudspeaker can reproduce an audible sound to a particular area. To reproduce an audible sound, the paramet... [more] |
EA2017-41 pp.85-90 |
SP, SIP, EA |
2017-03-01 10:25 |
Okinawa |
Okinawa Industry Support Center |
Noisy speech reconstruction based on deep neural network with optical microphone Tomoyuki Mizuno, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2016-84 SIP2016-139 SP2016-79 |
Measuring distant-talking speech with high accuracy is important for detecting criminal activity. Various microphones su... [more] |
EA2016-84 SIP2016-139 SP2016-79 pp.13-18 |
SP, SIP, EA |
2017-03-01 12:40 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
Indoor-environmental sound identification based on deep neural network with higher-dimensional features Sakiko Mishima, Yukoh Wakabayashi, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2016-87 SIP2016-142 SP2016-82 |
Surveillance systems with a video camera have been utilized for the safety of people. It is important to identify the in... [more] |
EA2016-87 SIP2016-142 SP2016-82 pp.31-36 |
SP, SIP, EA |
2017-03-01 12:40 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
Reverberant speech enhancement with deep auto encoder based on harmonic structure Rikuto Ota, Yukoh Wakabayashi, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2016-107 SIP2016-162 SP2016-102 |
This paper describes reverberant speech enhancement (RSE) with deep auto encoder (DAE) based on harmonic structure. DAEs... [more] |
EA2016-107 SIP2016-162 SP2016-102 pp.141-146 |
SP, SIP, EA |
2017-03-02 09:00 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
An evaluation of voice intelligibility in factory noise environment based on active noise control and auditory masking Rumi Ito, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.) EA2016-126 SIP2016-181 SP2016-121 |
In factories, factory workers exposed to loud and continual noises feel strong discomfort. We have previously proposed a... [more] |
EA2016-126 SIP2016-181 SP2016-121 pp.249-254 |
SP, SIP, EA |
2017-03-02 09:00 |
Okinawa |
Okinawa Industry Support Center |
[Poster Presentation]
Performance evaluation of noisy shouted speech detection based on acoustic model with rahmonic and mel-frequency cepstrum coefficients Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.), Hiroaki Nanjo (Kyoto Univ.) EA2016-132 SIP2016-187 SP2016-127 |
This paper describes a method based on new combined features with mel-frequency cepstrum coefficients (MFCCs) and rahmon... [more] |
EA2016-132 SIP2016-187 SP2016-127 pp.283-286 |