IEICE Technical Committee Submission System

Technical Committee on Speech (SP) [schedule] [select]
Chair Takahiro Shinozaki (Tokyo Inst. of Tech)
Secretary Atsushi Ando (NTT), Kei Hashimoto (Nagoya Inst. of Tech.)
Assistant Motoi Oomachi (Line Yahoo), Yuuki Saito (Univ. of Tokyo)

Technical Committee on Engineering Acoustics (EA) [schedule] [select]
Chair Nobutaka Ono (Tokyo Metropolitan Univ.)
Vice Chair Takanobu Nishiura (Ritsumeikan Univ.), Keigo Wakayama (NTT)
Secretary Yoshiaki Bando (AIST), Nobutaka Ito (Univ. of Tokyo)
Assistant Daichi Kitamura (NIT, Kagawa), Yuma Kinoshita (Tokai Univ.)

Technical Committee on Signal Processing (SIP) [schedule] [select]
Chair Koichi Ichige (Yokohama National Univ.)
Vice Chair Akira Tanaka (Hokkaido Univ.), Kiyoshi Nishikawa (Tokyo Metropolitan Univ.)
Secretary Shoko Imaizumi (Chiba Univ.), Taizo Suzuki (Univ. of Tsukuba)
Assistant Masanari Nakamura (Hokkaido Univ.), Sayaka Shiota (Tokyo Metropolitan Univ.)

Special Interest Group on Spoken Language Processing (IPSJ-SLP) [schedule] [select]
Chair Takahiro Shinozaki (Tokyo Inst. of Tech)
Secretary Atsushi Ando (NTT), Kei Hashimoto (Nagoya Inst. of Tech.), Motoi Oomachi (Line Yahoo), Yuuki Saito (Univ. of Tokyo)

Conference Date Sun, Mar 2, 2025 09:30 - 16:40
Mon, Mar 3, 2025 09:45 - 16:45
Tue, Mar 4, 2025 09:30 - 16:00
Topics  
Conference Place  
Sponsors This conference is co-sponsored by the Technical Committee on Electroacoustics of ASJ and the IEEE Signal Processing Society Tokyo Joint Chapter. This conference is technically co-sponsored by the IEEE Signal Processing Society Tokyo Joint Chapter and the APSIPA Japan Chapter.
Registration Fee This workshop will be held as an IEICE workshop with fully electronic publishing. A registration fee is required, except for speakers and for participants other than those attending workshop(s) with non-electronic publishing. See the registration fee page. Participants attending the EA, SIP, or SP workshop(s) are asked to pay the registration fee or presentation fee.

Sun, Mar 2 AM 
09:30 - 10:50
(1)
SP
09:30-09:50 Uncertainty-Based Streaming ASR with Evidential Deep Learning Hiroaki Sato, Asahi Sakuma, Ryuga Sugano, Tadashi Kumano, Yoshihiko Kawai (NHK STRL), Tetsuji Ogawa (Waseda Univ.)
(2) 09:50-10:10  
(3) 10:10-10:30  
(4) 10:30-10:50  
Sun, Mar 2 AM 
09:30 - 10:50
(5)
EA
09:30-09:50 Sound field estimation method robust to microphone position error Takumi Koga, Natsuki Ueno (Kumamoto Univ.)
(6)
EA
09:50-10:10 Acoustic Wave Propagation Simulation based on Wave Equation-based Neural Networks Shota Okubo, Toshiharu Horiuchi (KDDI Research, Inc.)
(7)
EA
10:10-10:30 Sound field reconstruction with sparse channel acoustic signals based on simultaneous learning of graph and signal interpolation Shihori Kozuka, Takayuki Sasaki (NTT), Yukihiro Bando (Shimonoseki City Univ.), Hiroaki Itou, Kazuya Hayase, Noriyoshi Kamado, Masaki Kitahara (NTT)
(8)
EA
10:30-10:50 Implementation of Sound Field Synthesis Renderer for Volumetric Audio Yo Sasaki, Yasushige Nakayama (NHK)
  10:50-11:00 Break ( 10 min. )
Sun, Mar 2 AM 
11:00 - 12:00
(9)
SP
11:00-11:20 An Experimental Study on Text-independent Speaker Verification for Forensic Applications Shigeki Ozawa (YCU), Akira Gotoh, Yuko Saito, Hiroki Matsuura (NEC), Takafumi Koshinaka (YCU)
(10)
SP
11:20-11:40 Speaker Verification Based on Deformable Convolutional Networks Keiya Sato, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (NITech)
(11)
SP
11:40-12:00 Speech-Activity-Guided Speaker Embedding Extraction Shota Horiguchi, Takafumi Moriya, Atsushi Ando, Takanori Ashihara, Hiroshi Sato, Naohiro Tawara, Marc Delcroix (NTT)
Sun, Mar 2 AM 
11:00 - 12:00
(12)
SIP
11:00-11:20 Joint Diagonalization Based on Equivalence Classes of Orthogonal Matrices by Signed Permutations and Weighted Averaging in the Cayley Transform Domain Akira Tanaka, Takafumi Edo (Hokkaido Univ.)
(13)
SIP
11:20-11:40 Algebraic representation of dynamical systems in time-frequency domain: An extension to integro-differential equations Shigeru Ando (Univ. Tokyo)
(14)
SIP
11:40-12:00 Toward nonlinear system identification Fumihiko Ishiyama (NTT)
  12:00-13:00 Break ( 60 min. )
Sun, Mar 2 PM 
13:00 - 14:20
(15)
SP
13:00-13:20 Zero-Shot Speech Synthesis Directly Referring Target Speech Through Attention Mechanisms Kyohei Nakatsuka, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.)
(16) 13:20-13:40  
(17) 13:40-14:00  
(18) 14:00-14:20  
Sun, Mar 2 PM 
13:00 - 14:20
(19)
EA
13:00-13:20 Affective Impression Structural Models and Individual Differences in Chord Listening Sakura Sakamoto (Kwansei Gakuin Univ.), Yoichi Yamazaki (Univ. of Nagasaki), Kenji Katahira (Waseda Univ.), Takashi Fujisawa (Univ. of Fukui), Noriko Nagata (Kwansei Gakuin Univ.)
(20)
EA
13:20-13:40 A Value Structure Model and Individual Differences for the Designing of Pleasant Motor Drive Sounds Jun Urayama, Noriko Nagata (Kwansei Gakuin Univ.), Yoichi Yamazaki (Univ. of Nagasaki), Yuto Kobayashi, Yasunori Sugita (Nagaoka Univ. of Technology), Takashi Hoduki, Akira Satake, Hiroyasu Iwabuki (MELCO)
(21)
EA
13:40-14:00 Localization of Victims Using Equivalent Rotating Sound Sources Atsuhisa Nakane, Takaaki Nara (UTokyo)
(22)
EA
14:00-14:20 Sound image localization experiments using shoulder-mounted wearable speakers with an inverse filter applied using H-infinity control theory Kenji Kita (Daido Univ.)
  14:20-14:30 Break ( 10 min. )
Sun, Mar 2 PM 
14:30 - 15:30
(23)
EA
14:30-15:30
  15:30-15:40 Break ( 10 min. )
Sun, Mar 2 PM 
15:40 - 16:40
(24)
SIP
15:40-16:40 [Invited Talk]
Time-domain and spatial-domain linear predictive analysis and its application for audio and speech lossless coding standards
Yutaka Kamamoto (NTT)
Mon, Mar 3 AM 
09:45 - 11:05
(25) 09:45-10:05  
(26) 10:05-10:25  
(27)
SP
10:25-10:45 Study on a Japanese Speech Understanding Model Robust to Multi-Item Questioning Yuki Takashima, Atsushi Ando, Taichi Asami (NTT)
(28)
SP
10:45-11:05 Measurement of time delay tolerance for third-person game live audio commentary Ryosuke Matsushita, Ryosuke Sakai, Koki Fukuda (Keio Univ.), Shinnosuke Takamichi (Keio Univ./UTokyo), Kota Iura, Yuki Saito (UTokyo), Graham Neubig (CMU), Katsuhito Sudoh (NWU), Hiroya Takamura, Tatsuya Ishigaki (AIST)
Mon, Mar 3 AM 
09:45 - 11:05
(29)
EA
09:45-11:05 [Poster Presentation]
Machine-type dependent positive and negative division of training data for unsupervised anomalous detection of machinery sounds
Yuuki Tachioka (Denso IT Laboratory)
(30)
EA
09:45-11:05 [Poster Presentation]
Evaluation of Sound Field and Multizone Reproduction Performance in Loudspeaker Arrays with Different Enclosures
Tong Zhou, Kana Itahashi, Akitoshi Kataoka (Ryukoku Univ.)
(31)
EA
09:45-11:05 [Poster Presentation]
Shifted sound-image perception using pre-virtual-leading hypersonic signals with bass frequency envelopes
Ryota Imanaka, Yuting Geng (Ritsumeikan Univ.), Masato Nakayama (Osaka Sangyo Univ.), Takanobu Nishiura (Ritsumeikan Univ.)
(32)
EA
09:45-11:05 [Poster Presentation]
Decentralized Independent Vector Analysis Based on Majorization-Minimization Algorithm for Distributed Microphone Arrays
Katsuhiro Morita, Kouei Yamaoka, Norihiro Takamune, Hiroshi Saruwatari (UTokyo)
(33)
EA
09:45-11:05 [Poster Presentation]
Evaluation of noise reduction performance of multichannel feedforward ANC system with optical laser microphone in reverberant environments
Maoto Mizutani, Kenta Iwai, Takanobu Nishiura (Ritsumeikan Univ.), Yoshiharu Soeta (AIST)
(34)
EA
09:45-11:05 [Poster Presentation]
Study on Virtual Sensing ANC Using Tetrahedral Microphone Arrays
Toma Yoshimatsu (UEC), Hiroaki Itou, Shihori Kozuka, Noriyoshi Kamado (NTT), Yoichi Haneda (UEC)
(35)
EA
09:45-11:05 [Poster Presentation]
Improvement of Localization Performance in Binaural Rendering with Panning for Transmission Systems with Delay
Kenta Takeuchi, Masayuki Nishiguchi, Koji Abe, Kanji Watanabe (Akita Prefectural Univ.)
(36)
EA
09:45-11:05 [Poster Presentation]
Creation of representative head-related impulse responses for smooth reproduction of moving audio objects
Kazuki Hoshito, Masayuki Nishiguchi, Kanji Watanabe, Koji Abe (Akita Prefectural Univ.)
(37)
EA
09:45-11:05 [Poster Presentation]
Augmentation of Asynchronous Data for Acoustic Scene Classification Using Asynchronous Distributed Microphone Arrays
Takao Kawamura, Nobutaka Ono (TMU)
(38)
EA
09:45-11:05 [Poster Presentation]
Performance Evaluation of Active Noise Control System without Error Microphone Introducing Primary Path Estimation under Moving Noise Source Position
Ryo Matsuura, Shota Toyooka (Kansai Univ.), Kenta Iwai (Ritsumeikan Univ.), Yoshinobu Kajikawa (Kansai Univ.)
(39)
EA
09:45-11:05 [Poster Presentation]
Numerical Simulation based Design of Moving Sound Sources Using Impulse Response Combination and Acoustic Effects Integration
Ryuuta Kouma, Sun Chang, Kan Okubo (TMU)
  11:05-11:20 Break ( 15 min. )
Mon, Mar 3 AM 
11:20 - 12:40
(40)
EA
11:20-11:40 Proposal and Analysis of Metric for Evaluating Sampling Frequency Independence Based on Local Equivariance Error Kanami Imamura (UTokyo/AIST), Tomohiko Nakamura (AIST), Norihiro Takamune (UTokyo), Kouhei Yatabe (TUAT), Hiroshi Saruwatari (UTokyo)
(41)
EA
11:40-12:00 Traffic Volume and Speed Estimation Using Pre-trained Audio Model Tomohiro Takahashi (TMU), Natsuki Ueno (TMU/Kumamoto Univ.), Yuma Kinoshita (Tokai Univ.), Yukoh Wakabayashi (TUT), Nobutaka Ono (TMU), Makiho Sukekawa, Seishi Fukuma, Hiroshi Nakagawa (NEE)
(42)
EA
12:00-12:20 A method of estimating the power of residual noise by using the auxiliary filter Kensaku Fujii (Kodaway Lab.), Mitsuji Muneyasu (Kansai Univ.), Yoshifumi Chisaki (CIT)
(43)
EA
12:20-12:40 Memory-efficient and low-computational hierarchical musical instruments classification using element selection Ryu Kato (Tokyo Metropolitan Univ.), Natsuki Ueno (Kumamoto Univ.), Nobutaka Ono (Tokyo Metropolitan Univ.), Ryo Matsuda, Kazunobu Kondo, Yu Takahashi (Yamaha Corp.)
Mon, Mar 3 AM 
11:20 - 12:40
(44)
SIP
11:20-12:40 [Poster Presentation]
Low-Dose DECT Image Reconstruction Using Edge Sparsity and Similarity
Akira Egashira, Daichi Kitahara (Keio Univ.)
(45)
SIP
11:20-12:40 [Poster Presentation]
Validation of the Optimality and Usefulness of Tight Windows Designed via Manifold Optimization
Keito Takahashi, Daichi Kitahara (Keio Univ.)
(46)
SIP
11:20-12:40 [Poster Presentation]
1D Nonnegative Spline Smoothing by Convex Semi-Infinite Programming
Hiroki Arai, Daichi Kitahara (Keio Univ.)
(47)
SIP
11:20-12:40 [Poster Presentation]
MMSE Beamforming with the Consistency of Multiple Covariance Matrices for Phased Array Weather Radar
Shinji Naito, Daichi Kitahara (Keio Univ.)
(48)
SIP
11:20-12:40 [Poster Presentation]
An Extension of Privacy-Preserving FedSGD Federated Learning with Random Binary Weights to FedAvg Federated Learning
Hiroto Sawada, Shoko Imaizumi (Chiba Univ.), Hitoshi Kiya (Tokyo Metropolitan Univ.)
(49)
SIP
11:20-12:40 [Poster Presentation]
Pseudo Artifacts and Data Augmentation for Real-World Video Deblurring Using Deep Learning
Sota Moriyama, Koichi Ichige (YNU)
(50)
SIP
11:20-12:40 [Poster Presentation]
Multichannel Speech Enhancement Method Using Dilated Semi-Dense Convolution Network
Tomohiro Ueyama, Koichi Ichige (Yokohama National Univ.), Takahiro Murakami (Meiji Univ.)
(51)
SIP
11:20-12:40 [Poster Presentation]
Detecting Human-Object Contact Using Human Region Enlargement on Video
Kaito Kira, Sota Moriyama, Koichi Ichige (Yokohama National Univ.)
(52)
SIP
11:20-12:40 [Poster Presentation]
Study on Hybrid Compensation Selective Fixed-Filter Active Noise Control Using One-Dimensional CNN
Hiroki Tsukahara, Shota Toyooka (Kansai Univ.), Kenta Iwai (Ritsumeikan Univ.), Shunsuke Kita (ORIST), Yoshinobu Kajikawa (Kansai Univ.)
(53)
SIP
11:20-12:40 [Poster Presentation]
Improvement of Estimation of Variance for Acoustic Echo and Noise Canceller Based on Variable-Step-Size-Shared-Error NLMS Algorithm
Kenta Iwai (Ritsumeikan Univ.)
(54)
SIP
11:20-12:40 [Poster Presentation]
On System Identification Based on Dynamic Mode Decomposition with Control for Model Predictive Control
Sekiya Futamura (Niigata grad school), Shogo Muramatsu (Niigata Univ.)
  12:40-13:40 Break ( 60 min. )
Mon, Mar 3 PM 
13:10 - 13:40
  -  
Mon, Mar 3 PM 
13:40 - 15:04
(55)
SP
13:40-13:47 [No paper] Extension of the head-related transfer function from two front/back directions to any direction in the upper hemisphere Masaki Saito, Ryota Shimokura, Yoji Iiguni (Osaka Univ.)
(56)
SP
13:47-13:54 [No paper] Cross-modal effects using salty and sweet tastes to improve the accuracy of sound image localization for general HRTF Hikaru Yoshida, Kenji Kita (Daido Univ.)
(57)
SP
13:54-14:01 [No paper] Investigation of human perception based CLAPScore Taisei Takano, Yuki Okamoto, Yusuke Kanamori, Yuki Saito (UTokyo), Ryotaro Nagase (Ritsumeikan Univ.), Hiroshi Saruwatari (UTokyo)
(58)
SP
14:01-14:08 [No paper] Cartilage Conduction-based approach to reduce discomfort from low-frequency noise Ito Hirata, Ryota Shimokura (Osaka Univ.), Naoto Sasaoka (Tottori Univ.), Yoji Iiguni (Osaka Univ.)
(59)
SP
14:08-14:15 [No paper] Construction of subjective evaluation dataset for automatic evaluation of input-output relevance in text-to-audio Yusuke Kanamori, Yuki Okamoto, Taisei Takano (UTokyo), Shinnosuke Takamichi (Keio Univ./UTokyo), Yuki Saito, Hiroshi Saruwatari (UTokyo)
(60)
SP
14:15-14:22 [No paper] Selective Noise Control Using Virtual Sensing and Active Noise Control with Cartilage Conduction Yoshiki Kato, Ryota Shimokura (Osaka Univ.), Naoto Sasaoka (Tottori Univ.), Yoji Iiguni (Osaka Univ.)
(61)
SP
14:22-14:29 [No paper] Online Processing for Spatial Voice Conversion Using BSS, VC, and Remixing Kenta Takada, Kentaro Seki, Yuki Saito, Kouei Yamaoka, Yuto Ishikawa, Hiroshi Saruwatari (UTokyo)
(62)
SP
14:29-14:36
(63)
SP
14:36-14:43 [No paper] Impression Caption Dataset for Environmental Sounds Yuki Okamoto (UTokyo), Ryotaro Nagase (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Junichi Yamagishi (NII), Yuki Saito (UTokyo), Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.)
(64)
SP
14:43-14:50 [No paper] Unsupervised EEG Channel Selection Based on Inter-individuals Distance in Covariance Matrix Sota Hayashi, Hiroshi Higashi, Yuichi Tanaka (Osaka Univ.)
(65)
SP
14:50-14:57 [No paper] Joint analysis of distance and class of environmental sound from single channel recording Yuki Hoshikawa, Keisuke Imoto, Takao Tsuchiya (Doshisha Univ.)
(66)
SP
14:57-15:04 [No paper] Analysis of time-frequency features in speech decoding from intracranial recordings Shoya Murakami, Shuji Komeiji, Yu Watanabe (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (Science Tokyo), Toshihisa Tanaka (TUAT)
  15:04-15:20 Break ( 16 min. )
Mon, Mar 3 PM 
15:20 - 16:20
(67)
SIP
15:20-16:20 [Special Invited Talk]
Spatial Audio Intelligence: From Representation to Understanding and Control of Auditory Environments
Woon-Seng Gan (NTU Singapore)
  16:20-16:30 Break ( 10 min. )
Mon, Mar 3 PM 
16:30 - 16:45
  -  
Tue, Mar 4 AM 
09:30 - 10:50
(68)
SIP
09:30-09:50 CLaSP: Multimodal Foundation Model Using Time Series Signal Data and Natural Language Aoi Ito (Hitachi Ltd./Hosei Univ.), Kota Dohi, Yohei Kawaguchi (Hitachi Ltd.)
(69)
SIP
09:50-10:10 Domain-Independent Automatic Generation of Descriptive Texts for Time-Series Data Kota Dohi (Hitachi), Aoi Ito (Hitachi/Hosei), Harsh Purohit, Tomoya Nishida, Takashi Endo, Yohei Kawaguchi (Hitachi)
(70)
SIP
10:10-10:30 Riverbed Estimation using Locally-Structured Unitary Network with Multiresolution Representation Seiyu Hitomi, Godage Yasas, Hiroyasu Yasuda, Kiyoshi Hayasaka, Shogo Muramatsu (Niigata Univ.)
(71)
SIP
10:30-10:50 Online Short-term Prediction of Riverbed Evolution Using Extended Dynamic Mode Decomposition Reiya Asuke, Masahiro Yukawa (Keio Univ.), Shogo Muramatsu, Daichi Moteki, Hiroyasu Yasuda (Niigata Univ.)
Tue, Mar 4 AM 
09:30 - 10:50
(72)
SP
09:30-10:50 [Poster Presentation]
Improving Conv-TasNet for Multi-Channel Speech Enhancement and Examination of Microphone Placement
Taisuke Morikawa, Akitoshi Kataoka (Grad. Sch., Ryukoku Univ.)
(73)
SP
09:30-10:50 [Poster Presentation]
An Analysis of Speaker Representation for Target-Speaker Speech Processing
Takanori Ashihara, Takafumi Moriya, Shota Horiguchi (NTT), Junyi Peng (BUT), Tsubasa Ochiai, Marc Delcroix, Kohei Matsuura, Hiroshi Sato (NTT)
(74)
SP
09:30-10:50 [Poster Presentation]
Speech spoofing detection using deep learning model with multiple acoustic features
Haruto Namba, Sayaka Shiota (TMU)
(75)
SP
09:30-10:50 [Poster Presentation]
Necessity of Voice Sample Selection in Qualification Tests for Crowdsourced Subjective Audio Quality Evaluation
Takuma Yabe, Moe Yaegashi, Teppei Nakano, Tetsuji Ogawa (Waseda Univ.)
(76)
SP
09:30-10:50 [Poster Presentation]
JIS: Japanese Speech Corpus of Idol Speakers with Various Speaking Styles
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko (NTT)
(77)
SP
09:30-10:50
(78) 09:30-10:50  
(79) 09:30-10:50  
(80) 09:30-10:50  
  10:50-11:05 Break ( 15 min. )
Tue, Mar 4 AM 
11:05 - 12:25
(81)
SIP
11:05-11:25 Performance Evaluation of Data-driven Water Level Distribution Prediction for Integrated River Control Hiromu Kanauchi, Ryuto Ito, Hiroyasu Yasuda (Niigata Univ.), Masaaki Nagahara (Hiroshima Univ.), Shogo Muramatsu (Niigata Univ.)
(82)
SIP
11:25-11:45 Estimation of Riverbed Undulation using DMDc for Active River Channel Control with Groynes and Its Evaluation Chen Zhang, Hiroyasu Yasuda, Kiyoshi Hayasaka, Shogo Muramatsu (Niigata Univ.)
(83)
SIP
11:45-12:05 Clustering for time-varying graphs with varying number of nodes Tomoya Akabayashi (Osaka Univ.), Hayate Kojima (TUAT), Junya Hara, Hiroshi Higashi, Yuichi Tanaka (Osaka Univ.)
(84)
SIP
12:05-12:25 Generalized Graph Signal Sampling with Pre-selection of Critical Vertices Keitaro Yamashita, Kazuki Naganuma, Shunsuke Ono (Science Tokyo)
Tue, Mar 4 AM 
11:05 - 12:25
(85)
SP
11:05-12:25 [Poster Presentation]
Construction of an ASR model based on self-supervised learning using intermediate layer outputs
Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT)
(86)
SP
11:05-12:25 [Poster Presentation]
Improvement and Evaluation of Utterance End Time Estimation Method for Spoken Dialog Systems
Takanori Kanai, Yukoh Wakabayashi (TUT), Ryota Nishimura (Tokushima Univ.), Norihide Kitaoka (TUT)
(87)
SP
11:05-12:25 [Poster Presentation]
Improvement of the GESI for Predicting Speech Intelligibility in Older Adults
Ayako Yamamoto, Fuki Miyazaki, Toshio Irino (Wakayama Univ.)
(88)
SP
11:05-12:25 [Poster Presentation]
Sammo: Incorporating MAMBA-2 into Modern Streaming Encoders for Japanese ASR
Wen Shen Teo, Yasuhiro Minami (UEC)
(89)
SP
11:05-12:25 [Poster Presentation]
Improvement of Speech Recognition Performance for Elderly Speech by Alternating Learning of Acoustic and Linguistic information
Kaito Takahashi, Yukoh Wakabayashi (TUT), Kengo Ohta (NIT, Anan College), Norihide Kitaoka (TUT)
(90)
EA
11:05-12:25 [Poster Presentation]
Source Separation Based on Regularization Using Back-Projected Demixing Vectors
Kukuru Koiso, Taishi Nakashima, Nobutaka Ono (TMU)
(91)
EA
11:05-12:25 [Poster Presentation]
Real-Time Blind Source Separation for Head-Mounted Microphone Array Using Own Voice Selection Based on Relative Transfer Function
Kyoka Kazama, Taishi Nakashima, Nobutaka Ono (TMU)
(92)
EA
11:05-12:25 [Poster Presentation]
Source-specific forgetting factor in multiplicative update online AuxIVA
Kaito Masuko, Taishi Nakashima, Nobutaka Ono (Tokyo Metropolitan Univ.)
(93)
EA
11:05-12:25 [Poster Presentation]
Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation Using Independent Deeply Learned Matrix Analysis for Real-Time Multichannel Speech Extraction in Diffuse Noise Environment
Yuki Nakanishi, Yuto Ishikawa, Norihiro Takamune, Hiroshi Saruwatari (The Univ. of Tokyo)
(94)
EA
11:05-12:25 [Poster Presentation]
Two-Stage Processing of Blind Source Separation and DNN-based Speech Enhancement for In-Car Speech Recognition
Yutsuki Takeuchi, Taishi Nakashima, Nobutaka Ono (Tokyo Metropolitan Univ.), Takashi Takazawa, Shuhei Shimanoe, Yoshinori Tsuchiya (MIRISE Technologies)
  12:25-13:25 Break ( 60 min. )
Tue, Mar 4 PM 
13:25 - 14:25
(95) 13:25-13:45  
(96) 13:45-14:05  
(97) 14:05-14:25  
Tue, Mar 4 PM 
13:25 - 14:45
(98)
SIP
13:25-14:45 [Poster Presentation]
Speech Synthesis from Electrocorticogram During Imagined Speech Using a Transformer-Based Decoder
Shuji Komeiji, Kai Shigemi (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (Science Tokyo), Kohei Yatabe, Toshihisa Tanaka (TUAT)
(99)
SIP
13:25-14:45 [Poster Presentation]
Control of 3D Physical Model of Movable Artificial Variable Width Channel with Reinforcement Learning
-- For River Digital Twin --
Ryusei Aoki, Sisaykeo Phonepaserth, Shogo Muramatsu (Niigata Univ.)
(100)
SIP
13:25-14:45 [Poster Presentation]
Fundamental considerations for dynamics modeling with Locally Structured Unitary Network
Motoyasu Suzuki, Yasas Godage, Shogo Muramatsu (Niigata Univ.)
(101)
SIP
13:25-14:45 [Poster Presentation]
A Study of Constraints on Directivity Design Method for Improving Suppression Performance
Miryu Goino, Kenji Suyama (Tokyo Denki Univ.)
(102)
SIP
13:25-14:45 [Poster Presentation]
A dynamic data augmentation method using diffusion models for classification of intensive care EEG
Takuma Bingo, Hajime Yano, Taichiro Ashizaki, Kazuma Koda, Masaya Togo (Kobe Univ.), Riki Matsumoto (Kobe Univ./Kyoto Univ.), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ.)
(103)
SIP
13:25-14:45 [Poster Presentation]
Individual differences in interoception affect brain activity during music recall
Kazuki Matsunaga, Ingon Chanpornpakdi, Toshihisa Tanaka (TUAT)
(104)
SIP
13:25-14:45 [Poster Presentation]
Nonnegative Sparse Optimization Using ReLU Activation Function and Its Application to Deep Unfolding
Haruki Esaki, Towa Yasui, Seisuke Kyochi (Kogakuin Univ.)
(105)
SIP
13:25-14:45 [Poster Presentation]
Sparse Modeling for Electroencephalogram-based Sustained Attention Assessment
Masaya Togashi, Ingon Chanpornpakdi, Toshihisa Tanaka (TUAT)
(106)
EA
13:25-14:45 [Poster Presentation]
Large-Scale Numerical Simulation of Tsunami-Induced Infrasound Using Spherical Coordinates
Masami Tokuda, Yoshiki Saito, Kan Okubo (TMU)
(107)
EA
13:25-14:45 [Poster Presentation]
Study on Acoustic Analysis of Micro Speakers Considering Electromagnetic-Structure-Acoustic Coupling
Kakeru Yamaguchi, Shota Toyooka (Kansai Univ.), Kenta Iwai (Ritsumeikan Univ.), Shunsuke Kita (ORIST), Yoshinobu Kajikawa (Kansai Univ.)
(108)
EA
13:25-14:45 [Poster Presentation]
Acoustic Vibration Analysis of Distributed Mode Loudspeaker (DML) Using Pattern Structures
Yuito Kimura, Kan Okubo (Tokyo Metropolitan Univ.)
  14:45-15:00 Break ( 15 min. )
Tue, Mar 4 PM 
15:00 - 16:00
(109) 15:00-16:00  

Announcement for Speakers
General Talk: Each talk will have 15 minutes for presentation and 5 minutes for discussion.
Poster Presentation: Each poster will have 80 minutes for presentation and 0 minutes for discussion.

Contact Address and Latest Schedule Information
SP Technical Committee on Speech (SP)   [Latest Schedule]
Contact Address  
EA Technical Committee on Engineering Acoustics (EA)   [Latest Schedule]
Contact Address Yoshiaki Bando (AIST)
E-mail: ybanaist 
SIP Technical Committee on Signal Processing (SIP)   [Latest Schedule]
Contact Address IEICE Technical Group on Signal Processing
Email: sip-n 
IPSJ-SLP Special Interest Group on Spoken Language Processing (IPSJ-SLP)   [Latest Schedule]
Contact Address  


Last modified: 2025-02-04 16:49:34


Notification: Mail addresses are partially hidden against SPAM.

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan