IEICE Technical Committee Submission System
Advance Program
Online Proceedings
[Sign in]
Tech. Rep. Archives
 Go Top  Go Back   Prev SP Conf / [HTML] / [HTML(simple)] / [TEXT]  [Japanese] / [English] 


Technical Committee on Speech (SP) [schedule] [select]
Chair Takahiro Shinozaki (Tokyo Inst. of Tech)
Secretary Atsushi Ando (NTT), Kei Hashimoto (Nagoya Inst. of Tech.)
Assistant Motoi Oomachi (Line Yahoo), Yuuki Saito (Univ. of Tokyo)

Technical Committee on Engineering Acoustics (EA) [schedule] [select]
Chair Nobutaka Ono (Tokyo Metropolitan Univ.)
Vice Chair Takanobu Nishiura (RitsumeikanUniv.), Keigo Wakayama (NTT)
Secretary Yoshiaki Bando (AIST), Nobutaka Ito (Univ. of Tokyo)
Assistant Daichi Kitamura (NIT,Kagawa), Yuma Kinoshita (Tokai Univ.)

Technical Committee on Signal Processing (SIP) [schedule] [select]
Chair Koichi Ichige (Yokohama National Univ.)
Vice Chair Akira Tanaka (Hokkaido Univ.), Kiyoshi Nishikawa (okyo Metropolitan Univ.)
Secretary Shoko Imaizumi (Chiba Univ.), Taizo Suzuki (Univ. of Tsukubaba)
Assistant Masanari Nakamura (Hokkaido Univ.), Sayaka Shiota (Tokyo Metropolitan Univ.)

Special Interest Group on Spoken Language Processing (IPSJ-SLP) [schedule] [select]
Chair Takahiro Shinozaki (Tokyo Inst. of Tech)
Secretary Atsushi Ando (NTT), Kei Hashimoto (Nagoya Inst. of Tech.), Motoi Oomachi (Line Yahoo), Yuuki Saito (Univ. of Tokyo)

Conference Date Sun, Mar 2, 2025 09:30 - 16:40
Mon, Mar 3, 2025 09:45 - 16:45
Tue, Mar 4, 2025 09:30 - 16:10
Topics  
Conference Place  
Sponsors This conference is co-sponsored by Technical Committee on Electroacoustics of ASJ and IEEE Signal Processing Society Tokyo Joint Chapter. This conference is technical co-sponsored by IEEE SPS Tokyo Joint Chapter, IEEE Signal Processing Society Tokyo Joint Chapter and APSIPA Japan Chapter.
Copyright
and
reproduction
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)
Registration Fee This workshop will be held as the IEICE workshop in fully electronic publishing. Registration fee will be necessary except the speakers and participants other than the participants to workshop(s) in non-electronic publishing. See the registration fee page. We request the registration fee or presentation fee to participants who will attend the workshop(s) on EA, SIP, SP.

Sun, Mar 2 AM 
09:30 - 10:50
(1)
SP
09:30-09:50 Uncertainty-Based Streaming ASR with Evidential Deep Learning EA2024-77 SIP2024-112 SP2024-18 Hiroaki Sato, Asahi Sakuma, Ryuga Sugano, Tadashi Kumano, Yoshihiko Kawai (NHK STRL), Ogawa Tetsuji (Waseda Univ.)
(2) 09:50-10:10  
(3) 10:10-10:30  
(4) 10:30-10:50  
Sun, Mar 2 AM 
09:30 - 10:50
(5)
EA
09:30-09:50 Sound field estimation method robust to microphone position error EA2024-78 SIP2024-113 SP2024-19 Takumi Koga, Ueno Natsuki (Kumamoto Univ.)
(6)
EA
09:50-10:10 Acoustic Wave Propagation Simulation based on Wave Equation-based Neural Networks EA2024-79 SIP2024-114 SP2024-20 Shota Okubo, Toshiharu Horiuchi (KDDI Research, Inc.)
(7)
EA
10:10-10:30 Sound field reconstruction with sparse channel acoustic signals based on simultaneous learning of graph and signal interpolation EA2024-80 SIP2024-115 SP2024-21 Shihori Kozuka, Takayuki Sasaki (NTT), Yukihiro Bando (Shimonoseki City Univ.), Hiroaki Itou, Kazuya Hayase, Noriyoshi Kamado, Masaki Kitahara (NTT)
(8)
EA
10:30-10:50 Implementation of Sound Field Synthesis Renderer for Volumetric Audio EA2024-81 SIP2024-116 SP2024-22 Yo Sasaki, Yasushige Nakayama (NHK)
  10:50-11:00 Break ( 10 min. )
Sun, Mar 2 AM 
11:00 - 12:00
(9)
SP
11:00-11:20 An Experimental Study on Text-independent Speaker Verification for Forensic Applications EA2024-82 SIP2024-117 SP2024-23 Shigeki Ozawa (YCU), Akira Gotoh, Yuko Saito, Hiroki Matsuura (NEC), Takafumi Koshinaka (YCU)
(10)
SP
11:20-11:40 Speaker Verification Based on Deformable Convolutional Networks EA2024-83 SIP2024-118 SP2024-24 Keiya Sato, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (NITech)
(11)
SP
11:40-12:00 Speech-Activity-Guided Speaker Embedding Extraction EA2024-84 SIP2024-119 SP2024-25 Shota Horiguchi, Takafumi Moriya, Atsushi Ando, Takanori Ashihara, Hiroshi Sato, Naohiro Tawara, Marc Delcroix (NTT)
Sun, Mar 2 AM 
11:00 - 12:00
(12)
SIP
11:00-11:20 Joint Diagonalization Based on Equivalence Classes of Orthogonal Matrices by Signed Permutations and Weighted Averaging in the Cayley Transform Domain EA2024-85 SIP2024-120 SP2024-26 Akira Tanaka, Takafumi Edo (Hokkaido Univ.)
(13)
SIP
11:20-11:40 Algebraic representation of dynamical systems in time-frequency domain: An extension to integro-differential equations EA2024-86 SIP2024-121 SP2024-27 Shigeru Ando (Univ. Tokyo)
(14)
SIP
11:40-12:00 Toward nonlinear system identification EA2024-87 SIP2024-122 SP2024-28 Fumihiko Ishiyama (NTT)
  12:00-13:00 Break ( 60 min. )
Sun, Mar 2 PM 
13:00 - 14:20
(15)
SP
13:00-13:20 Zero-Shot Speech Synthesis Directly Referring Target Speech Through Attention Mechanisms EA2024-88 SIP2024-123 SP2024-29 Kyohei Nakatsuka, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.)
(16) 13:20-13:40  
(17) 13:40-14:00  
(18) 14:00-14:20  
Sun, Mar 2 PM 
13:00 - 14:20
(19)
EA
13:00-13:20 Affective Impression Structural Models and Individual Differences in Chord Listening EA2024-89 SIP2024-124 SP2024-30 Sakura Sakamoto (Kwansei Gakuin Univ.), Yoichi Yamazaki (Univ. of Nagasaki), Kenji Katahira (Waseda Univ.), Takashi Fujisawa (Univ. of Fukui), Noriko Nagata (Kwansei Gakuin Univ.)
(20)
EA
13:20-13:40 A Value Structure Model and Individual Differences for the Designing of Pleasant Motor Drive Sounds EA2024-90 SIP2024-125 SP2024-31 Jun Urayama, Noriko Nagata (Kwansei Gakuin Univ.), Yoichi Yamazaki (Univ. of Nagasaki), Yuto kobayashi, Yasunori Sugita (Nagaoka Univ. of Technology), Takashi Hoduki, Akira Satake, Hiroyasu Iwabuki (MELCO)
(21)
EA
13:40-14:00 Localization of Victims Using Equivalent Rotating Sound Sources EA2024-91 SIP2024-126 SP2024-32 Atsuhisa Nakane, Takaaki Nara (UTokyo)
(22)
EA
14:00-14:20 Sound image localization experiments using shoulder-mounted wearable speakers with an inverse filter applied using H-infinity control theory EA2024-92 SIP2024-127 SP2024-33 Kenji Kita (Daido Univ.)
  14:20-14:30 Break ( 10 min. )
Sun, Mar 2 PM 
14:30 - 15:30
(23)
EA
14:30-15:30 EA2024-93 SIP2024-128 SP2024-34
  15:30-15:40 Break ( 10 min. )
Sun, Mar 2 PM 
15:40 - 16:40
(24)
SIP
15:40-16:40 [Invited Talk]
Time-domain and spatial-domain linear predictive analysis and its application for audio and speech lossless coding standards EA2024-94 SIP2024-129 SP2024-35
Yutaka Kamamoto (NTT)
Mon, Mar 3 AM 
09:45 - 11:05
(25) 09:45-10:05  
(26) 10:05-10:25  
(27)
SP
10:25-10:45 Study on a Japanese Speech Understanding Model Robust to Multi-Item Questioning EA2024-95 SIP2024-130 SP2024-36 Yuki Takashima, Atsushi Ando, Taichi Asami (NTT)
(28)
SP
10:45-11:05 Measurement of time delay tolerance for third-person game live audio commentary EA2024-96 SIP2024-131 SP2024-37 Ryosuke Matsushita, Ryosuke Sakai, Koki Fukuda (Keio Univ.), Shinnosuke Takamichi (Keio Univ./UTokyo), Kota Iura, Yuki Saito (UTokyo), Graham Neubig (CMU), Katsuhito Sudoh (NWU), Hiroya Takamura, Tatsuya Ishigaki (AIST)
Mon, Mar 3 AM 
09:45 - 11:05
(29)
EA
09:45-11:05 [Poster Presentation]
Machine-type dependent positive and negative division of training data for unsupervised anomalous detection of machinery sounds EA2024-97 SIP2024-132 SP2024-38
Yuuki Tachioka (Denso IT Laboratory)
(30)
EA
09:45-11:05 [Poster Presentation]
Evaluation of Sound Field and Multizone Reproduction Performance in Loudspeaker Arrays with Different Enclosures EA2024-98 SIP2024-133 SP2024-39
Tong Zhou, Kana Itahashi, Akitoshi Kataoka (Ryukoku Univ.)
(31)
EA
09:45-11:05 [Poster Presentation]
Shifted sound-image perception using pre-virtual-leading hypersonic signals with bass frequency envelopes EA2024-99 SIP2024-134 SP2024-40
Ryota Imanaka, Yuting Geng (Ritsumeikan Univ.), Masato Nakayama (Osaka Sangyo Univ), Takanobu Nishiura (Ritsumeikan Univ.)
(32)
EA
09:45-11:05 [Poster Presentation]
Decentralized Independent Vector Analysis Based on Majorization-Minimization Algorithm for Distributed Microphone Arrays EA2024-100 SIP2024-135 SP2024-41
Katsuhiro Morita, Kouei Yamaoka, Norihiro Takamune, Hiroshi Saruwatari (UTokyo)
(33)
EA
09:45-11:05 [Poster Presentation]
Evaluation of noise reduction performance of multichannel feedforward ANC system with optical laser microphone in reverberant environments EA2024-101 SIP2024-136 SP2024-42
Maoto Mizutani, Kenta Iwai, Takanobu Nishiura (Ritsumeikan Univ.), Yoshiharu Soeta (AIST)
(34)
EA
09:45-11:05 [Poster Presentation]
Study on Virtual Sensing ANC Using Tetrahedral Microphone Arrays EA2024-102 SIP2024-137 SP2024-43
Toma Yoshimatsu (UEC), Hiroaki Itou, Shihori Kozuka, Noriyoshi Kamado (NTT), Yoichi Haneda (UEC)
(35)
EA
09:45-11:05 [Poster Presentation]
Improvement of Localization Performance in Binaural Rendering with Panning for Transmission Systems with Delay EA2024-103 SIP2024-138 SP2024-44
Kenta Takeuchi, Masayuki Nishiguchi, Koji Abe, Kanji Watanabe (Akita Prefectural Univ.)
(36)
EA
09:45-11:05 [Poster Presentation]
Creation of representative head-related impulse responses for smooth reproduction of moving audio objects EA2024-104 SIP2024-139 SP2024-45
Kazuki Hoshito, Masayuki Nishiguchi, Kanji Watanabe, Koji Abe (Akita Prefectural Univ.)
(37)
EA
09:45-11:05 [Poster Presentation]
Augmentation of Asynchronous Data for Acoustic Scene Classification Using Asynchronous Distributed Microphone Arrays EA2024-105 SIP2024-140 SP2024-46
Takao Kawamura, Nobutaka Ono (TMU)
(38)
EA
09:45-11:05 [Poster Presentation]
Performance Evaluation of Active Noise Control System without Error Microphone Introducing Primary Path Estimation under Moving Noise Source Position. EA2024-106 SIP2024-141 SP2024-47
Ryo Matsuura, Shota Toyooka (Kansai Univ.), Kenta Iwai (Ritsumeikan Univ.), Yoshinobu Kajikawa (Kansai Univ.)
(39)
EA
09:45-11:05 [Poster Presentation]
Numerical Simulation based Design of Moving Sound Sources Using Impulse Response Combination and Acoustic Effects Integration EA2024-107 SIP2024-142 SP2024-48
Ryuuta Kouma, Sun Chang, Kan Okubo (TMU)
  11:05-11:20 Break ( 15 min. )
Mon, Mar 3 AM 
11:20 - 12:40
(40)
EA
11:20-11:40 Proposal and Analysis of Metric for Evaluating Sampling Frequency Independence Based on Local Equivariance Error EA2024-108 SIP2024-143 SP2024-49 Kanami Imamura (UTokyo/AIST), Tomohiko Nakamura (AIST), Norihiro Takamune (UTokyo), Kouhei Yatabe (TUAT), Hiroshi Saruwatari (UTokyo)
(41)
EA
11:40-12:00 Traffic Volume and Speed Estimation Using Pre-trained Audio Model EA2024-109 SIP2024-144 SP2024-50 Tomohiro Takahashi (TMU), Natsuki Ueno (TMU/Kumamoto Univ.), Yuma Kinoshita (Tokai Univ.), Yukoh Wakabayashi (TUT), Nobutaka Ono (TMU), Makiho Sukekawa, Seishi Fukuma, Hiroshi Nakagawa (NEE)
(42)
EA
12:00-12:20 A method of estimating the power of residual noise by using the auxiliary filter EA2024-110 SIP2024-145 SP2024-51 Kensaku Fujii (Kodaway Lab.), Mitsuji Muneyasu (Kansai Univ.), Yoshifumi Chisaki (CIT)
(43)
EA
12:20-12:40 Memory-efficient and low-computational hierarchical musical instruments classification using element selection EA2024-111 SIP2024-146 SP2024-52 Ryu Kato (Tokyo Metropolitan Univ.), Natsuki Ueno (Kumamoto Univ./), Nobutaka Ono (Tokyo Metropolitan Univ.), Ryo Matsuda, Kazunobu Kondo, Yu Takahashi (Yamaha Corp.)
Mon, Mar 3 AM 
11:20 - 12:40
(44)
SIP
11:20-12:40 [Poster Presentation]
Low-Dose DECT Image Reconstruction Using Edge Sparsity and Similarity EA2024-112 SIP2024-147 SP2024-53
Akira Egashira, Daichi Kitahara (Keio Univ.)
(45)
SIP
11:20-12:40 [Poster Presentation]
Validation of the Optimality and Usefulness of Tight Windows Designed via Manifold Optimization EA2024-113 SIP2024-148 SP2024-54
Keito Takahashi, Daichi Kitahara (Keio Univ.)
(46)
SIP
11:20-12:40 [Poster Presentation]
1D Nonnegative Spline Smoothing by Convex Semi-Infinite Programming EA2024-114 SIP2024-149 SP2024-55
Hiroki Arai, Daichi Kitahara (Keio Univ.)
(47)
SIP
11:20-12:40 [Poster Presentation]
MMSE Beamforming with the Consistency of Multiple Covariance Matrices for Phased Array Weather Radar EA2024-115 SIP2024-150 SP2024-56
Shinji Naito, Daichi Kitahara (Keio Univ.)
(48)
SIP
11:20-12:40 [Poster Presentation]
An Extension of Privacy-Preserving FedSGD Federated Learning with Random Binary Weights to FedAvg Federated Learning EA2024-116 SIP2024-151 SP2024-57
Hiroto Sawada, Shoko Imaizumi (Chiba Univ.), Hitoshi Kiya (Tokyo Metropolitan Univ.)
(49)
SIP
11:20-12:40 [Poster Presentation]
Pseudo Artifacts and Data Augmentation for Real-World Video Deblurring Using Deep Learning EA2024-117 SIP2024-152 SP2024-58
Sota Moriyama, Koichi Ichige (YNU)
(50)
SIP
11:20-12:40 [Poster Presentation]
Multichannel Speech Enhancement Method Using Dilated Semi-Dense Convolution Network EA2024-118 SIP2024-153 SP2024-59
Tomohiro Ueyama, Koichi Ichige (Yokohama National Univ.), Takahiro Murakami (Meiji Univ.)
(51)
SIP
11:20-12:40 [Poster Presentation]
Detecting Human-Object Contact Using Human Region Enlargement on Video EA2024-119 SIP2024-154 SP2024-60
Kaito Kira, Sota Moriyama, Koichi Ichige (Yokohama National Univ.)
(52)
SIP
11:20-12:40 [Poster Presentation]
Study on Hybrid Compensation Selective Fixed-Filter Active Noise Control Using One-Dimensional CNN EA2024-120 SIP2024-155 SP2024-61
Hiroki Tsukahara, Shota Toyooka (Kansai Univ.), Kenta Iwai (Ritsumeikan Univ.), Shunsuke Kita (ORIST), Yoshinobu Kajikawa (Kansai Univ.)
(53)
SIP
11:20-12:40 [Poster Presentation]
[Poster Presentation] Improvement of Estimation of Variance for Acoustic Echo and Noise Canceller Based on Variable-Step-Size-Shared-Error NLMS Algorithm EA2024-121 SIP2024-156 SP2024-62
Kenta Iwai (Ritsumeikan Univ.)
(54)
SIP
11:20-12:40 [Poster Presentation]
On System Identification Based on Dynamic Mode Decomposition with Control for Model Predictive Control EA2024-122 SIP2024-157 SP2024-63
Sekiya Futamura (Niigata grad school), Shogo Muramatsu (Niigata Univ)
  12:40-13:40 Break ( 60 min. )
Mon, Mar 3 PM 
13:10 - 13:40
  -  
Mon, Mar 3 PM 
13:40 - 15:04
(55)
SP
13:40-13:47 [No paper] Extension of the head-related transfer function from two front/back directions to any direction in the upper hemisphere Masaki Saito, Ryota Shimokura, Yoji Iiguni (Osaka Univ.)
(56)
SP
13:47-13:54 [No paper] Cross-modal effects using salty and sweet tastes to improve the accuracy of sound image localization for general HRTF Hikaru Yoshida, Kenji Kita (Daido Univ.)
(57)
SP
13:54-14:01 [No paper] Investigation of human perception based CLAPScore Taisei Takano, Yuki Okamoto, Yusuke Kanamori, Yuki Saito (UTokyo), Ryotaro Nagase (Ritsumeikan Univ.), Hiroshi Saruwatari (UTokyo)
(58)
SP
14:01-14:08 [No paper] Cartilage Conduction-based approach to reduce discomfort from low-frequency noise Ito Hirata, Ryota Shimokura (Osaka Univ.), Naoto Sasaoka (Tottori univ.), Yoji Iiguni (Osaka Univ.)
(59)
SP
14:08-14:15 [No paper] Construction of subjective evaluation dataset for automatic evaluation of input-output relevance in text-to-audio Yusuke Kanamori, Yuki Okamoto, Taisei Takano (UTokyo), Shinnosuke Takamichi (Keio Univ./UTokyo), Yuki Saito, Hiroshi Saruwatari (UTokyo)
(60)
SP
14:15-14:22 [No paper] Selective Noise Control Using Virtual Sensing and Active Noise Control with Cartilage Conduction Yoshiki Kato, Ryota Shimokura (Osaka Univ.), Naoto Sasaoka (Tottori Univ.), Yoji Iiguni (Osaka Univ.)
(61)
SP
14:22-14:29 [No paper] Online Processing for Spatial Voice Conversion Using BSS, VC, and Remixing Kenta Takada, Kentaro Seki, Yuki Saito, Kouei Yamaoka, Yuto Ishikawa, Hiroshi Saruwatari (UTokyo)
(62)
SP
14:29-14:36
(63)
SP
14:36-14:43 [No paper] Impression Caption Dataset for Environmental Sounds Yuki Okamoto (UTokyo), Ryotaro Nagase (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Junichi Yamagishi (NII), Yuki Saito (UTokyo), Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.)
(64)
SP
14:43-14:50 [No paper] Unsupervised EEG Channel Selection Based on Inter-individuals Distance in Covariance Matrix Sota Hayashi, Hiroshi Higashi, Yuichi Tanaka (Osaka Univ.)
(65)
SP
14:50-14:57 [No paper] Joint analysis of distance and class of environmental sound from single channel recording Yuki Hoshikawa, Keisuke Imoto, Takao Tsuchiya (Doshisha Univ.)
(66)
SP
14:57-15:04 [No paper] Analysis of time-frequency features in speech decoding from intracranial recordings Shoya Murakami, Shuji Komeiji, Yu Watanabe (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (Science Tokyo), Toshihisa Tanaka (TUAT)
  15:04-15:20 Break ( 16 min. )
Mon, Mar 3 PM 
15:20 - 16:20
(67)
SIP
15:20-16:20 [Special Invited Talk]
Spatial Audio Intelligence: From Representation to Understanding and Control of Auditory Environments EA2024-123 SIP2024-158 SP2024-64
Woon-Seng Gan (NTU Singapore)
  16:20-16:30 Break ( 10 min. )
Mon, Mar 3 PM 
16:30 - 16:45
  -  
Tue, Mar 4 AM 
09:30 - 10:50
(68)
SIP
09:30-09:50 CLaSP: Multimodal Foundation Model Using Time Series Signal Data and Natural Language EA2024-124 SIP2024-159 SP2024-65 Aoi Ito (Hitachi Ltd./Hosei Univ.), Kota Dohi, Yohei Kawaguchi (Hitachi Ltd.)
(69)
SIP
09:50-10:10 Domain-Independent Automatic Generation of Descriptive Texts for Time-Series Data EA2024-125 SIP2024-160 SP2024-66 Kota Dohi (Hitachi), Aoi Ito (Hitachi/Hosei), Harsh Purohit, Tomoya Nishida, Takashi Endo, Yohei Kawaguchi (Hitachi)
(70)
SIP
10:10-10:30 Riverbed Estimation using Locally-Structured Unitary Network with Multiresolution Representation EA2024-126 SIP2024-161 SP2024-67 Seiyu Hitomi, Godage Yasas, Hiroyasu Yasuda, Kiyoshi Hayasaka, Shogo Muramatsu (Niigata Univ.)
(71)
SIP
10:30-10:50 Online Short-term Prediction of Riverbed Evolution Using Extended Dynamic Mode Decomposition EA2024-127 SIP2024-162 SP2024-68 Reiya Asuke, Masahiro Yukawa (Keio Univ.), Shogo Muramatsu, Daichi Moteki, Hiroyasu Yasuda (Niigata Univ.)
Tue, Mar 4 AM 
09:30 - 10:50
(72)
SP
09:30-10:50 [Poster Presentation]
Improving Conv-TasNet for Multi-Channel Speech Enhancement and Examination of Microphone Placement EA2024-128 SIP2024-163 SP2024-69
Taisuke Morikawa, Akitoshi Kataoka (Grad. Sch., Ryukoku Univ.)
(73)
SP
09:30-10:50 [Poster Presentation]
An Analysis of Speaker Representation for Target-Speaker Speech Processing EA2024-129 SIP2024-164 SP2024-70
Takanori Ashihara, Takafumi Moriya, Shota Horiguchi (NTT), Junyi Peng (BUT), Tsubasa Ochiai, Marc Delcroix, Kohei Matsuura, Hiroshi Sato (NTT)
(74)
SP
09:30-10:50 [Poster Presentation]
Speech spoofing detection using deep learning model with multiple acoustic features EA2024-130 SIP2024-165 SP2024-71
Haruto Namba, Sayaka Shiota (TMU)
(75)
SP
09:30-10:50 [Poster Presentation]
Necessity of Voice Sample Selection in Qualification Tests for Crowdsourced Subjective Audio Quality Evaluation EA2024-131 SIP2024-166 SP2024-72
Takuma Yabe, Moe Yaegashi, Teppei Nakano, Tetsuji Ogawa (Waseda Univ.)
(76)
SP
09:30-10:50 [Poster Presentation]
JIS: Japanese Speech Corpus of Idol Speakers with Various Speaking Styles EA2024-132 SIP2024-167 SP2024-73
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko (NTT)
(77)
SP
09:30-10:50 EA2024-133 SIP2024-168 SP2024-74
(78) 09:30-10:50  
(79) 09:30-10:50  
(80) 09:30-10:50  
  10:50-11:05 Break ( 15 min. )
Tue, Mar 4 PM 
11:05 - 12:25
(81)
SIP
11:05-11:25 Performance Evaluation of Data-driven Water Level Distribution Prediction for Integrated River Control EA2024-134 SIP2024-169 SP2024-75 Hiromu Kanauchi, Ryuto Ito, Hiroyasu Yasuda (Niigata Univ.), Masaaki Nagahara (Hiroshima Univ.), Shogo Muramatsu (Niigata Univ.)
(82)
SIP
11:25-11:45 Estimation of Riverbed Undulation using DMDc for Active River Channel Control with Groynes and Its Evaluation EA2024-135 SIP2024-170 SP2024-76 Chen Zhang, Hiroyasu Yasuda, Kiyoshi Hayasaka, Shogo Muramatsu (Niigata Univ.)
(83)
SIP
11:45-12:05 Clustering for time-varying graphs with varying number of nodes EA2024-136 SIP2024-171 SP2024-77 Tomoya Akabayashi (Osaka Univ.), Hayate Kojima (TUAT), Junya Hara, Hiroshi Higashi, Yuichi Tanaka (Osaka Univ.)
(84)
SIP
12:05-12:25 Generalized Graph Signal Sampling with Pre-selection of Critical Vertices EA2024-137 SIP2024-172 SP2024-78 Keitaro Yamashita, Kazuki Naganuma, Shunsuke Ono (Science Tokyo)
Tue, Mar 4 AM 
11:05 - 12:25
(85)
SP
11:05-12:25 [Poster Presentation]
Construction of a ASR model based on self-supervised learning using intermediate layer outputs EA2024-138 SIP2024-173 SP2024-79
Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT)
(86)
SP
11:05-12:25 [Poster Presentation]
Improvement and Evaluation of Utterance End Time Estimation Method for Spoken Dialog Systems EA2024-139 SIP2024-174 SP2024-80
Takanori Kanai, Yukoh Wakabayashi (TUT), Ryota Nishimura (Tokushima Univ.), Norihide Kitaoka (TUT)
(87)
SP
11:05-12:25 [Poster Presentation]
Improvement of the GESI for Predicting Speech Intelligibility in Older Adults EA2024-140 SIP2024-175 SP2024-81
Ayako Yamamoto, Fuki Miyazaki, Toshio Irino (Wakayama Univ.)
(88)
SP
11:05-12:25 [Poster Presentation]
Sammo: Incorporating MAMBA-2 into Modern Streaming Encoders for Japanese ASR EA2024-141 SIP2024-176 SP2024-82
Wen Shen Teo, Yasuhiro Minami (UEC)
(89)
SP
11:05-12:25 [Poster Presentation]
Improvement of Speech Recognition Performance for Elderly Speech by Alternating Learning of Acoustic and Linguistic information EA2024-142 SIP2024-177 SP2024-83
Kaito Takahashi, Yukoh Wakabayashi (TUT), Kengo Ohta (NIT, Anan College), Norihide Kitaoka (TUT)
(90)
EA
11:05-12:25 [Poster Presentation]
Source Separation Based on Regularization Using Back-Projected Demixing Vectors EA2024-143 SIP2024-178 SP2024-84
Kukuru Koiso, Taishi Nakashima, Nobutaka Ono (TMU)
(91)
EA
11:05-12:25 [Poster Presentation]
Real-Time Blind Source Separation for Head-Mounted Microphone Array Using Own Voice Selection Based on Relative Transfer Function EA2024-144 SIP2024-179 SP2024-85
Kyoka Kazama, Taishi Nakashima, Nobutaka Ono (TMU)
(92)
EA
11:05-12:25 [Poster Presentation]
Source-specific forgetting factor in multiplicative update online AuxIVA. EA2024-145 SIP2024-180 SP2024-86
Kaito Masuko, Taishi Nakashima, Nobutaka Ono (Tokyo Metropolitan Univ.)
(93)
EA
11:05-12:25 [Poster Presentation]
Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation Using Independent Deeply Learned Matrix Analysis for Real-Time Multichannel Speech Extraction in Diffuse Noise Environment EA2024-146 SIP2024-181 SP2024-87
Yuki Nakanishi, Yuto Ishikawa, Norihiro Takamune, Hiroshi Saruwatari (The Univ. of Tokyo)
(94)
EA
11:05-12:25 [Poster Presentation]
Two-Stage Processing of Blind Source Separation and DNN-based Speech Enhancement for In-Car Speech Recognition EA2024-147 SIP2024-182 SP2024-88
Yutsuki Takeuchi, Taishi Nakashima, Nobutaka Ono (Tokyo Metropolitan Univ.), Takashi Takazawa, Shuhei Shimanoe, Yoshinori Tsuchiya (MIRISE Technologies)
  12:25-13:25 Break ( 60 min. )
Tue, Mar 4 PM 
13:25 - 14:25
(95) 13:25-13:45  
(96) 13:45-14:05  
(97) 14:05-14:25  
Tue, Mar 4 PM 
13:25 - 14:45
(98)
SIP
13:25-14:45 [Poster Presentation]
Speech Synthesis from Electrocorticogram During Imagined Speech Using a Transformer-Based Decoder EA2024-148 SIP2024-183 SP2024-89
Shuji Komeiji, Kai Shigemi (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendou Univ.), Koichi Shinoda (Science Tokyo), Kohei Yatabe, Toshihisa Tanaka (TUAT)
(99)
SIP
13:25-14:45 [Poster Presentation]
Control of 3D Physical Model of Movable Artificial Variable Width Channel with Reinforcement Learning
-- For River Digital Twin --
EA2024-149 SIP2024-184 SP2024-90
Ryusei Aoki, Sisaykeo Phonepaserth, Shogo Muramatsu (Niigata Univ.)
(100)
SIP
13:25-14:45 [Poster Presentation]
Fundamental considerations for dynamics modeling with Locally Structured Unitary Network EA2024-150 SIP2024-185 SP2024-91
Motoyasu Suzuki, Yasas Godage, Shogo Muramatsu (Nigata Univ.)
(101)
SIP
13:25-14:45 [Poster Presentation]
A Study of Constraints on Directivity Design Method for Improving Suppression Performance EA2024-151 SIP2024-186 SP2024-92
Miryu Goino, Kenji Suyama (Tokyo Denki Univ.)
(102)
SIP
13:25-14:45 [Poster Presentation]
A dynamic data augmentation method using diffusion models for classification of intensive care EEG EA2024-152 SIP2024-187 SP2024-93
Takuma Bingo, Hajime Yano, Taichiro Ashizaki, Kazuma Koda, Masaya Togo (Kobe Univ.), Riki Matsumoto (Kobe Univ./Kyoto Univ.), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ.)
(103)
SIP
13:25-14:45 [Poster Presentation]
Individual differences in interoception affects brain activity during music recall EA2024-153 SIP2024-188 SP2024-94
Kazuki Matsunaga, Ingon Chanpornpakdi, Toshihisa Tanaka (TUAT)
(104)
SIP
13:25-14:45 [Poster Presentation]
Nonnegative Sparse Optimization Using Relu Activation Function and Its Application to Deep Unfolding EA2024-154 SIP2024-189 SP2024-95
Haruki Esaki, Towa Yasui, Seisuke Kyochi (Kogakuin Univ.)
(105)
SIP
13:25-14:45 [Poster Presentation]
Sparse Modeling for Electroencephalogram-based Sustained Attention Assessment EA2024-155 SIP2024-190 SP2024-96
Masaya Togashi, Ingon Chanpornpakdi, Toshihisa Tanaka (TUAT)
(106)
EA
13:25-14:45 [Poster Presentation]
Large-Scale Numerical Simulation of Tsunami-Induced Infrasound Using Spherical Coordinates EA2024-156 SIP2024-191 SP2024-97
Masami Tokuda, Yoshiki Saito, Kan Okubo (TMU)
(107)
EA
13:25-14:45 [Poster Presentation]
Study on Acoustic Analysis of Micro Speakers Considering Electromagnetic-Structure-Acoustic Coupling EA2024-157 SIP2024-192 SP2024-98
Kakeru Yamaguchi, Shota Toyooka (Kansai Univ.), Kenta Iwai (Ritsumeikan Univ.), Shunsuke Kita (ORIST), Yoshinobu Kajikawa (Kansai Univ.)
(108)
EA
13:25-14:45 [Poster Presentation]
Acoustic Vibration Analysis of Distributed Mode Loudspeaker (DML) Using Pattern Structures EA2024-158 SIP2024-193 SP2024-99
Yuito Kimura, Kan Okubo (Tokyo Metropolitan Univ.)
  14:45-15:00 Break ( 15 min. )
Tue, Mar 4 PM 
15:00 - 16:00
(109) 15:00-16:00  
Tue, Mar 4 PM 
16:00 - 16:10
(110) 16:00-16:10  

Announcement for Speakers
General TalkEach speech will have 15 minutes for presentation and 5 minutes for discussion.
Poster PresentationEach speech will have 80 minutes for presentation and 0 minutes for discussion.

Contact Address and Latest Schedule Information
SP Technical Committee on Speech (SP)   [Latest Schedule]
Contact Address  
EA Technical Committee on Engineering Acoustics (EA)   [Latest Schedule]
Contact Address Yoshiaki Bando (AIST)
E--mail: ybanaist 
SIP Technical Committee on Signal Processing (SIP)   [Latest Schedule]
Contact Address IEICE Technical Group on Signal Processing
Email: sip-n 
IPSJ-SLP Special Interest Group on Spoken Language Processing (IPSJ-SLP)   [Latest Schedule]
Contact Address  


Last modified: 2025-03-02 15:18:17


Notification: Mail addresses are partially hidden against SPAM.

[Download Paper's Information (in Japanese)] <-- Press download button after click here.
 
[Cover and Index of IEICE Technical Report by Issue]
 

[Presentation and Participation FAQ] (in Japanese)
 

[Return to EA Schedule Page]   /   [Return to SIP Schedule Page]   /   [Return to SP Schedule Page]   /   [Return to IPSJ-SLP Schedule Page]   /  
 
 Go Top  Go Back   Prev SP Conf / [HTML] / [HTML(simple)] / [TEXT]  [Japanese] / [English] 


[Return to Top Page]

[Return to IEICE Web Page]


The Institute of Electronics, Information and Communication Engineers (IEICE), Japan