Sun, Mar 2 AM 09:30 - 10:50 |
(1) SP |
09:30-09:50 |
Uncertainty-Based Streaming ASR with Evidential Deep Learning |
Hiroaki Sato, Asahi Sakuma, Ryuga Sugano, Tadashi Kumano, Yoshihiko Kawai (NHK STRL), Tetsuji Ogawa (Waseda Univ.) |
(2) |
09:50-10:10 |
|
(3) |
10:10-10:30 |
|
(4) |
10:30-10:50 |
|
Sun, Mar 2 AM 09:30 - 10:50 |
(5) EA |
09:30-09:50 |
Sound field estimation method robust to microphone position error |
Takumi Koga, Natsuki Ueno (Kumamoto Univ.) |
(6) EA |
09:50-10:10 |
Acoustic Wave Propagation Simulation based on Wave Equation-based Neural Networks |
Shota Okubo, Toshiharu Horiuchi (KDDI Research, Inc.) |
(7) EA |
10:10-10:30 |
Sound field reconstruction with sparse channel acoustic signals based on simultaneous learning of graph and signal interpolation |
Shihori Kozuka, Takayuki Sasaki (NTT), Yukihiro Bando (Shimonoseki City Univ.), Hiroaki Itou, Kazuya Hayase, Noriyoshi Kamado, Masaki Kitahara (NTT) |
(8) EA |
10:30-10:50 |
Implementation of Sound Field Synthesis Renderer for Volumetric Audio |
Yo Sasaki, Yasushige Nakayama (NHK) |
|
10:50-11:00 |
Break ( 10 min. ) |
Sun, Mar 2 AM 11:00 - 12:00 |
(9) SP |
11:00-11:20 |
An Experimental Study on Text-independent Speaker Verification for Forensic Applications |
Shigeki Ozawa (YCU), Akira Gotoh, Yuko Saito, Hiroki Matsuura (NEC), Takafumi Koshinaka (YCU) |
(10) SP |
11:20-11:40 |
Speaker Verification Based on Deformable Convolutional Networks |
Keiya Sato, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (NITech) |
(11) SP |
11:40-12:00 |
Speech-Activity-Guided Speaker Embedding Extraction |
Shota Horiguchi, Takafumi Moriya, Atsushi Ando, Takanori Ashihara, Hiroshi Sato, Naohiro Tawara, Marc Delcroix (NTT) |
Sun, Mar 2 AM 11:00 - 12:00 |
(12) SIP |
11:00-11:20 |
Joint Diagonalization Based on Equivalence Classes of Orthogonal Matrices by Signed Permutations and Weighted Averaging in the Cayley Transform Domain |
Akira Tanaka, Takafumi Edo (Hokkaido Univ.) |
(13) SIP |
11:20-11:40 |
Algebraic representation of dynamical systems in time-frequency domain: An extension to integro-differential equations |
Shigeru Ando (Univ. Tokyo) |
(14) SIP |
11:40-12:00 |
Toward nonlinear system identification |
Fumihiko Ishiyama (NTT) |
|
12:00-13:00 |
Break ( 60 min. ) |
Sun, Mar 2 PM 13:00 - 14:20 |
(15) SP |
13:00-13:20 |
Zero-Shot Speech Synthesis Directly Referring Target Speech Through Attention Mechanisms |
Kyohei Nakatsuka, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Inst. of Tech.) |
(16) |
13:20-13:40 |
|
(17) |
13:40-14:00 |
|
(18) |
14:00-14:20 |
|
Sun, Mar 2 PM 13:00 - 14:20 |
(19) EA |
13:00-13:20 |
Affective Impression Structural Models and Individual Differences in Chord Listening |
Sakura Sakamoto (Kwansei Gakuin Univ.), Yoichi Yamazaki (Univ. of Nagasaki), Kenji Katahira (Waseda Univ.), Takashi Fujisawa (Univ. of Fukui), Noriko Nagata (Kwansei Gakuin Univ.) |
(20) EA |
13:20-13:40 |
A Value Structure Model and Individual Differences for the Designing of Pleasant Motor Drive Sounds |
Jun Urayama, Noriko Nagata (Kwansei Gakuin Univ.), Yoichi Yamazaki (Univ. of Nagasaki), Yuto Kobayashi, Yasunori Sugita (Nagaoka Univ. of Technology), Takashi Hoduki, Akira Satake, Hiroyasu Iwabuki (MELCO) |
(21) EA |
13:40-14:00 |
Localization of Victims Using Equivalent Rotating Sound Sources |
Atsuhisa Nakane, Takaaki Nara (UTokyo) |
(22) EA |
14:00-14:20 |
Sound image localization experiments using shoulder-mounted wearable speakers with an inverse filter applied using H-infinity control theory |
Kenji Kita (Daido Univ.) |
|
14:20-14:30 |
Break ( 10 min. ) |
Sun, Mar 2 PM 14:30 - 15:30 |
(23) EA |
14:30-15:30 |
|
|
|
15:30-15:40 |
Break ( 10 min. ) |
Sun, Mar 2 PM 15:40 - 16:40 |
(24) SIP |
15:40-16:40 |
[Invited Talk]
Time-domain and spatial-domain linear predictive analysis and its application for audio and speech lossless coding standards |
Yutaka Kamamoto (NTT) |
Mon, Mar 3 AM 09:45 - 11:05 |
(25) |
09:45-10:05 |
|
(26) |
10:05-10:25 |
|
(27) SP |
10:25-10:45 |
Study on a Japanese Speech Understanding Model Robust to Multi-Item Questioning |
Yuki Takashima, Atsushi Ando, Taichi Asami (NTT) |
(28) SP |
10:45-11:05 |
Measurement of time delay tolerance for third-person game live audio commentary |
Ryosuke Matsushita, Ryosuke Sakai, Koki Fukuda (Keio Univ.), Shinnosuke Takamichi (Keio Univ./UTokyo), Kota Iura, Yuki Saito (UTokyo), Graham Neubig (CMU), Katsuhito Sudoh (NWU), Hiroya Takamura, Tatsuya Ishigaki (AIST) |
Mon, Mar 3 AM 09:45 - 11:05 |
(29) EA |
09:45-11:05 |
[Poster Presentation]
Machine-type dependent positive and negative division of training data for unsupervised anomalous detection of machinery sounds |
Yuuki Tachioka (Denso IT Laboratory) |
(30) EA |
09:45-11:05 |
[Poster Presentation]
Evaluation of Sound Field and Multizone Reproduction Performance in Loudspeaker Arrays with Different Enclosures |
Tong Zhou, Kana Itahashi, Akitoshi Kataoka (Ryukoku Univ.) |
(31) EA |
09:45-11:05 |
[Poster Presentation]
Shifted sound-image perception using pre-virtual-leading hypersonic signals with bass frequency envelopes |
Ryota Imanaka, Yuting Geng (Ritsumeikan Univ.), Masato Nakayama (Osaka Sangyo Univ.), Takanobu Nishiura (Ritsumeikan Univ.) |
(32) EA |
09:45-11:05 |
[Poster Presentation]
Decentralized Independent Vector Analysis Based on Majorization-Minimization Algorithm for Distributed Microphone Arrays |
Katsuhiro Morita, Kouei Yamaoka, Norihiro Takamune, Hiroshi Saruwatari (UTokyo) |
(33) EA |
09:45-11:05 |
[Poster Presentation]
Evaluation of noise reduction performance of multichannel feedforward ANC system with optical laser microphone in reverberant environments |
Maoto Mizutani, Kenta Iwai, Takanobu Nishiura (Ritsumeikan Univ.), Yoshiharu Soeta (AIST) |
(34) EA |
09:45-11:05 |
[Poster Presentation]
Study on Virtual Sensing ANC Using Tetrahedral Microphone Arrays |
Toma Yoshimatsu (UEC), Hiroaki Itou, Shihori Kozuka, Noriyoshi Kamado (NTT), Yoichi Haneda (UEC) |
(35) EA |
09:45-11:05 |
[Poster Presentation]
Improvement of Localization Performance in Binaural Rendering with Panning for Transmission Systems with Delay |
Kenta Takeuchi, Masayuki Nishiguchi, Koji Abe, Kanji Watanabe (Akita Prefectural Univ.) |
(36) EA |
09:45-11:05 |
[Poster Presentation]
Creation of representative head-related impulse responses for smooth reproduction of moving audio objects |
Kazuki Hoshito, Masayuki Nishiguchi, Kanji Watanabe, Koji Abe (Akita Prefectural Univ.) |
(37) EA |
09:45-11:05 |
[Poster Presentation]
Augmentation of Asynchronous Data for Acoustic Scene Classification Using Asynchronous Distributed Microphone Arrays |
Takao Kawamura, Nobutaka Ono (TMU) |
(38) EA |
09:45-11:05 |
[Poster Presentation]
Performance Evaluation of Active Noise Control System without Error Microphone Introducing Primary Path Estimation under Moving Noise Source Position |
Ryo Matsuura, Shota Toyooka (Kansai Univ.), Kenta Iwai (Ritsumeikan Univ.), Yoshinobu Kajikawa (Kansai Univ.) |
(39) EA |
09:45-11:05 |
[Poster Presentation]
Numerical Simulation based Design of Moving Sound Sources Using Impulse Response Combination and Acoustic Effects Integration |
Ryuuta Kouma, Sun Chang, Kan Okubo (TMU) |
|
11:05-11:20 |
Break ( 15 min. ) |
Mon, Mar 3 AM 11:20 - 12:40 |
(40) EA |
11:20-11:40 |
Proposal and Analysis of Metric for Evaluating Sampling Frequency Independence Based on Local Equivariance Error |
Kanami Imamura (UTokyo/AIST), Tomohiko Nakamura (AIST), Norihiro Takamune (UTokyo), Kouhei Yatabe (TUAT), Hiroshi Saruwatari (UTokyo) |
(41) EA |
11:40-12:00 |
Traffic Volume and Speed Estimation Using Pre-trained Audio Model |
Tomohiro Takahashi (TMU), Natsuki Ueno (TMU/Kumamoto Univ.), Yuma Kinoshita (Tokai Univ.), Yukoh Wakabayashi (TUT), Nobutaka Ono (TMU), Makiho Sukekawa, Seishi Fukuma, Hiroshi Nakagawa (NEE) |
(42) EA |
12:00-12:20 |
A method of estimating the power of residual noise by using the auxiliary filter |
Kensaku Fujii (Kodaway Lab.), Mitsuji Muneyasu (Kansai Univ.), Yoshifumi Chisaki (CIT) |
(43) EA |
12:20-12:40 |
Memory-efficient and low-computational hierarchical musical instruments classification using element selection |
Ryu Kato (Tokyo Metropolitan Univ.), Natsuki Ueno (Kumamoto Univ.), Nobutaka Ono (Tokyo Metropolitan Univ.), Ryo Matsuda, Kazunobu Kondo, Yu Takahashi (Yamaha Corp.) |
Mon, Mar 3 AM 11:20 - 12:40 |
(44) SIP |
11:20-12:40 |
[Poster Presentation]
Low-Dose DECT Image Reconstruction Using Edge Sparsity and Similarity |
Akira Egashira, Daichi Kitahara (Keio Univ.) |
(45) SIP |
11:20-12:40 |
[Poster Presentation]
Validation of the Optimality and Usefulness of Tight Windows Designed via Manifold Optimization |
Keito Takahashi, Daichi Kitahara (Keio Univ.) |
(46) SIP |
11:20-12:40 |
[Poster Presentation]
1D Nonnegative Spline Smoothing by Convex Semi-Infinite Programming |
Hiroki Arai, Daichi Kitahara (Keio Univ.) |
(47) SIP |
11:20-12:40 |
[Poster Presentation]
MMSE Beamforming with the Consistency of Multiple Covariance Matrices for Phased Array Weather Radar |
Shinji Naito, Daichi Kitahara (Keio Univ.) |
(48) SIP |
11:20-12:40 |
[Poster Presentation]
An Extension of Privacy-Preserving FedSGD Federated Learning with Random Binary Weights to FedAvg Federated Learning |
Hiroto Sawada, Shoko Imaizumi (Chiba Univ.), Hitoshi Kiya (Tokyo Metropolitan Univ.) |
(49) SIP |
11:20-12:40 |
[Poster Presentation]
Pseudo Artifacts and Data Augmentation for Real-World Video Deblurring Using Deep Learning |
Sota Moriyama, Koichi Ichige (YNU) |
(50) SIP |
11:20-12:40 |
[Poster Presentation]
Multichannel Speech Enhancement Method Using Dilated Semi-Dense Convolution Network |
Tomohiro Ueyama, Koichi Ichige (Yokohama National Univ.), Takahiro Murakami (Meiji Univ.) |
(51) SIP |
11:20-12:40 |
[Poster Presentation]
Detecting Human-Object Contact Using Human Region Enlargement on Video |
Kaito Kira, Sota Moriyama, Koichi Ichige (Yokohama National Univ.) |
(52) SIP |
11:20-12:40 |
[Poster Presentation]
Study on Hybrid Compensation Selective Fixed-Filter Active Noise Control Using One-Dimensional CNN |
Hiroki Tsukahara, Shota Toyooka (Kansai Univ.), Kenta Iwai (Ritsumeikan Univ.), Shunsuke Kita (ORIST), Yoshinobu Kajikawa (Kansai Univ.) |
(53) SIP |
11:20-12:40 |
[Poster Presentation]
Improvement of Estimation of Variance for Acoustic Echo and Noise Canceller Based on Variable-Step-Size-Shared-Error NLMS Algorithm |
Kenta Iwai (Ritsumeikan Univ.) |
(54) SIP |
11:20-12:40 |
[Poster Presentation]
On System Identification Based on Dynamic Mode Decomposition with Control for Model Predictive Control |
Sekiya Futamura (Niigata grad school), Shogo Muramatsu (Niigata Univ.) |
|
12:40-13:40 |
Break ( 60 min. ) |
Mon, Mar 3 PM 13:10 - 13:40 |
|
- |
|
Mon, Mar 3 PM 13:40 - 15:04 |
(55) SP |
13:40-13:47 |
[No paper] Extension of the head-related transfer function from two front/back directions to any direction in the upper hemisphere |
Masaki Saito, Ryota Shimokura, Yoji Iiguni (Osaka Univ.) |
(56) SP |
13:47-13:54 |
[No paper] Cross-modal effects using salty and sweet tastes to improve the accuracy of sound image localization for general HRTF |
Hikaru Yoshida, Kenji Kita (Daido Univ.) |
(57) SP |
13:54-14:01 |
[No paper] Investigation of human perception based CLAPScore |
Taisei Takano, Yuki Okamoto, Yusuke Kanamori, Yuki Saito (UTokyo), Ryotaro Nagase (Ritsumeikan Univ.), Hiroshi Saruwatari (UTokyo) |
(58) SP |
14:01-14:08 |
[No paper] Cartilage Conduction-based approach to reduce discomfort from low-frequency noise |
Ito Hirata, Ryota Shimokura (Osaka Univ.), Naoto Sasaoka (Tottori Univ.), Yoji Iiguni (Osaka Univ.) |
(59) SP |
14:08-14:15 |
[No paper] Construction of subjective evaluation dataset for automatic evaluation of input-output relevance in text-to-audio |
Yusuke Kanamori, Yuki Okamoto, Taisei Takano (UTokyo), Shinnosuke Takamichi (Keio Univ./UTokyo), Yuki Saito, Hiroshi Saruwatari (UTokyo) |
(60) SP |
14:15-14:22 |
[No paper] Selective Noise Control Using Virtual Sensing and Active Noise Control with Cartilage Conduction |
Yoshiki Kato, Ryota Shimokura (Osaka Univ.), Naoto Sasaoka (Tottori Univ.), Yoji Iiguni (Osaka Univ.) |
(61) SP |
14:22-14:29 |
[No paper] Online Processing for Spatial Voice Conversion Using BSS, VC, and Remixing |
Kenta Takada, Kentaro Seki, Yuki Saito, Kouei Yamaoka, Yuto Ishikawa, Hiroshi Saruwatari (UTokyo) |
(62) SP |
14:29-14:36 |
|
|
(63) SP |
14:36-14:43 |
[No paper] Impression Caption Dataset for Environmental Sounds |
Yuki Okamoto (UTokyo), Ryotaro Nagase (Ritsumeikan Univ.), Keisuke Imoto (Doshisha Univ.), Junichi Yamagishi (NII), Yuki Saito (UTokyo), Takahiro Fukumori, Yoichi Yamashita (Ritsumeikan Univ.) |
(64) SP |
14:43-14:50 |
[No paper] Unsupervised EEG Channel Selection Based on Inter-individuals Distance in Covariance Matrix |
Sota Hayashi, Hiroshi Higashi, Yuichi Tanaka (Osaka Univ.) |
(65) SP |
14:50-14:57 |
[No paper] Joint analysis of distance and class of environmental sound from single channel recording |
Yuki Hoshikawa, Keisuke Imoto, Takao Tsuchiya (Doshisha Univ.) |
(66) SP |
14:57-15:04 |
[No paper] Analysis of time-frequency features in speech decoding from intracranial recordings |
Shoya Murakami, Shuji Komeiji, Yu Watanabe (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (Science Tokyo), Toshihisa Tanaka (TUAT) |
|
15:04-15:20 |
Break ( 16 min. ) |
Mon, Mar 3 PM 15:20 - 16:20 |
(67) SIP |
15:20-16:20 |
[Special Invited Talk]
Spatial Audio Intelligence: From Representation to Understanding and Control of Auditory Environments |
Woon-Seng Gan (NTU Singapore) |
|
16:20-16:30 |
Break ( 10 min. ) |
Mon, Mar 3 PM 16:30 - 16:45 |
|
- |
|
Tue, Mar 4 AM 09:30 - 10:50 |
(68) SIP |
09:30-09:50 |
CLaSP: Multimodal Foundation Model Using Time Series Signal Data and Natural Language |
Aoi Ito (Hitachi Ltd./Hosei Univ.), Kota Dohi, Yohei Kawaguchi (Hitachi Ltd.) |
(69) SIP |
09:50-10:10 |
Domain-Independent Automatic Generation of Descriptive Texts for Time-Series Data |
Kota Dohi (Hitachi), Aoi Ito (Hitachi/Hosei), Harsh Purohit, Tomoya Nishida, Takashi Endo, Yohei Kawaguchi (Hitachi) |
(70) SIP |
10:10-10:30 |
Riverbed Estimation using Locally-Structured Unitary Network with Multiresolution Representation |
Seiyu Hitomi, Yasas Godage, Hiroyasu Yasuda, Kiyoshi Hayasaka, Shogo Muramatsu (Niigata Univ.) |
(71) SIP |
10:30-10:50 |
Online Short-term Prediction of Riverbed Evolution Using Extended Dynamic Mode Decomposition |
Reiya Asuke, Masahiro Yukawa (Keio Univ.), Shogo Muramatsu, Daichi Moteki, Hiroyasu Yasuda (Niigata Univ.) |
Tue, Mar 4 AM 09:30 - 10:50 |
(72) SP |
09:30-10:50 |
[Poster Presentation]
Improving Conv-TasNet for Multi-Channel Speech Enhancement and Examination of Microphone Placement |
Taisuke Morikawa, Akitoshi Kataoka (Grad. Sch., Ryukoku Univ.) |
(73) SP |
09:30-10:50 |
[Poster Presentation]
An Analysis of Speaker Representation for Target-Speaker Speech Processing |
Takanori Ashihara, Takafumi Moriya, Shota Horiguchi (NTT), Junyi Peng (BUT), Tsubasa Ochiai, Marc Delcroix, Kohei Matsuura, Hiroshi Sato (NTT) |
(74) SP |
09:30-10:50 |
[Poster Presentation]
Speech spoofing detection using deep learning model with multiple acoustic features |
Haruto Namba, Sayaka Shiota (TMU) |
(75) SP |
09:30-10:50 |
[Poster Presentation]
Necessity of Voice Sample Selection in Qualification Tests for Crowdsourced Subjective Audio Quality Evaluation |
Takuma Yabe, Moe Yaegashi, Teppei Nakano, Tetsuji Ogawa (Waseda Univ.) |
(76) SP |
09:30-10:50 |
[Poster Presentation]
JIS: Japanese Speech Corpus of Idol Speakers with Various Speaking Styles |
Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko (NTT) |
(77) SP |
09:30-10:50 |
|
|
(78) |
09:30-10:50 |
|
(79) |
09:30-10:50 |
|
(80) |
09:30-10:50 |
|
|
10:50-11:05 |
Break ( 15 min. ) |
Tue, Mar 4 AM 11:05 - 12:25 |
(81) SIP |
11:05-11:25 |
Performance Evaluation of Data-driven Water Level Distribution Prediction for Integrated River Control |
Hiromu Kanauchi, Ryuto Ito, Hiroyasu Yasuda (Niigata Univ.), Masaaki Nagahara (Hiroshima Univ.), Shogo Muramatsu (Niigata Univ.) |
(82) SIP |
11:25-11:45 |
Estimation of Riverbed Undulation using DMDc for Active River Channel Control with Groynes and Its Evaluation |
Chen Zhang, Hiroyasu Yasuda, Kiyoshi Hayasaka, Shogo Muramatsu (Niigata Univ.) |
(83) SIP |
11:45-12:05 |
Clustering for time-varying graphs with varying number of nodes |
Tomoya Akabayashi (Osaka Univ.), Hayate Kojima (TUAT), Junya Hara, Hiroshi Higashi, Yuichi Tanaka (Osaka Univ.) |
(84) SIP |
12:05-12:25 |
Generalized Graph Signal Sampling with Pre-selection of Critical Vertices |
Keitaro Yamashita, Kazuki Naganuma, Shunsuke Ono (Science Tokyo) |
Tue, Mar 4 AM 11:05 - 12:25 |
(85) SP |
11:05-12:25 |
[Poster Presentation]
Construction of an ASR model based on self-supervised learning using intermediate layer outputs |
Keigo Hojo, Yukoh Wakabayashi (TUT), Kengo Ohta (NITAC), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) |
(86) SP |
11:05-12:25 |
[Poster Presentation]
Improvement and Evaluation of Utterance End Time Estimation Method for Spoken Dialog Systems |
Takanori Kanai, Yukoh Wakabayashi (TUT), Ryota Nishimura (Tokushima Univ.), Norihide Kitaoka (TUT) |
(87) SP |
11:05-12:25 |
[Poster Presentation]
Improvement of the GESI for Predicting Speech Intelligibility in Older Adults |
Ayako Yamamoto, Fuki Miyazaki, Toshio Irino (Wakayama Univ.) |
(88) SP |
11:05-12:25 |
[Poster Presentation]
Sammo: Incorporating MAMBA-2 into Modern Streaming Encoders for Japanese ASR |
Wen Shen Teo, Yasuhiro Minami (UEC) |
(89) SP |
11:05-12:25 |
[Poster Presentation]
Improvement of Speech Recognition Performance for Elderly Speech by Alternating Learning of Acoustic and Linguistic information |
Kaito Takahashi, Yukoh Wakabayashi (TUT), Kengo Ohta (NIT, Anan College), Norihide Kitaoka (TUT) |
(90) EA |
11:05-12:25 |
[Poster Presentation]
Source Separation Based on Regularization Using Back-Projected Demixing Vectors |
Kukuru Koiso, Taishi Nakashima, Nobutaka Ono (TMU) |
(91) EA |
11:05-12:25 |
[Poster Presentation]
Real-Time Blind Source Separation for Head-Mounted Microphone Array Using Own Voice Selection Based on Relative Transfer Function |
Kyoka Kazama, Taishi Nakashima, Nobutaka Ono (TMU) |
(92) EA |
11:05-12:25 |
[Poster Presentation]
Source-specific forgetting factor in multiplicative update online AuxIVA |
Kaito Masuko, Taishi Nakashima, Nobutaka Ono (Tokyo Metropolitan Univ.) |
(93) EA |
11:05-12:25 |
[Poster Presentation]
Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation Using Independent Deeply Learned Matrix Analysis for Real-Time Multichannel Speech Extraction in Diffuse Noise Environment |
Yuki Nakanishi, Yuto Ishikawa, Norihiro Takamune, Hiroshi Saruwatari (The Univ. of Tokyo) |
(94) EA |
11:05-12:25 |
[Poster Presentation]
Two-Stage Processing of Blind Source Separation and DNN-based Speech Enhancement for In-Car Speech Recognition |
Yutsuki Takeuchi, Taishi Nakashima, Nobutaka Ono (Tokyo Metropolitan Univ.), Takashi Takazawa, Shuhei Shimanoe, Yoshinori Tsuchiya (MIRISE Technologies) |
|
12:25-13:25 |
Break ( 60 min. ) |
Tue, Mar 4 PM 13:25 - 14:25 |
(95) |
13:25-13:45 |
|
(96) |
13:45-14:05 |
|
(97) |
14:05-14:25 |
|
Tue, Mar 4 PM 13:25 - 14:45 |
(98) SIP |
13:25-14:45 |
[Poster Presentation]
Speech Synthesis from Electrocorticogram During Imagined Speech Using a Transformer-Based Decoder |
Shuji Komeiji, Kai Shigemi (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (Science Tokyo), Kohei Yatabe, Toshihisa Tanaka (TUAT) |
(99) SIP |
13:25-14:45 |
[Poster Presentation]
Control of 3D Physical Model of Movable Artificial Variable Width Channel with Reinforcement Learning -- For River Digital Twin -- |
Ryusei Aoki, Sisaykeo Phonepaserth, Shogo Muramatsu (Niigata Univ.) |
(100) SIP |
13:25-14:45 |
[Poster Presentation]
Fundamental considerations for dynamics modeling with Locally Structured Unitary Network |
Motoyasu Suzuki, Yasas Godage, Shogo Muramatsu (Niigata Univ.) |
(101) SIP |
13:25-14:45 |
[Poster Presentation]
A Study of Constraints on Directivity Design Method for Improving Suppression Performance |
Miryu Goino, Kenji Suyama (Tokyo Denki Univ.) |
(102) SIP |
13:25-14:45 |
[Poster Presentation]
A dynamic data augmentation method using diffusion models for classification of intensive care EEG |
Takuma Bingo, Hajime Yano, Taichiro Ashizaki, Kazuma Koda, Masaya Togo (Kobe Univ.), Riki Matsumoto (Kobe Univ./Kyoto Univ.), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ.) |
(103) SIP |
13:25-14:45 |
[Poster Presentation]
Individual differences in interoception affect brain activity during music recall |
Kazuki Matsunaga, Ingon Chanpornpakdi, Toshihisa Tanaka (TUAT) |
(104) SIP |
13:25-14:45 |
[Poster Presentation]
Nonnegative Sparse Optimization Using ReLU Activation Function and Its Application to Deep Unfolding |
Haruki Esaki, Towa Yasui, Seisuke Kyochi (Kogakuin Univ.) |
(105) SIP |
13:25-14:45 |
[Poster Presentation]
Sparse Modeling for Electroencephalogram-based Sustained Attention Assessment |
Masaya Togashi, Ingon Chanpornpakdi, Toshihisa Tanaka (TUAT) |
(106) EA |
13:25-14:45 |
[Poster Presentation]
Large-Scale Numerical Simulation of Tsunami-Induced Infrasound Using Spherical Coordinates |
Masami Tokuda, Yoshiki Saito, Kan Okubo (TMU) |
(107) EA |
13:25-14:45 |
[Poster Presentation]
Study on Acoustic Analysis of Micro Speakers Considering Electromagnetic-Structure-Acoustic Coupling |
Kakeru Yamaguchi, Shota Toyooka (Kansai Univ.), Kenta Iwai (Ritsumeikan Univ.), Shunsuke Kita (ORIST), Yoshinobu Kajikawa (Kansai Univ.) |
(108) EA |
13:25-14:45 |
[Poster Presentation]
Acoustic Vibration Analysis of Distributed Mode Loudspeaker (DML) Using Pattern Structures |
Yuito Kimura, Kan Okubo (Tokyo Metropolitan Univ.) |
|
14:45-15:00 |
Break ( 15 min. ) |
Tue, Mar 4 PM 15:00 - 16:00 |
(109) |
15:00-16:00 |
|