Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
SP, NLC, IPSJ-SLP, IPSJ-NL [detail] |
2023-12-02 13:30 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Primary: On-site, Secondary: Online) |
Effectiveness of Signal Compression in Speech Enhancement with Diffusion Models Yuki Nishi (Titech), Koji Iwano (Tokyo City Univ.), Koichi Shinoda (Titech) NLC2023-14 SP2023-34 |
(To be available after the conference date) [more] |
NLC2023-14 SP2023-34 pp.1-6 |
EMM, ITE-ME, IE, LOIS, IEE-CMN, IPSJ-AVM [detail] |
2023-09-07 09:50 |
Osaka |
Osaka Metropolitan University - Nakamozu Campus- (Hybrid) (Primary: On-site, Secondary: Online) |
Improving Performance of Convolutional Neural Network-Based Driver Behavior Recognition Shengbiao Wang, Koji Iwano (Tokyo City Univ.) |
[more] |
|
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 14:05 |
Online |
Online |
[Poster Presentation]
Noise-robust time-domain speech separation with basis signals for noise Kohei Ozamoto (Tokyo Tech), Koji Iwano (TCU), Kuniaki Uto, Koichi Shinoda (Tokyo Tech) EA2020-70 SIP2020-101 SP2020-35 |
Recently, speech separation using deep learning has been extensively studied. TasNet, a time-domain method that directly... [more] |
EA2020-70 SIP2020-101 SP2020-35 pp.63-67 |
PRMU, BioX |
2018-03-18 14:45 |
Tokyo |
|
A Kinect-based Multimodal Person Authentication System with User Existence Confirmation Lin Zhou, Koji Iwano (Tokyo City Univ.) BioX2017-46 PRMU2017-182 |
Person identification systems used for receiving pension need to have countermeasures against fraud by "spoofing" as ann... [more] |
BioX2017-46 PRMU2017-182 pp.61-66 |
IE, ITE-ME, ITE-AIT [detail] |
2017-10-06 08:30 |
Nagasaki |
|
Neural network-based estimation of degree of feeling that natural objects appear in photographic images Manami Sasaki, Koji Iwano (Tokyo City Univ.) |
[more] |
|
SP |
2017-08-30 16:15 |
Kyoto |
Kyoto Univ. |
SP2017-32 |
(To be available after the conference date) [more] |
SP2017-32 pp.55-60 |
SP |
2016-08-25 11:10 |
Kyoto |
ACCMS, Kyoto Univ. |
SP2016-37 |
(To be available after the conference date) [more] |
SP2016-37 pp.53-58 |
SP |
2015-01-22 15:30 |
Gifu |
Juroku Plaza |
Analysis of effects of voice mimicry on speaker verification and acoustic features of the imitated voices Koji Iwano, Taito Sone, Kanako Sakamoto (Tokyo City Univ.) SP2014-134 |
It is quite important to recognize the vulnerability of speaker verification (SV) systems against voice mimicry attacks ... [more] |
SP2014-134 pp.43-48 |
SP, IPSJ-SLP (Joint) |
2014-07-26 10:30 |
Iwate |
Hotel Hanamaki |
Dialogue group detection and speaker diarization for multi-party conversational speech recorded by multiple smartphones Koji Iwano, Toko Iizuka, Kanoko Saito, Shuhei Yoneyama (Tokyo City Univ.) SP2014-71 |
This paper describes acoustic information processing techniques for multi-party conversational speech recorded by attend... [more] |
SP2014-71 pp.47-52 |
PRMU, SP |
2012-02-10 15:50 |
Miyagi |
|
Two-pass Approach for Recognizing Code-Switching Speech Yonatan Andy Fajar Nugraha, Koichi Shinoda, Sadaoki Furui (Tokyo Inst. of Tech.), Koji Iwano (Tokyo City Univ.) PRMU2011-235 SP2011-150 |
We propose a recognition method for code-switching utterances in multilingual speech recognition. Code-switching is the ... [more] |
PRMU2011-235 SP2011-150 pp.225-229 |
SP |
2011-07-23 09:30 |
Hokkaido |
Jozankei Grand Hotel |
Nonlinear Normalization Using q-Logarithm for Robust Speech Recognition Hilman F. Pardede, Koichi Shinoda (Tokyo Inst. of Tech.), Koji Iwano (Tokyo City Univ.) SP2011-49 |
The performance of speech recognition degrades signicantly in the noisy environment. Most compensation methods to improv... [more] |
SP2011-49 pp.45-50 |
SP |
2010-06-18 10:00 |
Fukuoka |
Kyushu University |
Noise-robust speech recognition decoder using speech/non-speech confidence measures Tasuku Oonishi (Tokyo Inst. of Tech.), Koji Iwano (Tokyo City Univ.), Sadaoki Furui (Tokyo Inst. of Tech.) SP2010-30 |
Abstract In a speech recognition system a Voice Activity Detection(VAD) is a crucial component for maintaining accuracy.... [more] |
SP2010-30 pp.49-54 |
SP |
2009-06-25 14:30 |
Hokkaido |
Clark Memorial Hall, Hokkaido Univ. |
A mean F0 speaker adaptation method for regression model-based F0 contour generation Hosana Kamiyama, Takahiro Shinozaki (Tokyo Inst. of Tech.), Koji Iwano (Tokyo City Univ.), Sadaoki Furui (Tokyo Inst. of Tech.) SP2009-38 |
This paper proposes a new speaker adaptation method for the fundamental frequency ($F_0$) contour generation models base... [more] |
SP2009-38 pp.87-92 |
SP |
2009-01-29 14:45 |
Nara |
NAIST |
Speeding up fundamental frequency information extraction by Hough transform for noise-rubust speech recognition Hideki Yasui, Koichi Shinoda, Sadaoki Furui (Tokyo Inst. of Tech.), Koji Iwano (Musashi Inst. of Tech.) SP2008-129 |
While $F_0$ information obtained by Hough transform has been shown to be effective in speech recognition in noisy enviro... [more] |
SP2008-129 pp.19-24 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Accent Analysis for Mandarin Large Vocabulary Continuous Speech Recognition Dong Yang, Koji Iwano, Sadaoki Furui (Tokyo Inst. of Tech.) SP2007-201 |
This paper presents our work on accent issues in Mandarin large vocabulary continuous speech recognition. What we want t... [more] |
SP2007-201 pp.87-92 |
SP |
2008-03-20 15:15 |
Tokyo |
Univ. Tokyo |
[Poster Presentation]
Initial Evaluation of the Drivers' Japanese Speech Corpus in a Car Environment Kousuke Hiraki, Takahiro Shinozaki, Koji Iwano, Agnieszka Betkowska, Koichi Shinoda, Sadaoki Furui (Tokyo Inst. of Tech.) SP2007-202 |
Car navigation systems are getting more and more popular and many of them equip a speech recognition system as hands-fre... [more] |
SP2007-202 pp.93-98 |
SP |
2007-05-31 11:00 |
Kyoto |
ATR |
A study on models for HMM-based spontaneous speech synthesis Tatsuya Akagawa, Koji Iwano, Sadaoki Furui (Tokyo Tech) SP2007-3 |
This paper describes construction of an HMM-based spontaneous speech
synthesizer and investigates effectiveness of the ... [more] |
SP2007-3 pp.13-18 |
SP |
2007-05-31 11:30 |
Kyoto |
ATR |
A study on multimodal speech recognition for spoken dialogue systems Shunsuke Takayama, Toshihide Matsuo, Koji Iwano, Sadaoki Furui (Tokyo Tech) SP2007-4 |
This paper describes speaker-independent multimodal speech recognition toward constructing multimodal spoken dialogue sy... [more] |
SP2007-4 pp.19-24 |