Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
ET |
2023-03-14 17:35 |
Tokushima |
Tokushima University (Primary: On-site, Secondary: Online) |
ET2022-78 |
(To be available after the conference date) [more] |
ET2022-78 pp.117-122 |
SP |
2019-01-27 11:30 |
Ishikawa |
Kanazawa-Harmonie |
Multimodal Data Augmentation for Visual Speech Recognition using Deep Canonical Correlation Analysis Masaki Shimonishi, Satoshi Tamura, Satoru Hayamizu (Gifu University) SP2018-60 |
This paper proposes ta new data augmentation strategy for deep learning, in which feature vectors in one modality can be... [more] |
SP2018-60 pp.41-45 |
PRMU, SP |
2018-06-28 15:10 |
Nagano |
|
Multimodal voice conversion using deep bottleneck features and deep canonical correlation analysis Satoshi Tamura, Kento Horio, Hajime Endo, Satoru Hayamizu (Gifu Univ.), Tomoki Toda (Nagoya Univ.) PRMU2018-24 SP2018-4 |
In this paper, we aim at improving the speech quality in voice conversion and propose a novel multi-modal voice conversi... [more] |
PRMU2018-24 SP2018-4 pp.13-18 |
SP |
2015-10-16 11:15 |
Hyogo |
Kobe Univ. |
Multi-modal speech recognition using deep bottleneck features Satoshi Tamura (Gifu Univ), Hiroshi Ninomiya (Nagoya Univ), Norihide Kitaoka (Tokushima Univ), Shin Osuga (Aisin Seiki), Yurie Iribe (Aichi Prefectural Univ), Kazuya Takeda (Nagoya Univ), Satoru Hayamizu (Gifu Univ) SP2015-69 |
In this paper, we propose a novel multi-modal speech recognition method which uses speech and lip images, employing Deep... [more] |
SP2015-69 pp.57-62 |
SP |
2015-01-22 10:25 |
Gifu |
Juroku Plaza |
A study for the robustness of multi-modal voice conversion Daiki Kawashima, Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) SP2014-128 |
Voice Conversion (VC) is a technique to convert speeches of source speaker into those of target speaker. VC has an issue... [more] |
SP2014-128 pp.7-12 |
PRMU |
2014-03-14 10:45 |
Tokyo |
|
A study on multi-modal speech recognition using depth images Naoya Ukai, Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) PRMU2013-198 |
This paper presents a novel framework which uses depth information of human face and mouth movements as yet another moda... [more] |
PRMU2013-198 pp.179-184 |
PRMU |
2014-03-14 11:15 |
Tokyo |
|
Application of multi-modal speech interface in real environments Takumi Seko, Takuya Kawasaki, Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) PRMU2013-199 |
This paper proposes multi-modal speech interface for mobile devices such as smart phones, which is based on multi-modal ... [more] |
PRMU2013-199 pp.185-190 |
NC, MBE (Joint) |
2013-12-21 17:35 |
Gifu |
Gifu University |
A Study on reduction of heart sounds and ambient noise from lung sounds Takuma Kashima, Tatsuya Yamashita, Satoshi Tamura, Satoru Hayamizu, Kenji Hayashi, Yutaka Nishimoto (Gifu Univ.) MBE2013-92 |
In medical fields, a medical staff often inspects by auscultation, whether patient lung sounds contain an abnormal sound... [more] |
MBE2013-92 pp.93-98 |
SP |
2013-02-28 15:00 |
Aichi |
Daido University |
[Poster Presentation]
Comparison of classification methods for multi-modal voice activity detection Hiroya Okuda, Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) SP2012-124 |
Automatic Speech Recognition (ASR) technology has been developed and used in various situations, such as car navigation ... [more] |
SP2012-124 pp.31-32 |
SP, IPSJ-SLP |
2012-12-20 16:25 |
Tokyo |
TITECH(Ookayama) |
Recent efforts for high-performance multi-modal speech recognition Satoshi Tamura, Peng Shen, Hiroya Okuda, Naoya Ukai, Takuya Kawasaki, Takumi Seko, Satoru Hayamizu (Gifu Univ.) SP2012-88 |
Regarding Multi-Modal Automatic Speech Recognition (MMASR) which uses acoustic and lip/mouth information, this paper des... [more] |
SP2012-88 pp.41-46 |
SP, IPSJ-SLP (Joint) |
2012-07-21 10:00 |
Yamagata |
Hotel Takinoyu (Yamagata Pref.) |
Acoustic model adaptation choosing static and dynamic streams in noisy environments Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) SP2012-56 |
In this paper, an acoustic model adaptation method based on multi streams is proposed for speech recognition in noisy or... [more] |
SP2012-56 pp.33-38 |
PRMU, SP |
2012-02-09 16:30 |
Miyagi |
|
[Poster Presentation]
Sputum detection in real environment using sparse representation Tatsuya Yamashita, Satoshi Tamura, Kenji Hayashi, Yutaka Nishimoto, Satoru Hayamizu (Gifu Univ.) PRMU2011-217 SP2011-132 |
It is difficult for a patient who is on ventilator to cough up sputum from one's respiratory tract by oneself. A medical... [more] |
PRMU2011-217 SP2011-132 pp.135-138 |
SP, NLC, IPSJ-SLP [detail] |
2011-12-20 09:25 |
Tokyo |
|
GIF-SP: Improvement of Speech Recognition Using General and Discriminative Feature Satoshi Tamura, Yoji Tagami, Satoru Hayamizu (Gifu Univ.) NLC2011-47 SP2011-92 |
This paper proposes a general and discriminative feature ``GIF''.
The feature extraction method proposed in this paper ... [more] |
NLC2011-47 SP2011-92 pp.119-124 |
SP |
2011-06-23 16:00 |
Aichi |
Nagoya Univ. |
Model Adaptation using Audio-visual Interaction for Multi-modal Speech Recognition Masanao Oonishi, Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) SP2011-33 |
This paper investigates a linear-regressive model adaptation method, i.e. MLLR (Maximum Likelihood Linear Regression), f... [more] |
SP2011-33 pp.17-22 |
SP |
2010-06-17 16:15 |
Fukuoka |
Kyushu University |
Decision Fusion using Boosting Method for Multi-Modal Voice Activity Detection Shin'ichi Takeuchi, Takashi Hashiba, Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) SP2010-26 |
In this paper, we propose a multi-modal voice activity detection system
(VAD) that uses audio and visual information. ... [more] |
SP2010-26 pp.25-30 |
PRMU, SP, MVE, CQ |
2010-01-22 14:50 |
Kyoto |
Kyoto Univ. |
Multimodal speech recognition using multimodal voice activity detection Satoshi Tamura, Masato Ishikawa, Takashi Hashiba, Shin'ichi Takeuchi, Satoru Hayamizu (Gifu Univ.) CQ2009-105 PRMU2009-204 SP2009-145 MVE2009-127 |
Audio-Visual Automatic Speech Recognition (AVASR) has been developed to enhance the robustness in noisy environments, us... [more] |
CQ2009-105 PRMU2009-204 SP2009-145 MVE2009-127 pp.345-350 |
CAS, CS, SIP |
2009-03-03 15:40 |
Gifu |
Nagaragawa Convention Center |
Human Activity Recognition Based on Acceleration Information Shin'ichi Takeuchi, Shin'ya Ito, Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) CAS2008-142 SIP2008-205 CS2008-116 |
In this paper, we study human activity recognition method based on acceleration information using hidden Markov models f... [more] |
CAS2008-142 SIP2008-205 CS2008-116 pp.229-234 |
SP |
2008-11-20 10:30 |
Gifu |
Softopia Japan (Ogaki) |
Synchronization of speech and image channels in multimodal speech recognition Satoshi Tamura, Masato Ishikawa, Satoru Hayamizu (Gifu Univ.) SP2008-70 |
[more] |
SP2008-70 pp.1-6 |
SP |
2008-11-20 11:00 |
Gifu |
Softopia Japan (Ogaki) |
Improvement of multimodal speech recognition by normalizing visual features Masato Ishikawa, Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) SP2008-71 |
[more] |
SP2008-71 pp.7-12 |
SP |
2007-11-28 |
Chiba |
Chiba Institute of Technology |
Multimodal speech recognition using audio and visual confusion networks Tai Kamisawa, Masato Ishikawa, Satoshi Tamura, Satoru Hayamizu (Gifu Univ.) SP2007-92 |
In multimodal speech recognition, hypotheses from speech and visual recognizers are usually integrated afterwards when b... [more] |
SP2007-92 pp.37-42 |