IEICE Technical Report

Print edition: ISSN 0913-5685      Online edition: ISSN 2432-6380

Volume 114, Number 475

Speech

Workshop Date : 2015-03-02 - 2015-03-03 / Issue Date : 2015-02-23

[PREV] [NEXT]

[TOP] | [2011] | [2012] | [2013] | [2014] | [2015] | [2016] | [2017] | [Japanese] / [English]

[PROGRAM] [BULK PDF DOWNLOAD]


Table of contents

SP2014-135
Wind-induced Noise Reduction in Time Domain Using Closely-aligned Two Microphones.
Naoto Sakata, Hirofumi Nakajima (Kogakuin Univ.), Kazuhiro Nakadai (HRI-JP)
pp. 1 - 6

SP2014-136
Estimation of arrival time differences between direct and reflected sounds from monaural observed signal
Taira Onoguchi, Irwansyah, Yoshifumi Chisaki (Kumamoto Univ.)
pp. 7 - 12

SP2014-137
Unified approach for BSS, DOA estimation, audio event detection and dereverberation with multichannel factorial HMM and DOA mixture model
Takuya Higuchi (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/ NTT)
pp. 13 - 18

SP2014-138
Estimating correlation coefficients of two super-Gaussian complex signals without phase observation
Shigeki Miyabe (University of Tsukuba), Nobutaka Ono (NII/SOKENDAI), Shoji Makino (University of Tsukuba)
pp. 19 - 24

SP2014-139
Overview of 3GPP Standard EVS Codec -- High Performance Speech and Audio Coding for VoLTE --
Takehiro Moriya, Yutaka Kamamoto, Noboru Harada (NTT), Kei Kikuiri, Nobuhiko Naka, Kimitaka Tsutsumi, Shinichirou Oosaki (NTT DOCOMO), Hiroyuki Ehara, Takako Sanda, Takuya Kawashima, Seigo Nakao (Panasonic)
pp. 25 - 30

SP2014-140
Modulation spectrum-constrained trajectory training algorithm for statistical parametric speech synthesis
Shinnosuke Takamichi (NAIST/CMU), Tomoki Toda (NAIST), Alan W. Black (CMU), Satoshi Nakamura (NAIST)
pp. 31 - 36

SP2014-141
Optimization of impulse responses for model training in reverberant speech recognition
Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura, Yoichi Yamashita (Ritsumeikan Univ.)
pp. 37 - 42

SP2014-142
Visualization of Speech Information by using Animated Texts
Yuya Satake, Kazunori Mano (SIT)
pp. 43 - 48

SP2014-143
An Approach Combining Organically Noise, SNR and Voice Activity Estimation
Masaru Fujieda, Takashi Yazu (OKI)
pp. 49 - 54

SP2014-144
Two-Channel Audio Source Separation in Reverberant Conditions Based on Single Voice Activity Segments
Atsushi Matsuda, Arata Kawamura, Youji Iiguni (Osaka Univ.)
pp. 55 - 60

SP2014-145
Multiple Sound Source Tracking Based on Sequential Updating Histogram Using 0-1 Reliability Distribution
Masato Hirakawa, Kenji Suyama (Tokyo Denki Univ.)
pp. 61 - 66

SP2014-146
Evaluation of Modified Amplitude Modulation Methods in the Parametric Array Loudspeaker
Chuang Shi, Yoshinobu Kajikawa (Kansai Univ.)
pp. 67 - 70

SP2014-147
[Special Invited Talk] Active Control of Sound Fields: Enhancing Signals and Attenuating Noise
Stephen Elliott (Univ. of Southampton, ISVR)
p. 71

SP2014-148
[Special Invited Talk] Intermediate representation for statistical pattern recognition
Koichi Shinoda (TokyoTech)
p. 73

SP2014-149
[Poster Presentation] RIP Conditions for Exact Recovery of Group-Sparse Vectors under Two Sets of Non-Overlapping Groups
Keiji Furuya, Masahiro Yukawa (Keio Univ.), Masao Yamagishi, Isao Yamada (Tokyo Tech.)
pp. 75 - 78

SP2014-150
[Poster Presentation] Estimation of Feeling based on a Change in Cerebral Bloodflow, using Polynominal Approximation and Difference
Miho Asano (The Open Univ. of Japan), Masayuki Nambu (HSI), Masaki Yoshida (Osaka Electro-Communication Univ.), Yasuhiro Kawahara (The Open Univ. of Japan)
pp. 79 - 82

SP2014-151
[Poster Presentation] A Study on Anomaly Detection Method Using Kalman Filter and Wavelet Transform
Motomi Yamaguchi, Keiji Osaki (ICU)
pp. 83 - 86

SP2014-152
[Poster Presentation] Density estimation using proper loss functions
Matthew J. Holland, Kazushi Ikeda (NAIST)
pp. 87 - 90

SP2014-153
[Poster Presentation] MSIST-based Image Restoration with Non-separable Oversampled Lapped Transforms
Sho Wakasugi, Shogo Muramatsu (Niigata Univ)
pp. 91 - 96

SP2014-154
[Poster Presentation] A Study on Multiple Exposure Image Fusion with Focus Blur Correction
Ryo Matsuoka, Haruki Ishibashi, Tatsuya Baba, Masahiro Okuda (Univ of Kitakyushu)
pp. 97 - 102

SP2014-155
[Poster Presentation] Computational Precision in HW/SW Co-implementation of Non-separable Oversampled Lapped Transforms
Kenta Seino, Kosuke Furuya, Shogo Muramatsu (Niigata Univ.)
pp. 103 - 108

SP2014-156
[Poster Presentation] Content-Aware Image Compression Using Iterative Edge-Directed Interpolation
Eri Hosogai, Yuichi Tanaka (Tokyo Univ. of Agri. and Tech.)
pp. 109 - 114

SP2014-157
[Poster Presentation] Feature Extractions of Auditory Brain--Computer Interface with Selective Attention
Tatsuhiko Osa, Toshihisa Tanaka (TUAT)
pp. 115 - 120

SP2014-158
[Poster Presentation] PERFORMANCE ANALYSIS OF RETARGETING PYRAMID AND ITS APPLICATIONS
Ryosuke Morita (Tokyo Univ. of Agri. and Tech), Keiichiro Shirai (Shinshu Univ.), Yuichi Tanaka (Tokyo Univ. of Agri. and Tech)
pp. 121 - 126

SP2014-159
[Poster Presentation] An efficient algorithm for convex clustering with l2 regularization for sparsification on probability simplex
Suguru Yasutomi, Toshihisa Tanaka (Tokyo Univ. of Agriculture and Tech.)
pp. 127 - 132

SP2014-160
[Poster Presentation] Reliability-based Automatic Repeat Request for Code Modulation Visual Evoked Potentials in Brain Computer Interfaces
Jun-ichi Sato, Yoshikazu Washizawa (UEC)
pp. 133 - 138

SP2014-161
[Poster Presentation] Image Restoration by Stochastic Proximal Optimization
Shunsuke Ono (Tokyo Tech.), Takamichi Miyata (Chiba Tech.), Itsuo Kumazawa (Tokyo Tech.)
pp. 139 - 144

SP2014-162
[Poster Presentation] Fast image inpainting using Chebyshev polynomial approximation
Masaki Onuki (TUAT), Shunsuke Ono (Tokyo Tech.), Keiichiro Shirai (Shinshu Univ.), Yuichi Tanaka (TUAT)
pp. 145 - 150

SP2014-163
[Poster Presentation] Asynchronous brain-computer interfacing using canonical correlation analysis in phase space
Kaori Suefusa, Toshihisa Tanaka (Tokyo Univ. of Agriculture and Tech.)
pp. 151 - 156

SP2014-164
[Poster Presentation] Comparative Evaluation of Discrimination Methods of Repeating Earthquake Based on Coherence Analysis Using Multi-site Observational Seismic Data
Takahiro Koizumi, Kan Okubo (Tokyo Met. Univ.), Naoki Uchida, Nobunao Takeuchi, Toru Matsuzawa (Tohoku Univ.)
pp. 157 - 161

SP2014-165
[Poster Presentation] Fisher's linear discriminant analysis as a communication of categories
Jun Fujiki, Masaru Tanaka (Fukuoka Univ.), Hitoshi Sakano (NTT Data), Akisato Kimura (NTT)
pp. 163 - 168

SP2014-166
[Poster Presentation] An Acceleration Method of Detection of Repeating Earthquakes Using Multi-GPU Computing-Based Parallel Signal Processing
Taiki Kawakami, Kan Okubo (Tokyo Met. Univ.), Naoki Uchida, Nobunao Takeuchi, Toru Matsuzawa (Tohoku Univ.)
pp. 169 - 174

SP2014-167
[Poster Presentation] A Study on a Fusion Approach Adaptable to Environmental Changes for a Multimodal Biometric System
Qian Shi, Yoshinobu Kajikawa (Kansai Univ.)
pp. 175 - 180

SP2014-168
[Poster Presentation] Image Interpolation based on Weighting Function of Gaussian
Takuro Yamaguchi, Yasuhiro Nakajima, Masaaki Ikehara (Keio Univ.)
pp. 181 - 185

SP2014-169
[Poster Presentation] Automatic Language Identification Based on Posterior Probability on Articulatory Classes -- Language-Independent Articulatory Feature Extractor and Codebook Size --
Takumi Hirata, Kazuyuki Takagi (UEC)
pp. 187 - 190

SP2014-170
[Poster Presentation] An STD method using subword N-gram index
Shota Sugawara, Kazunori Kojima (IPU), Shi-wook Lee (AIST), Yoshiaki Itoh (IPU)
pp. 191 - 196

SP2014-171
[Poster Presentation] Effectiveness of Local Feature, Group Delay Spectrum, MFCC and Their Combination on Phoneme Recognition Performance
Risa Koizumi, Kazuyuki Takagi (UEC)
pp. 197 - 200

SP2014-172
[Poster Presentation] Improvement of STD by acoustic distance between subwords and states obtained from DNN
Ryota Konno (IPU), Shi-wook Lee (AIST), Kazuyo Tanaka (Univ. of Tsukuba), Kazunori Kojima, Yoshiaki Itoh (IPU)
pp. 201 - 206

SP2014-173
[Poster Presentation] Beamforming with reflector by using array manifold vector based on the FDTD method
Hitomi Nakashima, Yoichi Haneda (UEC), Kenta Niwa (NTT)
pp. 207 - 212

SP2014-174
[Poster Presentation] A Consideration on Deteriorated Phoneme Classification by using Haar-Wavelet for Speech Sound Super-Resolution
Yuki Sugii, Akira Sano (Ryukoku Univ.)
pp. 213 - 217

SP2014-175
[Poster Presentation] Directivity control of a dodecahedron loudspeaker array by using a minimum variance beamformer in a spherical harmonics domain.
Kazuna Bando, Yoichi Haneda (UEC)
pp. 219 - 224

SP2014-176
[Poster Presentation] Analysis of Apnea Hypopnea Index (AHI) Prediction Model by Breath sounds
Yuki Kashina, Nichito Nakata, Naoto Sakata, Hirofumi Nakajima (Kogakuin Univ.), Yasuhiro Yamaguchi (UTokyo)
pp. 225 - 230

SP2014-177
[Poster Presentation] Investigation of local area sound reproduction using a circular array of loudspeakers with null-space based sound field control
Takayuki Seki, Yoichi Haneda (UEC)
pp. 231 - 236

SP2014-178
[Poster Presentation] A Study on Compensation Ability of Mirror Filter
Kenta Iwai, Yoshinobu Kajikawa (Kansai Univ.)
pp. 237 - 242

SP2014-179
[Poster Presentation] Design of 3D moving sound image with spherical parametric loudspeaker
Daisuke Ikefuji, Masato Nakayama, Takanobu Nishiura, Yoichi Yamashita (Ritsumeikan Univ.)
pp. 243 - 248

SP2014-180
[Poster Presentation] A Study on Multi-channel Active Noise Control System Using Parametric Array Loudspeakers for Factory Noise
Kihiro Tanaka, Chuang Shi, Yoshinobu Kajikawa (Kansai Univ.)
pp. 249 - 254

SP2014-181
[Poster Presentation] An evaluation of audio spot expansion based on separating emission with curved-type parametric loudspeakers
Tadashi Matsui, Daisuke Ikefuji, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.)
pp. 255 - 260

SP2014-182
[Poster Presentation] Compensation of Nonlinear Distortions for Parametric Array Loudspeakers -- Reduction of Computational Complexity of Volterra Filters and the Compensation Effects --
Yuta Hatano, Satoshi Kinoshita, Chuang Shi, Yoshinobu Kajikawa (Kansai Univ.)
pp. 261 - 266

SP2014-183
[Poster Presentation] Multimodal source number estimation in multi-party conversations
Yukoh Wakabayashi, Masato Nakayama, Takanobu Nishiura, Yoichi Yamashita (Ritsumeikan Univ.)
pp. 267 - 272

SP2014-184
[Poster Presentation] Multi-channel ANC system using automatically-selected reference microphone based on TDOA
Satoru Hase, Yoshinobu Kajikawa (Kansai Univ.)
pp. 273 - 278

SP2014-185
[Poster Presentation] Multi-stage danger sound detection based on power envelope and spectal ratio
Asako Okamoto, Kohei Hayashida, Masato Nakayama, Takanobu Nishiura (Ritsumeikan Univ.)
pp. 279 - 284

SP2014-186
Construction of Inverse System for Transaural Reproduction by Diagonalization Method
Maho Sugaya (Keio Univ.), Kentaro Matsui (Keio Univ./NHK), Yasushige Nakayama (NHK), Shuichi Adachi (Keio Univ.)
pp. 285 - 290

SP2014-187
Multi-input single-output ARX modeling of multi-directional head-related transfer functions
Sekitoshi Kanai (Keio Univ), Kentaro Matsui (NHK/Keio Univ), Yasushige Nakayama (NHK), Shuichi Adachi (Keio Univ)
pp. 291 - 294

SP2014-188
Numerical Analysis of Acoustic Wave Propagation Using Hybrid MM-MOC Method with Non-uniform Grid and Perfectly Matched Layer Absorbing Boundaries
Yuta Matsumura, Kan Okubo, Norio Tagawa (Tokyo Met. Univ.), Takao Tsuchiya (Doshisha Univ.), Takashi Ishizuka (Shimizu)
pp. 295 - 300

SP2014-189
Acoustic feature analysis of desirable voice in chorus
Akihiro Tawa, Toshiyuki Tanaka (Kyoto Univ.)
pp. 301 - 306

SP2014-190
Statistical modeling of an F0 estimation method based on higher-order waveform symmetry and its application to filled pause analysis
Hideki Kawahara, Ryuichi Nisimura, Toshio Irino (Wakayama Univ.)
pp. 307 - 312

SP2014-191
Sampling Synchronization Using Radio Broadcast Signals for Distributed Microphone Arrays
Osamu Hoshuyama (NEC)
pp. 313 - 316

SP2014-192
A study of noise reduction function of Adaptive Microphone Array for Noise Reduction
Yuma Oizumi, Yutaka Kaneda (Tokyo Denki Univ.)
pp. 317 - 322

SP2014-193
Sound Source Separation Using Multiple Weighted Sum Circuits by Two Microphones
Shigeharu Aoki, Kenji Suyama (Tokyo Denki Univ.)
pp. 323 - 328

Note: Each article is a technical report without peer review, and its polished version will be published elsewhere.


The Institute of Electronics, Information and Communication Engineers (IEICE), Japan