ken-system: Advance Program - 2014-05-SP-IPSJ-MUS

IEICE Technical Committee Submission System
Advance Program

Technical Committee on Speech (SP)

Chair		Takeshi Kawabata (Kwansei Gakuin Univ.)
Vice Chair		Hisashi Kawai (KDDI Labs.)
Secretary		Motoyuki Suzuki (Osaka Inst. of Tech.), Tomoki Toda (NAIST)
Assistant		Yamato Ohtani (Toshiba), Takanobu Oba (NTT)

Special Interest Group on Music and Computer (IPSJ-MUS)

Chair		Rumi Hiraga
Secretary		Tetsuro Kitahara, Tetsuaki Baba, Keiji Hirata, Kazuyoshi Yoshii, Hirokazu Kameoka

Conference Date	Sat, May 24, 2014 08:50 - 17:45; Sun, May 25, 2014 09:00 - 18:00
Topics
Conference Place
Copyright and reproduction	All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)

Sat, May 24 AM 08:50 - 09:00
(1) SP	08:50-09:00	"Ongaku" Symposium 2014: The 2nd Symposium on Any Topics Related to Acoustics, Audition and Natural Language SP2014-1	Hirokazu Kameoka (Univ. of Tokyo/NTT), Eriko Aiba (UEC), Yasunori Ohishi (NTT), Tetsuro Kitahara (Nihon Univ.), Tatsuya Kitamura (Konan Univ.), Shoei Sato (NHK), Masahito Togami (Hitachi), Tomoki Toda (NAIST), Kazuyoshi Yoshii (Kyoto Univ.)
Sat, May 24 AM 09:00 - 09:45
(2) SP	09:00-09:45	[Invited Talk] Speaker adaptation technologies for speech synthesis and its application to assistive technology SP2014-2	Junichi Yamagishi (NII)
Sat, May 24 AM 09:45 - 10:30
(3)	09:45-10:30
Sat, May 24 AM 10:30 - 11:15
(4) SP	10:30-11:15	[Invited Talk] Infinite data analysis and Bayesian nonparametrics for audio signal processing SP2014-3	Masahiro Nakano (NTT)
Sat, May 24 AM 11:15 - 15:30
Sat, May 24 PM 15:30 - 16:15
(5) SP	15:30-16:15	[Invited Talk] From multimodal spatial hearing to engineering applications to cope with severe disasters -- Our recent research results on spatial acoustic information sciences -- SP2014-4	Yo-iti Suzuki, Shuichi Sakamoto (Tohoku Univ.)
Sat, May 24 PM 16:15 - 17:00
(6)	16:15-17:00
Sat, May 24 PM 17:00 - 17:45
(7)	17:00-17:45
Sun, May 25 AM 09:00 - 09:45
(8) SP	09:00-09:45	[Invited Talk] Behavioral neurosciences of vocal control and learning -- using the songbird as a model system -- SP2014-5	Ryosuke O. Tachibana (Univ. of Tokyo)
Sun, May 25 AM 09:45 - 10:30
(9) SP	09:45-10:30	[Invited Talk] Machine Translation -- Why couldn't we do it? Why are we starting to be able to now? -- SP2014-6	Graham Neubig (NAIST)
Sun, May 25 AM 10:30 - 11:15
(10) SP	10:30-11:15	[Invited Talk] Applications and Advances of Deep Learning for Automatic Speech Recognition SP2014-7	Yotaro Kubo (Amazon)
Sun, May 25 AM 11:15 - 15:30
Sun, May 25 PM 15:30 - 16:15
(11) SP	15:30-16:15	[Invited Talk] R&D of Music Information Retrieval Technology and Issues for its Deployment to Practical Applications SP2014-8	Keiichiro Hoashi (KDDI Labs)
Sun, May 25 PM 16:15 - 17:00
(12) SP	16:15-17:00	[Invited Talk] What Higher-Order Statistics Tell Us? -- Acoustic Signal Processing Based on Unsupervised Learning -- SP2014-9	Hiroshi Saruwatari (Univ. of Tokyo)
Sun, May 25 PM 17:00 - 17:45
(13)	17:00-17:45
Sun, May 25 PM 17:45 - 18:00
Sat, May 24 AM 11:30 - 15:30
(14)	11:30-15:30
(15)	11:30-15:30
(16)	11:30-15:30
(17)	11:30-15:30
(18)	11:30-15:30
(19)	11:30-15:30
(20)	11:30-15:30
(21)	11:30-15:30
(22)	11:30-15:30
(23)	11:30-15:30
(24)	11:30-15:30
(25) SP	11:30-15:30	A Consideration of Evaluation Measurements in Spoken Term Detection SP2014-10	Satoshi Oshima, Yoshiaki Itoh (Iwate Prefectural Univ.)
(26) SP	11:30-15:30	Robustness of Speaker Identification Using Pseudo Pitch Synchronized Phase Information SP2014-11	Yuta Kawakami, Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai (Shizuoka Univ.), Seiichi Nakagawa (Toyohashi Univ. of Tech.)
(27) SP	11:30-15:30	Visualization of World Englishes pronunciations from a speaker's self-centered viewpoint using attributes of accent, gender, and age SP2014-12	Yuji Kawase, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose (UTokyo), Han-Ping Shen (NCKU)
(28)	11:30-15:30
(29) SP	11:30-15:30	Native language recognition using machine learning SP2014-13	Ryota Sakagami, Kouki Takeshita, Longbiao Wang, Masahiro Iwahashi (Nagaoka Univ. of Tech.)
(30) SP	11:30-15:30	Language recognition in reverberant environments SP2014-14	Kouki Takeshita, Ryota Sakagami, Longbiao Wang, Masahiro Iwahashi (Nagaoka Univ. of Tech.)
(31) SP	11:30-15:30	Discriminative training of acoustic models for system combination SP2014-15	Yuuki Tachioka (Mitsubishi Electric), Shinji Watanabe, Jonathan Le Roux, John R. Hershey (MERL)
(32) SP	11:30-15:30	Distant-talking Speech Recognition with Asynchronous Speech Recording SP2014-16	Shunta Teraoka, Yuma Ueda (Shizuoka Univ.), Longbiao Wang (Nagaoka Univ. of Tech.), Atsuhiko Kai, Taku Fukushima (Shizuoka Univ.)
(33)	11:30-15:30
(34)	11:30-15:30
(35) SP	11:30-15:30	[Research Introduction] A spectrogram-patch-input DNN model for detection and classification of acoustic events robust to speech overlapping scenarios SP2014-17	Miquel Espi, Masakiyo Fujimoto, Yotaro Kubo, Tomohiro Nakatani (NTT)
(36) SP	11:30-15:30	Development of environmental sound collection system using smart devices based on crowd-sourcing approach SP2014-18	Sunao Hara, Akinori Kasai, Masanobu Abe (Okayama Univ.), Noboru Sonehara (NII)
(37) SP	11:30-15:30	ROCKON: Environmental sound collection and recognition system using smartphones SP2014-19	Minori Matsuyama, Takahiko Tsuda, Ryuichi Nisimura, Hideki Kawahara (Wakayama Univ.), Junnosuke Yamada (NTT), Toshio Irino (Wakayama Univ.)
(38)	11:30-15:30
(39)	11:30-15:30
(40)	11:30-15:30
(41)	11:30-15:30
(42) SP	11:30-15:30	Underdetermined Blind Separation of Moving Sources Based on Probabilistic Modeling SP2014-20	Takuya Higuchi, Norihiro Takamune, Tomohiko Nakamura (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT)
(43) SP	11:30-15:30	Psychometric functions for across-frequency gap detection SP2014-21	Yousuke Kikuchi, Takako Mitsudo, Nobuyuki Hirose, Shuji Mori (Kyushu Univ.)
(44) SP	11:30-15:30	Deriving the Salience Level of a Target Sound using a Tapping Technique Method SP2014-22	Shunsuke Kidani, Hsin-I Liao, Makoto Yoneya, Makio Kashino, Shigeto Furukawa (NTT)
(45) SP	11:30-15:30	Perception of stop consonants at the beginning of binaurally fused words SP2014-23	Hitomi Kondo, Yousuke Kikuchi, Takako Mitsudo, Nobuyuki Hirose, Shuji Mori (Kyushu Univ.)
(46) SP	11:30-15:30	Effect of interaural time difference for localization of spatially segregated sound SP2014-24	Daisuke Morikawa (JAIST)
(47) SP	11:30-15:30	Acquisition and retention of perceptual cue for size judgment using whispered speech SP2014-25	Koudai Yamamoto, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara (Wakayama Univ.)
Sun, May 25 AM 11:30 - 15:30
(48)	11:30-15:30
(49)	11:30-15:30
(50)	11:30-15:30
(51)	11:30-15:30
(52)	11:30-15:30
(53)	11:30-15:30
(54)	11:30-15:30
(55)	11:30-15:30
(56)	11:30-15:30
(57)	11:30-15:30
(58) SP	11:30-15:30	Analysis of the Relationship between Pitch and Formant Frequencies in Voice Register Transition SP2014-26	Yasufumi Uezu, Takahiro Furukawa, Tokihiko Kaburagi (Kyushu Univ.)
(59) SP	11:30-15:30	Statistical bandwidth extension using sub-band basis spectrum model SP2014-27	Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Masami Akamine (Toshiba)
(60) SP	11:30-15:30	Text-to-speech prosody synthesis based on probabilistic model for F0 contour SP2014-28	Kento Kadowaki, Tatsuma Ishihara, Nobukatsu Hojo (Univ. of Tokyo), Hirokazu Kameoka (Univ. of Tokyo/NTT)
(61) SP	11:30-15:30	Evaluation of singing voice similarity based on "acoustic singing-structure" SP2014-29	Shun Kojima, Takeshi Saitou, Masato Miyoshi (Kanazawa Univ.)
(62) SP	11:30-15:30	Statistical approach to perceived age control of singing voice SP2014-30	Kazuhiro Kobayashi, Tomoki Toda (NAIST), Tomoyasu Nakano, Masataka Goto (AIST), Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST)
(63) SP	11:30-15:30	A portable application for assistance of vocal sound training by overtone analysis SP2014-31	Iori Sugahara, Takayuki Itoh (Ochanomizu Univ.)
(64) SP	11:30-15:30	An Evaluation of a Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Prediction SP2014-32	Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (NAIST)
(65) SP	11:30-15:30	Design of voice-enabled web test system for eliminating users' impatience SP2014-33	Chihiro Tafuji, Ryuichi Nisimura, Hideki Kawahara, Toshio Irino (Wakayama Univ.)
(66) SP	11:30-15:30	A joint restricted Boltzmann machine for dictionary learning in sparse-representation-based voice conversion SP2014-34	Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.)
(67) SP	11:30-15:30	Speech waveform generation on subband domain SP2014-35	Nobuyuki Nishizawa, Tsuneo Kato (KDDI R&D Labs)
(68) SP	11:30-15:30	A Kana Protocol Recommendation Method for Switch Input Speech Synthesis Systems SP2014-36	Fuming Fang, Takahiro Shinozaki, Takao Kobayashi (Tokyo Tech)
(69) SP	11:30-15:30	Current situations and issues of open-source high-quality speech synthesis system WORLD SP2014-37	Masanori Morise (Univ. of Yamanashi)
(70) SP	11:30-15:30	The Acoustic Feature of the Loudspeaker which used the Reinforced Corrugated Fibreboard for the Enclosure Material SP2014-38	Takuto Isoyama, Yukio Mori (Salesian Polytechnic), Yoshiaki Kiyama
(71) SP	11:30-15:30	Spot-forming method by using two shotgun microphones SP2014-39	Motoyuki Suzuki, Takeshi Honjo (Osaka Inst. of Tech.)
(72) SP	11:30-15:30	Signal processing of ultrasound for osteoporosis diagnosis -- Modeling, time domain analysis, and frequency domain analysis -- SP2014-40	Yoshiki Nagatani (KCCT), Ryosuke O. Tachibana (Univ. of Tokyo)
(73) SP	11:30-15:30	Modulation transfer function based robust method of voice activity detection for noisy reverberant environments -- Utilization of subband SNR estimation -- SP2014-41	Shota Morita, Masashi Unoki (JAIST), Xugang Lu (NICT), Masato Akagi (JAIST)
(74) SP	11:30-15:30	Systematic study on kawaii products (The seventeenth report) -- Basic study for Kawaii sound -- SP2014-42	Michiko Ohkura, Ryo Kanno (Shibaura Inst. Tech.)
(75) SP	11:30-15:30	The basic mechanisms for perception of simultaneity, stream segregation, and temporal order for auditory stimuli SP2014-43	Satoshi Okazaki, Makoto Ichikawa (Chiba Univ.)
(76)	11:30-15:30
(77)	11:30-15:30
(78) SP	11:30-15:30	[Research Introduction] Adaptive adjustment of local temporal structure in song of Bengalese finches SP2014-44	Ryosuke O. Tachibana, Neal A. Hessler, Kazuo Okanoya (Univ. of Tokyo)
(79) SP	11:30-15:30	Modulation of the Temporal Dynamics of Microsaccades with the Presentation of Salient Sounds SP2014-45	Makoto Yoneya, Hsin-I Liao, Shunsuke Kidani, Shigeto Furukawa (NTT), Makio Kashino (NTT/Tokyo Tech)

Announcement for Speakers
Invited Talk	Each talk is allotted 35 minutes for presentation and 10 minutes for discussion.
Poster Presentation	Each poster is allotted 225 minutes for presentation.

Contact Address and Latest Schedule Information
SP	Technical Committee on Speech (SP) [Latest Schedule]
	Contact Address
IPSJ-MUS	Special Interest Group on Music and Computer (IPSJ-MUS) [Latest Schedule]
	Contact Address

Last modified: 2014-05-23 15:06:55

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan