|
Chair |
|
Takao Kobayashi (Tokyo Inst. of Tech.) |
Vice Chair |
|
Kazunori Mano (Shibaura Inst. of Tech.) |
Secretary |
|
Yoshiaki Ito (Iwate Pref. Univ.), Akinobu Lee (Nagoya Inst. of Tech.) |
Assistant |
|
Takaaki Hori (NTT), Tatsuya Kitamura (Konan Univ.) |
|
Technical Committee on Natural Language Understanding and Models of Communication (NLC) |
|
|
Chair |
|
Naomi Inoue (ATR) |
Vice Chair |
|
Naoto Kato (NHK), Toshihiko Ito (Hokkaido Univ.) |
Secretary |
|
Kazuhide Yamamoto (Nagaoka Univ. of Tech.), Hiroshi Masuichi (Fuji Xerox) |
Assistant |
|
Kouji Murakami (Tokyo Inst. of Tech.), Koichi Takeuchi (Okayama Univ.) |
|
Conference Date |
Tue, Dec 9, 2008 10:00 - 18:10
Wed, Dec 10, 2008 09:30 - 18:00 |
Topics |
|
Conference Place |
Ono Memorial Hall, Waseda University (Waseda Campus) |
Address |
1-104, Totsuka-machi, Shinjuku-ku, Tokyo 169-8050, Japan |
Transportation Guide |
http://www.waseda.jp/eng/campus/nishiwaseda.html |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Tue, Dec 9 AM 10:00 - 12:05 |
(1) |
10:00-10:25 |
Two-channel input speech recognition using sparseness-based blind source separation NLC2008-24 SP2008-79 |
Kenta Nishiki, Yosuke Izumi (Univ. of Tokyo), Shinji Watanabe (NTT), Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama (Univ. of Tokyo) |
(2) |
10:25-10:50 |
Hands-free speech recognition system for robot NLC2008-25 SP2008-80 |
Kosuke Hosoya, Tetsuji Ogawa, Shinya Fujie, Daichi Watanabe, Yuhi Ichikawa, Hikaru Taniyama, Tetsunori Kobayashi (Waseda Univ.) |
(3) |
10:50-11:15 |
Noisy speech recognition using integrated method of statistical model-based voice activity detection and noise suppression NLC2008-26 SP2008-81 |
Masakiyo Fujimoto, Kentaro Ishizuka, Tomohiro Nakatani (NTT Corporation) |
(4) |
11:15-11:40 |
Music suppression method for single channel speech mixed with BGM using Bayesian networks NLC2008-27 SP2008-82 |
Hiroaki Itou, Takanori Nishino, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) |
(5) |
11:40-12:05 |
Speaker diarization of multi-party conversations based on audio and visual information integration NLC2008-28 SP2008-83 |
Kentaro Ishizuka, Shoko Araki, Kazuhiro Otsuka, Masakiyo Fujimoto, Tomohiro Nakatani (NTT) |
|
12:05-13:10 |
Lunch Break ( 65 min. ) |
Tue, Dec 9 PM 13:10 - 14:00 |
(6) |
13:10-14:00 |
[Invited Talk]
Cognitive competence required for spoken language performance and computational competence realized by spoken language engineering NLC2008-29 SP2008-84 |
Nobuaki Minematsu (Univ. of Tokyo) |
|
14:00-14:10 |
Break ( 10 min. ) |
Tue, Dec 9 PM 14:10 - 15:00 |
(7) |
14:10-14:35 |
Acoustic Model Training Technique for Speech Recognition using Style Estimation with Multiple-Regression HMM NLC2008-30 SP2008-85 |
Yusuke Ijima, Makoto Tachibana, Takashi Nose, Takao Kobayashi (Tokyo Tech) |
(8) |
14:35-15:00 |
Speech Feature Extraction Using Constrained Nonnegative Matrix Factorization NLC2008-31 SP2008-86 |
Hyunsin Park, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) |
|
15:00-15:10 |
Break ( 10 min. ) |
Tue, Dec 9 PM 15:10 - 16:25 |
(9) |
15:10-15:35 |
Evaluation of annealing schedule for PLSA language model adaptation NLC2008-32 SP2008-87 |
Masaharu Kato, Tetsuo Kosaka (Yamagata Univ.), Akinori Ito, Shozo Makino (Tohoku Univ.) |
(10) |
15:35-16:00 |
Speech Recognition by Topic Models with Continuous/Discontinuous Topic Changes NLC2008-33 SP2008-88 |
Atsushi Sako, Yasuo Ariki (Kobe Univ.), Tomoharu Iwata, Shinji Watanabe, Takaaki Hori (NTT) |
(11) |
16:00-16:25 |
User modeling for a satisfaction evaluation of a speech recognition system NLC2008-34 SP2008-89 |
Sunao Hara, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) |
|
16:25-16:40 |
Break ( 15 min. ) |
Tue, Dec 9 PM 16:40 - 18:10 |
|
- |
|
Wed, Dec 10 AM 09:30 - 11:10 |
(12) |
09:30-09:55 |
Segmentation of Spoken Language into unit of Utterance Fragment using Acoustics Features NLC2008-35 SP2008-90 |
Katsuyoshi Setoyama (Nara Institute of Science and Technology), Hideki Kashioka, Nick Campbell (Nara Institute of Science and Technology/National Institute of I) |
(13) |
09:55-10:20 |
Bayesian Context Clustering Using Cross Validation for HMM-Based Speech Synthesis NLC2008-36 SP2008-91 |
Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda (Nagoya Institute of Technology) |
(14) |
10:20-10:45 |
Simultaneous Transformation of Duration and Spectrum Using Statistical Models Including Time-Sequence Matching NLC2008-37 SP2008-92 |
Kaori Yutani, Yoshihiko Nankaku (Nagoya Institute of Technology), Tomoki Toda (Nara Institute of Science and Technology), Keiichi Tokuda (Nagoya Institute of Technology) |
(15) |
10:45-11:10 |
Aperiodicity extraction based on linear prediction and temporal axis warping using fundamental frequency information NLC2008-38 SP2008-93 |
Hideki Kawahara (Wakayama Univ.), Masanori Morise (Kwansei Univ.), Toru Takahashi (Kyoto Univ.), Hideki Banno (Meijo Univ.), Ryuichi Nisimura, Toshio Irino (Wakayama Univ.) |
|
11:10-11:20 |
Break ( 10 min. ) |
Wed, Dec 10 AM 11:20 - 12:35 |
(16) |
11:20-11:45 |
Mutually-Adaptive Generation of Utterances Based on Belief Shared by Human And Robots in Real World. NLC2008-39 SP2008-94 |
Shinya Nakamura (UEC/NICT), Naoto Iwahashi (NICT/ATR), Takayuki Nagai (The University of Electro-Communications) |
(17) |
11:45-12:10 |
Controlling thought-evoking dialogue using POMDP NLC2008-40 SP2008-95 |
Yasuhiro Minami, Minako Sawaki, Ryuichiro Higashinaka, Kohji Dohsaka (NTT) |
(18) |
12:10-12:35 |
Speech recognition system for spoken dialogue system NLC2008-41 SP2008-96 |
Toru Taniguchi, Shinya Fujie, Tetsunori Kobayashi (Waseda Univ.) |
|
12:35-13:40 |
Lunch Break ( 65 min. ) |
Wed, Dec 10 PM 13:40 - 14:30 |
(19) |
13:40-14:30 |
[Invited Talk]
A New Paradigm for Speech Application System Development NLC2008-42 SP2008-97 |
Tetsunori Kobayashi (Waseda Univ.) |
|
14:30-14:40 |
Break ( 10 min. ) |
Wed, Dec 10 PM 14:40 - 15:55 |
(20) |
14:40-15:05 |
Progress Report of SLP Spoken Document Processing Working Group NLC2008-43 SP2008-98 |
Tomoyoshi Akiba (Toyohashi Univ. of Tech.), Kiyoaki Aikawa (Tokyo Univ. of Tech.), Yoshiaki Itoh (Iwate Prefectural Univ.), Tatsuya Kawahara (Kyoto Univ.), Hiroaki Nanjo (Ryukoku Univ.), Hiromitsu Nishizaki (Univ. of Yamanashi), Norihito Yasuda (NTT), Yoichi Yamashita (Ritsumeikan Univ.), Tomoko Matsui (The Institute of Statistical Mathematics), Xinhui Hu (NICT/ATR), Seiichi Nakagawa (Toyohashi Univ. of Tech.), Katunobu Itou (Hosei Univ.) |
(21) |
15:05-15:30 |
An automatic transcription system for creation of meeting records in the Japanese Congress NLC2008-44 SP2008-99 |
Yuya Akita, Masato Mimura, Tatsuya Kawahara (Kyoto Univ.) |
(22) |
15:30-15:55 |
Effect of punctuation marks for speech translation unit boundary detection NLC2008-45 SP2008-100 |
Tohru Shimizu (NICT/ATR), Satoshi Nakamura (National Institute of Information and Communication), Tatsuya Kawahara (Kyoto University) |
|
15:55-16:10 |
Break ( 15 min. ) |
Wed, Dec 10 PM 16:10 - 18:00 |
(23) |
16:10-18:00 |
Characteristics of pitch accents in infant-directed speech
-- An analysis of Riken Japanese Mother-Infant Conversation Corpus -- NLC2008-46 SP2008-101 |
Mafuyu Kitahara (Waseda Univ.), Ken'ya Nishikawa (RIKEN/Keio Univ.), Yosuke Igarashi (NIJL/RIKEN), Takahito Shinya (Sophia Univ./RIKEN), Reiko Mazuka (RIKEN/Duke Univ.) |
(24) |
16:10-18:00 |
The effect of associated conditions on the received emotional information transferred by sound effects NLC2008-47 SP2008-102 |
Mari Sato, Kiyoaki Aikawa (Tokyo Univ. of Technology) |
(25) |
16:10-18:00 |
Physical Model of the Vocal Tract with Flexible Velum NLC2008-48 SP2008-103 |
Takayuki Arai, Kimi Tanaka (Sophia Univ.), Ryuta Kataoka (Showa Univ.) |
(26) |
16:10-18:00 |
Articulatory feature extraction based on 3-stage MLNs and Inhibition/Enhancement Network NLC2008-49 SP2008-104 |
Mohammad Nurul Huda, Hiroaki Kawashima, Tsuneo Nitta (Toyohashi Univ. of Tech.) |
(27) |
16:10-18:00 |
Parameter optimization for a fundamental frequency extractor based on TANDEM-STRAIGHT NLC2008-50 SP2008-105 |
Hanae Itagaki, Masanori Morise, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara (Wakayama Univ.) |
(28) |
16:10-18:00 |
Study on Spectro-Temporal Features Based on Gradient Histograms NLC2008-51 SP2008-106 |
Takashi Muroi, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) |
(29) |
16:10-18:00 |
Automatic Speech Character Identification using Vocal Tract information NLC2008-52 SP2008-107 |
Yusuke Watanabe, Naoki Matsumoto (Meiji Univ.) |
(30) |
16:10-18:00 |
Evaluation of speaker identification/verification method using phase information NLC2008-53 SP2008-108 |
Longbiao Wang (Shizuoka Univ.), Kazue Minami, Kazumasa Yamamoto, Seiichi Nakagawa (Toyohashi Univ. of Tech.) |
(31) |
16:10-18:00 |
Dialect-based speaker classification of Chinese using acoustic features invariant with extra-linguistic factors NLC2008-54 SP2008-109 |
XueBin Ma, Nobuaki Minematsu, Yu Qiao, Keikichi Hirose (Univ. of Tokyo), Akira Nemoto (Nankai Univ.), Feng Shi (Nankai Univ.) |
(32) |
16:10-18:00 |
Speaker Recognition Based on Gaussian Mixture Models Using Variational Bayesian Method NLC2008-55 SP2008-110 |
Tatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nitech) |
(33) |
16:10-18:00 |
Sudden noise reduction using dynamic speech feature model NLC2008-56 SP2008-111 |
Nobuyuki Miyake, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) |
(34) |
16:10-18:00 |
Speech period detection using Hough transform of distance matrix images NLC2008-57 SP2008-112 |
Hiroyuki Nishi, Yoshimasa Kimura, Nguyen Van Don (Sojo Univ.) |
(35) |
16:10-18:00 |
Isolated word recognition based on speech structures and discriminant analysis NLC2008-58 SP2008-113 |
Satoshi Asakawa, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) |
(36) |
16:10-18:00 |
Speech recognition using localized affine invariant features NLC2008-59 SP2008-114 |
Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose (Univ. of Tokyo) |
(37) |
16:10-18:00 |
Tying covariance parameters for HMM-based speech synthesis NLC2008-60 SP2008-115 |
Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Institute of Technology) |
(38) |
16:10-18:00 |
Speech Recognition Based on Statistical Models Including Multiple Decision Trees NLC2008-61 SP2008-116 |
Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (Nagoya Institute of Technology) |
(39) |
16:10-18:00 |
Recording system for controlling speaking rate (ReCoK5) and public domain speech database with speaking rate variations (SRV-DB) NLC2008-62 SP2008-117 |
Kota Takahashi, Keigo Tsutaki, Toru Yoshihara (The University of Electro-Communications) |
(40) |
16:10-18:00 |
Speaking rate estimation and utterance analysis of fast speech for high-speed reproduction
-- A practical example of speech database with speaking rate variations -- NLC2008-63 SP2008-118 |
Toru Yoshihara, Keigo Tsutaki, Kota Takahashi (The University of Electro-Communications) |
(41) |
16:10-18:00 |
All directional Fatigue Detection Using Noise Ratio at Vocal Cords Level and Spectrum Q
-- Considering Working Efficiency and Management for Crisis of a Speaker -- NLC2008-64 SP2008-119 |
Kazuhide Okada (Toyota) |
(42) |
16:10-18:00 |
Driver's irritation detection using speech recognition results NLC2008-65 SP2008-120 |
Lucas Malta, Chiyomi Miyajima, Akira Ozaki, Norihide Kitaoka, Kazuya Takeda (Nagoya Univ.) |
(43) |
16:10-18:00 |
Language Model Adaptation by Topic Model Based on Sequence of Words NLC2008-66 SP2008-121 |
Atsushi Sako, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) |
(44) |
16:10-18:00 |
Discriminative Rescoring Based on Minimization of Word Errors for Speech Recognition NLC2008-67 SP2008-122 |
Akio Kobayashi, Takahiro Oku, Shinichi Homma, Shoei Sato, Toru Imai, Tohru Takagi (NHK) |
(45) |
16:10-18:00 |
Verification of Speech Recognition Results Based on the Utterance Classification Using Conditional Random Fields NLC2008-68 SP2008-123 |
Kenko Ota, Terumasa Ehara (TUS, Suwa) |
(46) |
16:10-18:00 |
Estimation of Spoken Dialog System using Automatically-generated question-and-answer database NLC2008-69 SP2008-124 |
Takahiro Morimoto, Masashi Ito (Tohoku Univ.), Motoyuki Suzuki (The Univ. of Tokushima), Akinori Ito, Shozo Makino (Tohoku Univ.) |
(47) |
16:10-18:00 |
Building a Question-Answer System based on RIME-TK, a Toolkit for Dialogue and Behavior Controller of Robots and Agents NLC2008-70 SP2008-125 |
Hiromi Narimatsu (Tsuda College), Mikio Nakano (Honda Research Institute Japan Co., Ltd.), Kotaro Funakoshi, Yuji Hasegawa, Hiroshi Tsujino (Tsuda College) |
Contact Address and Latest Schedule Information |
SP |
Technical Committee on Speech (SP)
|
Contact Address |
|
NLC |
Technical Committee on Natural Language Understanding and Models of Communication (NLC)
|
Contact Address |
|
Last modified: 2008-12-05 16:43:25
|
|