Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
ET |
2023-01-20 13:20 |
Hyogo |
Hyogo College of Medicine and Online (Primary: On-site, Secondary: Online) |
Japanese Spoken Language Learning Support System for Foreigners Wenbo Zhao, Tsuneo Kuwabara (KU) ET2022-56 |
(To be available after the conference date) [more] |
ET2022-56 pp.1-4 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2022-12-01 14:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
A Japanese Automatic Speech Recognition System on the Next-Gen Kaldi Framework Wen Shen Teo, Yasuhiro Minami (UEC) NLC2022-16 SP2022-36 |
2021 saw the introduction of the cutting-edge successor to the Kaldi speech processing toolkit, known as Next-Gen Kaldi.... [more] |
NLC2022-16 SP2022-36 pp.39-44 |
SP, EA, SIP |
2020-03-02 13:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi (UTokyo), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UTokyo) EA2019-112 SIP2019-114 SP2019-61 |
In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of... [more] |
EA2019-112 SIP2019-114 SP2019-61 pp.65-70 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
LARGE-CONTEXT POINTER-GENERATOR NETWORKS FOR SPOKEN-TO-WRITTEN STYLE CONVERSION Mana Ihori, Akihiko Takashima, Ryo Masumura (NTT) EA2019-142 SIP2019-144 SP2019-91 |
This paper introduces a spoken-to-written style conversion method that is suitable for handling a series of text such as... [more] |
EA2019-142 SIP2019-144 SP2019-91 pp.237-242 |
SP |
2014-11-13 13:45 |
Fukuoka |
Kyushu Univ. Chikushi Campus |
A comparative study of paralinguistic information control methods for HMM-based dialogue speech synthesis Hiroki Mori, Shunsuke Takahashi, Tomohiro Nagata (Utsunomiya Univ.) SP2014-90 |
Toward the realization of dialogue speech synthesis with capability to
control paralinguistic information,
we have pro... [more] |
SP2014-90 pp.1-6 |
TL |
2014-08-12 17:15 |
Tokyo |
The University of Tokyo (Komaba) 18 Bldg. Hall |
The Digital Linguistics: An Interdisciplinary Hypothesis for Neural Mechanism of Grammatical Demodulation
-- Syllabic Addition and Modification of Grammars are Translated into Meanings at Brainstem Auditory Nuclei -- Kimiaki Tokumaru TL2014-26 |
To date, the origin and the mechanism of human language has never been clarified, because it is an interdisciplinary and... [more] |
TL2014-26 pp.83-88 |
SP |
2013-01-31 13:00 |
Kyoto |
Doshisha Univ. |
[Invited Talk]
Speaker and style diversification in statistical parametric speech synthesis Takashi Nose (Tokyo Inst. of Tech.) SP2012-109 |
This paper reviews representative techniques for adding and modifying various
speaker characteristics and style express... [more] |
SP2012-109 pp.67-72 |
SP, NLC, IPSJ-SLP [detail] |
2011-12-20 16:10 |
Tokyo |
|
On the use of prosodic-event-based HMM in F0 generation of conversational speech Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) NLC2011-53 SP2011-98 |
In this paper, we propose prosodic-event-based HMM
for effectively modeling F0 pattern of spontaneous conversational sp... [more] |
NLC2011-53 SP2011-98 pp.185-190 |
EA, SIP, SP |
2011-05-13 13:00 |
Osaka |
Ritsumeikan Univ. |
Performance evaluation of contexts for conversational speech synthesis using Corpus of Spontaneous Japanese Tomoki Koriyama, Takashi Nose, Takao Kobayashi (Tokyo Tech) EA2011-27 SIP2011-27 SP2011-27 |
This paper proposes an extended context set for generating the prosodic variability of spontaneous speech in HMM-based c... [more] |
EA2011-27 SIP2011-27 SP2011-27 pp.155-160 |
PRMU |
2011-03-10 09:30 |
Ibaraki |
|
Written languages are analogously processed, while spoken languages are digitally processed.
-- An Error Correction Request for Shannon's Information Theory and Pavlov's 'Conditional Reflexes' -- Kimiaki Tokumaru (System Engineer) PRMU2010-241 |
The author came to a hypothesis that the unique and fundamental difference between humans and non-human animals (NHA) is... [more] |
PRMU2010-241 pp.25-30 |
SP |
2010-06-17 16:45 |
Fukuoka |
Kyushu University |
Analysis on features and estimators for speech-based age estimation Toshiya Wada, Takahiro Shinozaki, Sadaoki Furui (Tokyo Inst. of Tech.) SP2010-27 |
Age estimation systems can be implemented using either a discrete classifier such as
support vector machine (SVM) or... [more] |
SP2010-27 pp.31-36 |
SP |
2010-06-18 09:00 |
Fukuoka |
Kyushu University |
Lecture Speech Recognition Based on Word Graph Combination by using Quinphone HM-Net Masaharu Kato, Tetsuo Kosaka (Yamagata Univ.), Akinori Ito, Shozo Makino (Tohoku Univ.) SP2010-28 |
While high recognition performance has been achieved for read
speech, rather poor performance has been reported for spo... [more] |
SP2010-28 pp.37-42 |
TL |
2009-11-21 09:45 |
Tokyo |
|
[Invited Talk]
Why we can't live without honorifics
-- The Japanese language and the logic of "ba" -- Sachiko Ide (Japan Women's Univ.) TL2009-29 |
The Japanese speech should be interpreted ontologically rather than epistemologically. It is obligatory for Japanese spe... [more] |
TL2009-29 pp.1-5 |
SP |
2009-07-17 13:25 |
Fukushima |
|
Improvement of lecture speech recognition by using speaker-class models Takashi Ito, Yohei Okuyama, Masaharu Kato, Tetsuo Kosaka (Yamagata Univ.), Masaki Kohda (professor emeritus at Yamagata Univ.) SP2009-42 |
This paper describes a new method based on speaker-class (SC) models in order to improve the performance of lecture spee... [more] |
SP2009-42 pp.7-12 |
SP, NLC |
2008-12-09 15:10 |
Tokyo |
Waseda Univ. |
Evaluation of annealing schadule for PLSA language model adaptaion Masaharu Kato, Tetsuo Kosaka (Yamagata Univ.), Akinori Ito, Shozo Makino (Tohoku Univ.) NLC2008-32 SP2008-87 |
Probabilistic Latent Semantic Analysis (PLSA)
is a powerful statistical language model.
However the PLSA has the loca... [more] |
NLC2008-32 SP2008-87 pp.49-53 |
SP, NLC |
2008-12-10 09:30 |
Tokyo |
Waseda Univ. |
Segmentation of Spoken Language into unit of Utterance Fragment using Acoustics Features Katsuyoshi Setoyama (Nara Institute of Science and Technology), Hideki Kashioka, Nick Campbell (Nara Institute of Science and Technology/National Institute of I) NLC2008-35 SP2008-90 |
It is common for speech synthesis technology to process each sentence as one single and independent unit. However, in hu... [more] |
NLC2008-35 SP2008-90 pp.67-72 |
SP, NLC |
2008-12-10 15:05 |
Tokyo |
Waseda Univ. |
An automatic transcription system for creation of meeting records in the Japanese Congress Yuya Akita, Masato Mimura, Tatsuya Kawahara (Kyoto Univ.) NLC2008-44 SP2008-99 |
We have been developping an automatic speech recognition (ASR) system for creation of meeting records of the National Co... [more] |
NLC2008-44 SP2008-99 pp.121-126 |
SP |
2008-07-17 - 2008-07-19 |
Iwate |
Iwate Prefectural Univ. |
Spontaneous Speech Recognition using Speaker-Class Model and Speaker Adaptation Tetsuo Kosaka (Yamagata Univ.) SP2008-47 |
This paper describes a new method based on speaker-class models in order to improve the performance of an unsupervised b... [more] |
SP2008-47 pp.25-30 |
SP |
2007-06-29 13:00 |
Fukushima |
The University of Aizu |
Improvement of unsupervised adaptation in lecture speech recognition Takashi Kusama, Yohei Okuyama, Masaharu Katoh, Tetsuo Kosaka, Masaki Kohda (Yamagata Univ.) SP2007-20 |
In recent years, many aggressive approaches for large vocabulary continuous speech recognition (LVCSR) system trained on... [more] |
SP2007-20 pp.73-78 |
SP |
2007-05-31 11:00 |
Kyoto |
ATR |
A study on models for HMM-based spontaneous speech synthesis Tatsuya Akagawa, Koji Iwano, Sadaoki Furui (Tokyo Tech) SP2007-3 |
This paper describes construction of an HMM-based spontaneous speech
synthesizer and investigates effectiveness of the ... [more] |
SP2007-3 pp.13-18 |