IEICE Technical Committee Submission System
Conference Schedule
Online Proceedings
[Sign in]
Tech. Rep. Archives
    [Japanese] / [English] 
( Committee/Place/Topics  ) --Press->
 
( Paper Keywords:  /  Column:Title Auth. Affi. Abst. Keyword ) --Press->

All Technical Committee Conferences  (All Years)

Search Results: Conference Papers
 Conference Papers (Available on Advance Programs)  (Sort by: Date Descending)
 Results 1 - 20 of 21  /  [Next]  
Committee Date Time Place Paper Title / Authors Abstract Paper #
EA, SIP, SP, IPSJ-SLP [detail] 2022-03-02
10:20
Okinawa
(Primary: On-site, Secondary: Online)
A Study on Hybrid RNN-T/Attention-based Streaming ASR with Triggered Chunkwise Attention and Dual Internal Language Model Integration
Takafumi Moriya, Takanori Ashihara, Atsushi Ando, Hiroshi Sato, Tomohiro Tanaka, Kohei Matsuura, Ryo Masumura, Marc Delcroix (NTT), Takahiro Shinozaki (Tokyo Tech) EA2021-78 SIP2021-105 SP2021-63
In this paper we propose improvements to our recently proposed hybrid RNN-T/Attention architecture that includes a share... [more] EA2021-78 SIP2021-105 SP2021-63
pp.90-95
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] 2021-12-03
11:00
Online Online Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE
Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... [more] NLC2021-26 SP2021-47
pp.42-47
NLC 2020-09-10
15:25
Online Online Unsupervised Domain Adaptation for Dialogue Sequence Labeling -- Application to Contact Center Tasks --
Shota Orihashi, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura (NTT) NLC2020-8
This paper presents an unsupervised domain adaptation for utterance-level sequence labeling of conversation in a contact... [more] NLC2020-8
pp.34-39
SP, EA, SIP 2020-03-02
13:00
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
Japanese dialect speech classification using sequence-to-one neural networks
Ryo Imaizumi (TMU), Ryo Masumura (NTT), Sayaka Shiota, Hitoshi Kiya (TMU) EA2019-108 SIP2019-110 SP2019-57
The language specific to a certain region is called a dialect, and the task of identifying which dialect the input speec... [more] EA2019-108 SIP2019-110 SP2019-57
pp.41-46
SP, EA, SIP 2020-03-02
13:00
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
[Poster Presentation] Neural Voice Activity Detection using Multiple Auxiliary Networks
Ryo Masumura, Kiyoaki Matsui, Yuma Koizumi, Takanobu Oba (NTT) EA2019-109 SIP2019-111 SP2019-58
 [more] EA2019-109 SIP2019-111 SP2019-58
pp.47-52
SP, EA, SIP 2020-03-02
13:00
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
Data augmentation for ASR system by using locally time-reversed speech -- Temporal inversion of feature sequence --
Takanori Ashihara, Tomohiro Tanaka, Takafumi Moriya, Ryo Masumura, Yusuke Shinohara, Makio Kashino (NTT) EA2019-110 SIP2019-112 SP2019-59
Data augmentation is one of the techniques to mitigate overfitting and improve robustness against several acoustic varia... [more] EA2019-110 SIP2019-112 SP2019-59
pp.53-58
SP, EA, SIP 2020-03-02
13:00
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis
Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi (UTokyo), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UTokyo) EA2019-112 SIP2019-114 SP2019-61
In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of... [more] EA2019-112 SIP2019-114 SP2019-61
pp.65-70
SP, EA, SIP 2020-03-02
15:45
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
Performance evaluation of distilling knowledge using encoder-decoder for CTC-based automatic speech recognition systems
Takafumi Moriya, Hiroshi Sato, Tomohiro Tanaka, Takanori Ashihara, Ryo Masumura, Yusuke Shinohara (NTT) EA2019-131 SIP2019-133 SP2019-80
We present a novel training approach for connectionist temporal classification (CTC) -based automatic speech recognition... [more] EA2019-131 SIP2019-133 SP2019-80
pp.175-180
SP, EA, SIP 2020-03-03
09:00
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
LARGE-CONTEXT POINTER-GENERATOR NETWORKS FOR SPOKEN-TO-WRITTEN STYLE CONVERSION
Mana Ihori, Akihiko Takashima, Ryo Masumura (NTT) EA2019-142 SIP2019-144 SP2019-91
This paper introduces a spoken-to-written style conversion method that is suitable for handling a series of text such as... [more] EA2019-142 SIP2019-144 SP2019-91
pp.237-242
SP 2019-08-28
17:00
Kyoto Kyoto Univ. Speech Emotion Classification based on Multi-Label Emotion Existence Estimation
Atsushi Ando, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono (NTT) SP2019-16
This paper presents a novel speech emotion classification that addresses the ambiguous nature of emotions in speech. Mos... [more] SP2019-16
pp.39-44
EA, SIP, SP 2019-03-15
10:25
Nagasaki i+Land nagasaki (Nagasaki-shi) Neural Language Models based on Conditional Hierarchical Recurrent Encoder-Decoder for Multi-Party Conversational Speech Recognition
Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Takanobu Oba, Yushi Aono (NTT) EA2018-131 SIP2018-137 SP2018-93
This paper presents fully neural network based language models (LMs) that can leverage long-range conversational context... [more] EA2018-131 SIP2018-137 SP2018-93
pp.191-196
EA, SIP, SP 2019-03-15
10:50
Nagasaki i+Land nagasaki (Nagasaki-shi) Likability Estimation Model Training of Call-center Agents Based on Annotators' Skills
Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono (NTT) EA2018-132 SIP2018-138 SP2018-94
This paper proposes a new technique for estimating the likability of call-center agents.
Most techniques of likability ... [more]
EA2018-132 SIP2018-138 SP2018-94
pp.197-202
NLC, IPSJ-IFAT 2019-02-07
14:45
Kyoto Ryukoku University Omiya Campus Call Scene Segmentation based on Neural Networks with Conversational Contexts
Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Hosana Kamiyama, Takanobu Oba, Yushi Aono (NTT) NLC2018-39
Call scene segmentation that automatically splits contact center dialogues into several call scenes is useful for constr... [more] NLC2018-39
pp.21-26
SP 2018-08-27
15:30
Kyoto Kyoto Univ. Neural Error Corrective Language Models with Multiple Hypotheses
Tomohiro Tanaka, Ryo Masumura, Yushi Aono (NTT) SP2018-29
 [more] SP2018-29
pp.31-36
SIP, EA, SP, MI
(Joint) [detail]
2018-03-19
13:00
Okinawa   [Poster Presentation] Quantitative and corpus-based analysis of pronunciation diversity observed in Japanese English
Suguru Kabashima, Haoyu Zhang, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo), Satoshi Kobashikawa, Ryo Masumura (NTT) EA2017-113 SIP2017-122 SP2017-96
In foreign language teaching, corrective feedback to learners' pronunciation is regarded
as highly important and automa... [more]
EA2017-113 SIP2017-122 SP2017-96
pp.69-74
SP, SIP, EA 2017-03-01
12:40
Okinawa Okinawa Industry Support Center [Poster Presentation] Prosodic Word Embeddings for DNN-based speech synthesis
Yusuke Ijima, Nobukatsu Hojo, Ryo Masumura, Taichi Asami (NTT) EA2016-109 SIP2016-164 SP2016-104
This paper proposed a novel word embeddings with prosodic information (prosodic word embeddings) for DNN-based speech sy... [more] EA2016-109 SIP2016-164 SP2016-104
pp.153-158
NLC, IPSJ-NL, SP, IPSJ-SLP
(Joint) [detail]
2015-12-03
15:55
Aichi Nagoya Inst of Tech. [Invited Lecture] EMNLP 2015 Report (2) -- In terms of Deep Learning --
Ryo Masumura (NTT) NLC2015-39
 [more] NLC2015-39
p.41
SP 2015-08-21
10:00
Iwate Iwate Prefectural Univ. Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi (NTT), Akinori Ito (Tohoku Universicty) SP2015-50
This paper proposes a novel language modeling approach called latent word recurrent neural network language model, which... [more] SP2015-50
pp.1-6
SP 2015-08-21
16:15
Iwate Iwate Prefectural Univ. Training Data Selection for Acoustic Modeling Based on Submodular Optimization of Joint KL Divergence
Taichi Asami, Ryo Masumura, Hirokazu Masataki, Manabu Okamoto, Sumitaka Sakauchi (NTT) SP2015-58
This paper provides a novel training data selection method to
construct acoustic models for automatic speech recogniti... [more]
SP2015-58
pp.45-50
SP, IPSJ-SLP
(Joint)
2015-07-17
09:00
Nagano Katakura Suwako Hotel Spoken Language Identification based on Language Modeling of Tandem-MLP Features
Ryo Masumura, Taichi Asami, Hirokazu Masataki, Sumitaka Sakauchi (NTT) SP2015-43
 [more] SP2015-43
pp.43-48
 Results 1 - 20 of 21  /  [Next]  
Choose a download format for default settings. [NEW !!]
Text format pLaTeX format CSV format BibTeX format
Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)


[Return to Top Page]

[Return to IEICE Web Page]


The Institute of Electronics, Information and Communication Engineers (IEICE), Japan