Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
HIP, HCS, HI-SIGCOASTER [detail] |
2024-05-13 13:20 |
Okinawa |
Okinawa Industry Support Center |
Strategies to encode non-speech sounds into language: A developmental study Kaede Hattori, Shoko Miyauchi, Kazuhide Hashiya (Kyushu Univ.) HCS2024-12 HIP2024-12 |
[more] |
HCS2024-12 HIP2024-12 pp.57-60 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 16:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Multiple Lag Window Pairs for Estimation of Fundamental Frequency and Periodicity Measure Michiki Koshimori (UEC), Shigeki Sagayama (UTokyo/UEC), Toru Nakashika (UEC) EA2023-75 SIP2023-122 SP2023-57 |
Extending the main concept of modified autocorrelation method in LPC, we investigate lag windows, lag window pairs, and ... [more] |
EA2023-75 SIP2023-122 SP2023-57 pp.85-90 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 15:25 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Investigation of objective intelligibility metrics based on speech foundation models for Clarity Prediction Challenge 2 Katsuhiko Yamamoto (CyberAgent) EA2023-119 SIP2023-166 SP2023-101 |
Speech Foundation Models (SFMs), which use components like the encoder layer of Whisper, have been suggested to separate... [more] |
EA2023-119 SIP2023-166 SP2023-101 pp.334-339 |
IMQ, IE, MVE, CQ (Joint) [detail] |
2023-03-16 14:00 |
Okinawa |
Okinawaken Seinenkaikan (Naha-shi) (Primary: On-site, Secondary: Online) |
Analyzing Attractiveness of Cooking Recipe Titles based on Parts of Speech Nanami Takagi (Nagoya Univ.), Haruya Kyutoku (Aichi Univ. of Technology), Keisuke Doman (Chukyo Univ.), Yasutomo Kawanishi (RIKEN), Takatsugu Hirayama (Univ. of Human Environments), Takahiro Komamizu, Ichiro Ide (Nagoya Univ.) IMQ2022-59 IE2022-136 MVE2022-89 |
Recently, occasions to use recipe websites are increasing, as is the number of users who publish their own cooking rec... [more] |
IMQ2022-59 IE2022-136 MVE2022-89 pp.192-197 |
PRMU |
2022-10-21 15:25 |
Tokyo |
Miraikan - The National Museum of Emerging Science and Innovation (Primary: On-site, Secondary: Online) |
Features and Deep Learning Models Suitable for Speech Source Discrimination Method in Plural Voice User Interfaces Environment Kengo Maeda, Takahiro Yoshida (TUS) PRMU2022-27 |
In a situation where plural devices equipped with a voice user interface exist in the user’s environment in the near... [more] |
PRMU2022-27 pp.29-34 |
SIS, ITE-BCT |
2022-10-13 14:15 |
Aomori |
Hachinohe Institute of Technology (Primary: On-site, Secondary: Online) |
Toward Improving Speech Naturalness Introducing a Capsule Structure for Speech Enhancement Networks Reito Kasuga, Tetsuya Shimamura, Yosuke Sugiura, Nozomiko Yasui (Saitama Univ.) SIS2022-12 |
Although the field of speech enhancement has been extensively studied around the world, phase tends to be neglected comp... [more] |
SIS2022-12 pp.7-12 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-19 15:00 |
Online |
Online |
Neural speech synthesis using local phrase dependency structure information Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura (NIST) SP2021-23 |
In order to synthesize Japanese speech with natural prosody, we introduce an end-to-end TTS with new prosodic symbol rep... [more] |
SP2021-23 pp.107-112 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 13:05 |
Online |
Online |
[Invited Talk]
* Masahito Togami (LINE) EA2020-64 SIP2020-95 SP2020-29 |
Recently, deep learning based speech source separation has evolved rapidly. A neural network (NN) is usually learne... [more] |
EA2020-64 SIP2020-95 SP2020-29 pp.27-32 |
MSS, NLP (Joint) |
2020-03-10 10:45 |
Aichi |
(Cancelled but technical report was issued) |
Analysis of language network for Japanese and English translations of the New Testament Kihei Magishi, Tomoko Matsumoto (TUS), Yutaka Shimada (SU), Tohru Ikeguchi (TUS) NLP2019-126 |
This paper investigated characteristics of Japanese and English language networks. We used the New Testament as a sampl... [more] |
NLP2019-126 pp.77-82 |
WIT, SP |
2019-10-26 14:45 |
Kagoshima |
Daiichi Institute of Technology |
An extension of the editor functions of Versatile Communication Support System, VCAN Hinako Sugiyama, Toyohiko Hayashi, Maiko Iriyama (Niigata Univ.), Nariyasu Fujikawa, Satoru Kawabe, Takashi Goto (Ginzado Ltd.) SP2019-20 WIT2019-19 |
Voice-output communication aid (VOCA) is one of the assistive devices supporting communication of children with speech d... [more] |
SP2019-20 WIT2019-19 pp.17-22 |
RCS, SAT (Joint) |
2019-08-23 09:30 |
Aichi |
Nagoya University |
Comparison of Speech Quality between Voice Communication System based on Communication Satellite Network and Satellite Mobile Phone Byeongyo Jeong, Ryouichi Nishimura, Hajime Susukita, Takashi Takahashi (NICT) SAT2019-32 |
Mobile networks become almost useless in large-scale disasters due to physical damage to mobile base stations, el... [more] |
SAT2019-32 pp.79-83 |
SP |
2019-01-26 14:30 |
Ishikawa |
Kanazawa-Harmonie |
[Invited Lecture]
Measurement method of speech listening performance under various listening environments based on speech and word intelligibility Shuichi Sakamoto (Tohoku Univ.) SP2018-52 |
The effectiveness in conveying a message is an important consideration when transmitting speech information via a commun... [more] |
SP2018-52 pp.5-8 |
EA, ASJ-H, EMM, IPSJ-MUS [detail] |
2018-11-22 10:00 |
Ishikawa |
Hotel Koshuen |
[Invited Talk]
Phase reconstruction for speech enhancement and its effect on array processing Yukoh Wakabayashi (TMU) EA2018-80 EMM2018-80 |
Phase spectrum processing for speech enhancement, so-called “phase reconstruction,” has particularly received att... [more] |
EA2018-80 EMM2018-80 pp.163-168 |
SP, IPSJ-SLP (Joint) |
2018-07-26 16:15 |
Shizuoka |
Sago-Royal-Hotel (Hamamatsu) |
Ladder Network Driven from Auditory Computational Model for Multi-talker Speech Separation Hiroshi Sekiguchi, Yoshiaki Narusue, Hiroyuki Morikawa (Univ. of Tokyo) SP2018-18 |
This paper introduces a ladder network implementation induced by an auditory computational model for multi-talker speech sepa... [more] |
SP2018-18 pp.9-13 |
PRMU, SP |
2018-06-29 11:00 |
Nagano |
|
Speaker adaptation in speech synthesis based on neural networks including temporal structure modeling Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (NIT) PRMU2018-31 SP2018-11 |
This paper proposes a speaker adaptation technique for speech synthesis based on deep neural networks (DNNs) using a str... [more] |
PRMU2018-31 SP2018-11 pp.53-58 |
WIT |
2018-06-09 16:00 |
Kanagawa |
|
Usage status analysis of web-based versatile communication support system VCAN Hinako Sugiyama, Toyohiko Hayashi, Mieko Iriyama (Niigata Univ.), Nariyasu Fujikawa, Satoru Kawabe, Takashi Goto (Ginzado Ltd.) WIT2018-5 |
Voice-output communication aid (VOCA) is one of the assistive devices supporting communication of children with speech d... [more] |
WIT2018-5 pp.21-26 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 14:20 |
Okinawa |
|
A Study on Structure of Deep Neural Network for Speech Enhancement Yosuke Sugiura, Tetsuya Shimamura (Saitama Univ.) EA2017-171 SIP2017-180 SP2017-154 |
In this paper, we study a structure of a deep neural network for speech enhancement. In speech enhancement, it is a big i... [more] |
EA2017-171 SIP2017-180 SP2017-154 pp.379-384 |
SP, ASJ-H |
2018-01-21 14:45 |
Tokyo |
The University of Tokyo |
An investigation of multi-speaker WaveNet vocoder Tomoki Hayashi, Kazuhiro Kobayashi, Akira Tamamori, Kazuya Takeda, Tomoki Toda (Nagoya Univ.) SP2017-81 |
In this paper, we investigate a multi-speaker WaveNet vocoder. In our previous work, we have demonstrated that our propo... [more] |
SP2017-81 pp.81-86 |
SIS |
2017-12-14 10:50 |
Tottori |
Tottori Prefectural Center for Lifelong Learning |
Harmonic Structure Detection in Speech Separation Using Modified DFT Pair Based on ASA Motohiro Ichikawa, Isao Nakanishi (Tottori Univ.) SIS2017-34 |
Humans have the ability, known as the cocktail party effect, to recognize the target voice from the various conversation... [more] |
SIS2017-34 pp.5-9 |
SITE, EMM, ISEC, ICSS, IPSJ-CSEC, IPSJ-SPT [detail] |
2017-07-15 13:25 |
Tokyo |
|
Investigation of spikegram-based signal representation for speech fingerprints Dung Kim Tran, Masashi Unoki (JAIST) ISEC2017-32 SITE2017-24 ICSS2017-31 EMM2017-35 |
This paper investigates the ability of spikegrams in representing the speech content and voice identifications of speech s... [more] |
ISEC2017-32 SITE2017-24 ICSS2017-31 EMM2017-35 pp.241-246 |