Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
WIT, IPSJ-AAC |
2022-03-08 10:55 |
Online |
Online |
A study on high-intelligibility speech synthesis of dysarthric speakers using voice conversion from normal speech and multi-speaker vocoder Tetsuro Takano (HTS), Takashi Nose, Aoi Kanagaki (Tohoku Univ.), Satoshi Watanabe (HTS) WIT2021-46 |
In this study, we investigated the possibility of generating intelligible synthetic speech by converting the voice of a ... |
WIT2021-46 pp.18-23 |
WIT |
2020-06-12 13:30 |
Online |
Online |
Improving the pronounce clarity of dysarthric speech using CycleGAN Shuhei Imai, Takashi Nose, Aoi Kanagaki (Tohoku Univ.), Satoshi Watanabe (HTS), Akinori Ito (Tohoku Univ.) WIT2020-1 |
Several voice conversion systems have been developed that convert dysarthric speech into healthy speech. The convent... |
WIT2020-1 pp.1-6 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) |
2019-12-06 13:55 |
Tokyo |
NHK Science & Technology Research Labs. |
[Poster Presentation]
Analysis and Subjective Labeling for Emotional Speech Database JTES Mai Yamanaka, Takashi Nose, Yuya Chiba, Akinori Ito (Tohoku Univ.) SP2019-39 |
We have constructed JTES, a prosodically balanced emotional speech database containing 50 sentences of 4 emotions of 50 men ... |
SP2019-39 pp.61-66 |
SP, IPSJ-SLP (Joint) |
2017-07-27 14:30 |
Miyagi |
Akiu Resort Hotel Crescent |
[Invited Talk]
Synthesis, Recognition and Conversion of Various Speech Using Deep Learning and Their Applications Takashi Nose (Tohoku Univ.) SP2017-16 |
This paper focuses on the synthesis, recognition, and conversion of various kinds of speech in speech processing using deep learni... |
SP2017-16 pp.3-8 |
SP |
2017-01-21 11:00 |
Tokyo |
The University of Tokyo |
[Poster Presentation]
A Study on Singer-Independent Singing Voice Conversion Using Read Speech Based on Neural Network Harunori Koike, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-67 |
A problem with the conventional method is that it requires the speech of the source speaker for training. We proposed a me... |
SP2016-67 pp.17-22 |
SP |
2017-01-21 16:10 |
Tokyo |
The University of Tokyo |
A study on DNN-based speech synthesis using vector quantization of spectral features Takashi Nose, Suzunosuke Ito (Tohoku Univ.) SP2016-75 |
|
SP2016-75 pp.65-70 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Improvement of accent sandhi rules based on accent dictionary for Japanese text-to-speech systems Hiroto Aoyama, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-54 |
In order to synthesize more natural speech in Japanese text-to-speech systems, we improved accent sandhi rules. Conventi... |
SP2016-54 pp.31-36 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
F0 control by modeling differential features in DNN-based speech synthesis Shuhei Yamada, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2016-55 |
We have been developing "tailor-made speech synthesis," a framework that enables users to modify synthetic speech nat... |
SP2016-55 pp.37-42 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Development of the Julius-compatible interface for the speech recognition engine of Kaldi toolkit Yusuke Yamada, Takashi Nose, Yuya Chiba, Akinori Ito (Tohoku Univ.) SP2016-57 |
|
SP2016-57 pp.49-51 |
ITE-ME, IE, EMM, LOIS, IEE-CMN |
2016-09-16 15:30 |
Aichi |
Aichi Prefectural University |
A Study on Colorization in Photo-Realistic Facial Animation Synthesis from Text Based on HMM and DNN with Animation Unit Kazuki Sato, Takashi Nose, Akinori Ito (Tohoku Univ.) LOIS2016-27 IE2016-64 EMM2016-53 |
We propose a technique for synthesizing photo-realistic facial animation from text based on a hidden Markov model (HMM) ... |
LOIS2016-27 IE2016-64 EMM2016-53 pp.67-72 |
IT, EMM |
2016-05-19 15:10 |
Hokkaido |
Otaru Economic Center |
Study of Photo-realistic Face Moving Image Generation from the Text Using the Facial Feature Kazuki Sato, Takashi Nose, Akinori Ito (Tohoku Univ.) IT2016-8 EMM2016-8 |
In this paper, we propose a face moving image synthesis technique based on a hidden Markov model (HMM) using the facial feat... |
IT2016-8 EMM2016-8 pp.43-48 |
EA, EMM |
2015-11-12 15:15 |
Kumamoto |
Kumamoto Univ. |
Facial image conversion based on transformation of Animation Units using DNN Yuuki Saito, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Institute of Technology), Akinori Ito (Tohoku Univ.) EA2015-28 EMM2015-49 |
|
EA2015-28 EMM2015-49 pp.23-28 |
SP |
2015-10-15 13:50 |
Hyogo |
Kobe Univ. |
A Study on Speaker-Independent Voice Conversion Using Spectral Differential Filter Based on Neural Network Harunori Koike, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Tech), Akinori Ito (Tohoku Univ.) SP2015-61 |
In this paper, we propose a novel technique for making the speech individuality of an arbitrary source (input) speaker. ... |
SP2015-61 pp.13-18 |
SP |
2015-10-15 16:45 |
Hyogo |
Kobe Univ. |
A study on quick model training in HMM-based speech synthesis Shuhei Yamada, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2015-64 |
In this paper, we propose an alternative model training technique using speaker-independent monophone models and decisio... |
SP2015-64 pp.27-32 |
SP |
2015-10-15 17:10 |
Hyogo |
Kobe Univ. |
Design and evaluation of prosodically balanced emotion-dependent sentence set based on entropy Emika Takeishi, Takashi Nose, Taketo Kase, Akinori Ito (Tohoku Univ.) SP2015-65 |
We designed an emotional speech database that can be used for emotion recognition as well as recognition and synthesis of... |
SP2015-65 pp.33-38 |
SP |
2015-08-21 10:25 |
Iwate |
Iwate Prefectural Univ. |
Automatic generation of abbreviated named entities for localized speech recognition Kenta Shiga, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2015-51 |
|
SP2015-51 pp.7-12 |
SP |
2015-08-21 15:50 |
Iwate |
Iwate Prefectural Univ. |
Performance Evaluation of Large-Scale Training Sentence Set Construction Based on Entropy in Statistical Speech Synthesis Takashi Nose (Tohoku Univ.), Yusuke Arao (DNP), Takao Kobayashi (Tokyo Tech), Komei Sugiura, Yoshinori Shiga (NICT) SP2015-57 |
This paper reports the evaluation results of training sentence set construction based on entropy that we previously prop... |
SP2015-57 pp.39-44 |
EMM, IT |
2015-05-21 15:20 |
Kyoto |
Kyoto International Community House |
A study on speaker conversion using speech and expression features for video chatting Yuuki Saito, Takashi Nose (Tohoku Univ.), Takahiro Shinozaki (Tokyo Institute of Technology), Akinori Ito (Tohoku Univ.) IT2015-9 EMM2015-9 |
In this paper, we propose two methods that convert the facial individuality of the original speaker to that of the target speak... |
IT2015-9 EMM2015-9 pp.45-50 |
SP |
2014-11-13 16:30 |
Fukuoka |
Kyushu Univ. Chikushi Campus |
A study on intuitive control of emotional expressions and speaking styles using facial features by Kinect Yu Bi, Takashi Nose, Akinori Ito (Tohoku Univ.) SP2014-94 |
This paper proposes a style control technique of synthetic speech based on multiple regression HSMM (MRHSMM) using facia... |
SP2014-94 pp.25-30 |
SP |
2014-01-23 16:30 |
Aichi |
Meijo Univ. |
A study on hyperparameter optimization for speech synthesis based on Gaussian process regression Tomoki Koriyama (Tokyo Inst. of Tech.), Takashi Nose (Tohoku Univ.), Takao Kobayashi (Tokyo Inst. of Tech.) SP2013-99 |
|
SP2013-99 pp.19-24 |