IEICE Technical Committee Submission System
All Technical Committee Conferences  (All Years)

Search Results: Conference Papers
 Conference Papers (Available on Advance Programs)  (Sort by: Date Descending)
 Results 1 - 20 of 124
Committee Date Time Place Paper Title / Authors Abstract Paper #
SP, IPSJ-MUS, IPSJ-SLP [detail] 2022-06-17
13:00
Online Online SP2022-6 Rank-constrained spatial covariance matrix estimation (RCSCME) is a method for blind speech extraction. In RCSCME, we de... [more] SP2022-6
pp.18-23
EA 2022-05-13
12:45
Online Online Directionally-weighted region-to-region kernel interpolation of acoustic transfer function
Juliano G. C. Ribeiro, Shoichi Koyama, Hiroshi Saruwatari (UTokyo) EA2022-4
An interpolation method for the acoustic transfer function (ATF) for variable source and receiver points within regions ... [more] EA2022-4
pp.18-19
EA, SIP, SP, IPSJ-SLP [detail] 2022-03-01
12:20
Okinawa
(Primary: On-site, Secondary: Online)
Training Algorithm for Multispeaker Text-To-Speech Synthesis Considering Adversarial Regularizer
Yusuke Nakai, Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari (UTokyo) EA2021-72 SIP2021-99 SP2021-57
(To be available after the conference date) [more] EA2021-72 SIP2021-99 SP2021-57
pp.50-55
EA, SIP, SP, IPSJ-SLP [detail] 2022-03-02
10:45
Okinawa
(Primary: On-site, Secondary: Online)
Evaluation of sentence-level generation in Japanese dialect speech synthesis using accent latent variables
Kazuya Yufune, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2021-79 SIP2021-106 SP2021-64
Japanese dialect speech synthesis is useful for personalized speech synthesis systems. However, inability to prepare acc... [more] EA2021-79 SIP2021-106 SP2021-64
pp.96-101
EA, SIP, SP, IPSJ-SLP [detail] 2022-03-02
13:25
Okinawa
(Primary: On-site, Secondary: Online)
[Poster Presentation] Filtered-X LMS algorithm based on individual interpolation of primary and secondary sound fields for spatial active noise control
Kazuyuki Arikawa, Shoichi Koyama, Hiroshi Saruwatari (The Univ. of Tokyo) EA2021-84 SIP2021-111 SP2021-69
Spatial active noise control (ANC), which aims to reduce noise over a three-dimensional target region, has attracted a... [more] EA2021-84 SIP2021-111 SP2021-69
pp.126-131
EA, SIP, SP, IPSJ-SLP [detail] 2022-03-02
13:25
Okinawa
(Primary: On-site, Secondary: Online)
[Poster Presentation] Sound Field Estimation from Small Number of Observations by Deep Learning with Difference-Approximation-Based Helmholtz-Equation Loss Function
Kazuhide Shigemi, Shoichi Koyama, Tomohiko Nakamura, Hiroshi Saruwatari (UTokyo) EA2021-85 SIP2021-112 SP2021-70
We propose a single-frequency sound field estimation method from a small number of observations that uses a loss functio... [more] EA2021-85 SIP2021-112 SP2021-70
pp.132-139
EA, SIP, SP, IPSJ-SLP [detail] 2022-03-02
15:35
Okinawa
(Primary: On-site, Secondary: Online)
[Poster Presentation] Interpolation of head-related transfer function from small amount of observation data using deep learning based on spherical wavefunction expansion
Yuki Ito, Tomohiko Nakamura, Shoichi Koyama, Hiroshi Saruwatari (UTokyo) EA2021-90 SIP2021-117 SP2021-75
In binaural synthesis, listeners' individual head-related transfer functions (HRTFs) are necessary for highly-immersive ... [more] EA2021-90 SIP2021-117 SP2021-75
pp.163-170
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] 2021-12-03
11:00
Online Online Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE
Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... [more] NLC2021-26 SP2021-47
pp.42-47
SP, WIT, IPSJ-SLP, ASJ-H [detail] 2021-10-19
15:10
Online Online Speaker adaptation of speech synthesis using human perceptual evaluation feedback
Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari (UT) SP2021-33 WIT2021-26
 [more] SP2021-33 WIT2021-26
pp.46-51
EA, US, SP, SIP, IPSJ-SLP [detail] 2021-03-03
14:05
Online Online [Poster Presentation] End-to-end incremental TTS with lookahead generation with large pretrained language model
Takaaki Saeki, Shinnosuke Takamichi, Hiroshi Saruwatari (UTokyo) EA2020-74 SIP2020-105 SP2020-39
(To be available after the conference date) [more] EA2020-74 SIP2020-105 SP2020-39
pp.85-90
SP, IPSJ-MUS, IPSJ-SLP 2020-06-07
15:45
Online Online HumanGAN: generative adversarial network with human-based discriminator and its naturalness evaluation in synthesized voice
Kazuki Fujii (NITTC), Yuki Saito, Shinnosuke Takamichi (UTokyo), Yukino Baba (UTsukuba), Hiroshi Saruwatari (UTokyo) SP2020-6
 [more] SP2020-6
pp.15-20
SP, EA, SIP 2020-03-02
10:10
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
Multichannel NMF with Joint-Diagonalizable Constraint Based on Generalized Gaussian Distribution for Blind Source Separation
Keigo Kamo, Yuki Kubo, Norihiro Takamune (UTokyo), Daichi Kitamura (NIT Kagawa), Hiroshi Saruwatari (UTokyo), Yu Takahashi, Kazunobu Kondo (Yamaha) EA2019-103 SIP2019-105 SP2019-52
Multichannel nonnegative matrix factorization (MNMF) is a blind source separation technique, which employs the full-rank... [more] EA2019-103 SIP2019-105 SP2019-52
pp.13-19
SP, EA, SIP 2020-03-02
13:00
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
The Effectiveness of Additional Context in DNN-based Spontaneous Speech Synthesis
Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi (UTokyo), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UTokyo) EA2019-112 SIP2019-114 SP2019-61
In DNN-based speech synthesis, contexts, which are input features of DNN, can be used not only for the representation of... [more] EA2019-112 SIP2019-114 SP2019-61
pp.65-70
SP, EA, SIP 2020-03-02
13:00
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
[Poster Presentation] Sensor placement allowing independent setting of estimation and candidate regions for field estimation based on Gaussian process
Tomoya Nishida, Natsuki Ueno, Shoichi Koyama, Hiroshi Saruwatari (Univ Tokyo) EA2019-125 SIP2019-127 SP2019-74
 [more] EA2019-125 SIP2019-127 SP2019-74
pp.141-146
SP, EA, SIP 2020-03-02
13:00
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
[Poster Presentation] Restoration of clipped signal using oversampling based on differentiable and convex loss function
Natsuki Ueno, Shoichi Koyama, Hiroshi Saruwatari (Univ. Tokyo) EA2019-126 SIP2019-128 SP2019-75
A signal reconstruction method of clipped time-continuous signal using oversampling is proposed. The signal reconstructi... [more] EA2019-126 SIP2019-128 SP2019-75
pp.147-152
SP, EA, SIP 2020-03-03
09:00
Okinawa Okinawa Industry Support Center
(Cancelled but technical report was issued)
[Poster Presentation] Time-domain audio source separation using multiresolution deep layered analysis based on simultaneous learning of neural networks and wavelet basis functions
Shihori Kozuka, Tomohiko Nakamura, Hiroshi Saruwatari (UTokyo) EA2019-149 SIP2019-151 SP2019-98
Wave-U-Net is one of state-of-the-art deep neural networks (DNNs) for time-domain audio source separation (TDASS). Howe... [more] EA2019-149 SIP2019-151 SP2019-98
pp.279-284
SP 2020-01-29
11:30
Toyama   Application of Deep Gaussian Process to Multi-Speaker Text-to-Speech Synthesis using Speaker Codes
Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari (UTokyo) SP2019-49
Speaker codes are widely used to achieve multi-speaker text-to-speech synthesis. Conventionally, Deep Neural Network (D... [more] SP2019-49
pp.31-36
EA, US (Joint) 2020-01-22
14:00
Kyoto Doshisha Univ. [Poster Presentation] Region-to-region acoustic transfer function estimation with distributed sources and receivers based on kernel interpolation
Juliano G. C. Ribeiro, Natsuki Ueno, Shoichi Koyama, Hiroshi Saruwatari (Univ. Tokyo) EA2019-98
 [more] EA2019-98
pp.83-88
EA 2019-12-13
13:25
Fukuoka Kyushu Inst. Tech. Rank-constrained spatial covariance matrix estimation based on multivariate complex generalized Gaussian distribution and its acceleration for blind speech extraction
Yuki Kubo, Norihiro Takamune (UTokyo), Daichi Kitamura (NIT, Kagawa), Hiroshi Saruwatari (UTokyo) EA2019-78
In this paper, we generalize a generative model in rank-constrained spatial covariance matrix estimation that separates ... [more] EA2019-78
pp.85-92
EA, EMM 2019-11-22
15:30
Ishikawa Kanazawa Institute of Technology EA2019-60 EMM2019-88 In this paper, we propose a time-domain audio source separation method using down-sampling and up-sampling layers based ... [more] EA2019-60 EMM2019-88
pp.41-48
Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)




The Institute of Electronics, Information and Communication Engineers (IEICE), Japan