IEICE Technical Committee Submission System
Conference Schedule
Online Proceedings
[Sign in]
Tech. Rep. Archives
    [Japanese] / [English] 
( Committee/Place/Topics  ) --Press->
 
( Paper Keywords:  /  Column:Title Auth. Affi. Abst. Keyword ) --Press->

All Technical Committee Conferences  (Searched in: Recent 10 Years)

Search Results: Conference Papers
 Conference Papers (Available on Advance Programs)  (Sort by: Date Descending)
 Results 1 - 10 of 10  /   
Committee Date Time Place Paper Title / Authors Abstract Paper #
PRMU, IBISML, IPSJ-CVIM [detail] 2023-03-02
11:25
Hokkaido Future University Hakodate (Hokkaido, Online)
(Primary: On-site, Secondary: Online)
Binarization of Vision Transformer with Scaling Factors
Shun Sato, Shun Sawada, Hidefumi Ohmura, Kouichi katsurada (TUS) PRMU2022-83 IBISML2022-90
1bit neural network optimization is an optimization technique that achieves a significant increase in computational spee... [more] PRMU2022-83 IBISML2022-90
pp.134-139
SP, IPSJ-SLP, EA, SIP [detail] 2023-02-28
09:50
Okinawa (Okinawa, Online)
(Primary: On-site, Secondary: Online)
End-to-End Speech Synthesis Based on Articulatory Movements Captured by Real-time MRI
Yuto Otani, Shun Sawada, Hidefumi Ohmura, Kouichi Katsurada (Tokyo Univ. Sci.) EA2022-77 SIP2022-121 SP2022-41
We propose an end-to-end deep learning model for speech synthesis based on articulatory movements captured by real-time ... [more] EA2022-77 SIP2022-121 SP2022-41
pp.13-18
SP, WIT, IPSJ-SLP [detail] 2022-10-22
15:40
Kyoto Kyoto University (Kyoto, Online)
(Primary: On-site, Secondary: Online)
Conformer based early fusion model for audio-visual speech recognition
Nobukazu Aoki, Shun Sawada, Hidefumi Ohmura, Kouichi Katsurada (Tokyo Univ. of Sci.) SP2022-28 WIT2022-3
Previous studies of late fusion models with conformer encoders use independent encoders for both visual and audio inform... [more] SP2022-28 WIT2022-3
pp.8-13
EA, US, SP, SIP, IPSJ-SLP [detail] 2021-03-04
17:10
Online Online (Online) A Vocoder-free Any-to-Many Voice Conversion using Pre-trained vq-wav2vec
Takeshi Koshizuka, Hidefumi Ohmura, Kouichi Katsurada (TUS) EA2020-89 SIP2020-120 SP2020-54
Voice conversion (VC) is a technique that converts speaker-dependent non-linguistic information to another speaker's one... [more] EA2020-89 SIP2020-120 SP2020-54
pp.176-181
SP 2020-01-28
16:15
Toyama (Toyama) Multi-Speaker Speech Synthesis from EMA Data
Kouichi Katsurada (Tokyo Univ. Sci.), Korin Richmond (Univ. of Edinburgh) SP2019-45
 [more] SP2019-45
pp.7-12
WIT, SP 2019-10-27
09:00
Kagoshima Daiichi Institute of Technology (Kagoshima) Extraction of linguistic representation and syllable recognition from EEG signal of speech-imagery
Kentaro Fukai, Hidefumi Ohmura, Kouichi Katsurada (Tokyo Univ. of Science), Satoka Hirata, Yurie Iribe (Aichi Prefectural Univ.), Mingchua Fu, Ryo Taguchi (Nagoya Inst. of Technology), Tsuneo Nitta (Waseda Univ./Toyohashi Univ. of Technology) SP2019-28 WIT2019-27
Speech imagery recognition from Electroencephalogram (EEG) is one of the challenging technologies for non-invasive brain... [more] SP2019-28 WIT2019-27
pp.63-68
WIT, SP 2019-10-27
09:20
Kagoshima Daiichi Institute of Technology (Kagoshima) Word Recognition using word likelihood vector from speech-imagery EEG
Satoka Hirata, Yurie Iribe (Aichi Prefectual Univ.), Kentaro Fukai, Kouichi Katsurada (Tokyo Univ. of Science), Tsuneo Nitta (Waseda Univ./Toyohashi Univ. of Tech.) SP2019-29 WIT2019-28
Previous research suggests that humans manipulate the machine using their electroencephalogram called BCI (Brain Compute... [more] SP2019-29 WIT2019-28
pp.69-73
SP 2019-06-13
13:55
Kanagawa Tokyo Institute of Technology (Kanagawa) Collection of large-scale Japanese articulatory-acoustic parallel data
Kohei Wakamiya, Fumiaki Taguchi, Riko Watanabe (Kyushu Univ.), Kouichi Katsurada (Tokyo Univ. of Science), Takehiko Makino (Chuo Univ.), Tokihiko Kaburagi (Kyushu Univ.) SP2019-2
A large-scale Japanese articulatory-acoustic parallel data is a basic data using as a natural speech synthesis using a a... [more] SP2019-2
pp.7-12
PRMU, IPSJ-CVIM, MVE [detail] 2017-01-19
15:55
Kyoto (Kyoto) ngle independent lip reading using symmetrical 3D-AAM of facial images
Takuya Watanabe (TUT), Kouichi Katsurada (TUS), Yasushi Kanazawa (TUT) PRMU2016-134 MVE2016-25
Lip reading is a technique to recognize spoken words from only visual images of a face. There have been proposed variou... [more] PRMU2016-134 MVE2016-25
pp.135-140
PRMU, IBISML, IPSJ-CVIM [detail] 2015-09-15
11:15
Ehime (Ehime) Evaluation of Active Appearance Models using AutoEncoder
Takuya Watanabe, Kouichi Katsurada (TUT), Tsuneo Nitta (WU), Yurie Iribe (APU) PRMU2015-85 IBISML2015-45
Active Appearance Models (AAM) is a synthetic model of facial images, which reduces dimension of feature space construct... [more] PRMU2015-85 IBISML2015-45
pp.135-140
 Results 1 - 10 of 10  /   
Choose a download format for default settings. [NEW !!]
Text format pLaTeX format CSV format BibTeX format
Copyright and reproduction : All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)


[Return to Top Page]

[Return to IEICE Web Page]


The Institute of Electronics, Information and Communication Engineers (IEICE), Japan