Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
ITE-ME, ITE-IST, BioX, SIP, MI, IE |
2024-06-07 09:30 |
Niigata |
Niigata University (Ekinan-Campus "TOKIMATE") (Niigata) |
A pre-trained representation learning model can be used to decode speech from intracranial recordings Shoya Murakami, Shuji Komeiji, Kai Shigemi (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (Tokyo Tech), Toshihisa Tanaka (TUAT) SIP2024-5 BioX2024-5 IE2024-5 MI2024-5 |
Deep learning has been shown to be effective in decoding the content of a speaker's speech from recordings of brain acti... |
SIP2024-5 BioX2024-5 IE2024-5 MI2024-5 pp.23-28 |
PRMU, IPSJ-CVIM |
2024-05-16 15:40 |
Tokyo |
(Tokyo, Online) (Primary: On-site, Secondary: Online) |
Detection of Depression Using Web-Interview Data Cheuk Hee Lam, Nathania Nah, Koichi Shinoda (TokyoTech), Momoko Kitazawa, Yuriko Kaise (Keio), Shunsuke Takagi, Genichi Sugihara (TMD), Taishiro Kishimoto (Keio) PRMU2024-7 |
This paper presents a method for integrating speech, text, and video modalities for multimodal depression detection. Our... |
PRMU2024-7 pp.36-40 |
SP, NLC, IPSJ-SLP, IPSJ-NL |
2023-12-02 13:30 |
Tokyo |
Kikai-Shinko-Kaikan Bldg. (Tokyo, Online) (Primary: On-site, Secondary: Online) |
Effectiveness of Signal Compression in Speech Enhancement with Diffusion Models Yuki Nishi (Titech), Koji Iwano (Tokyo City Univ.), Koichi Shinoda (Titech) NLC2023-14 SP2023-34 |
(To be available after the conference date) |
NLC2023-14 SP2023-34 pp.1-6 |
PRMU, IPSJ-CVIM, IPSJ-DCC, IPSJ-CGVI |
2023-11-17 09:20 |
Tottori |
(Tottori, Online) (Primary: On-site, Secondary: Online) |
Co-speech Gesture Generation with Variational Auto Encoder Shihichi Ka, Koichi Shinoda (Tokyo Tech) PRMU2023-29 |
Co-speech gesture generation is the study of generating gestures from speech. In prior works, deterministic methods lear... |
PRMU2023-29 pp.74-79 |
SP, IPSJ-SLP, EA, SIP |
2023-02-28 15:10 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Speech synthesis from electrocorticogram using pre-trained neural vocoder Kai Shigemi, Shuji Komeiji (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (Tokyo Tech), Kohei Yatabe, Toshihisa Tanaka (TUAT) |
(To be available after the conference date) |
|
SP, IPSJ-SLP, EA, SIP |
2023-03-01 14:45 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Personality Recognition on Dyadic Interactions with Representation Learning Nathania Nah (Tokyo Tech), Takafumi Koshinaka (YCU), Koichi Shinoda (Tokyo Tech) EA2022-117 SIP2022-161 SP2022-81 |
Personality computing explores methods of automatically measuring human traits to create a better understanding of the h... |
EA2022-117 SIP2022-161 SP2022-81 pp.241-246 |
EA, SIP, SP, IPSJ-SLP |
2022-03-01 12:45 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
Incorporating Acoustic and Textual Information for Language Modeling in Code-switching Speech Recognition Roland Hartanto, Kuniaki Uto, Koichi Shinoda (TokyoTech) EA2021-73 SIP2021-100 SP2021-58 |
People who speak two or more languages tend to alternate the language when they are speaking. This particular phenomenon... |
EA2021-73 SIP2021-100 SP2021-58 pp.56-63 |
EA, SIP, SP, IPSJ-SLP |
2022-03-02 13:25 |
Okinawa |
(Okinawa, Online) (Primary: On-site, Secondary: Online) |
[Poster Presentation]
Transformer-based Text Decoding using Electrocorticography Shuji Komeiji, Kai Shigemi (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (Tokyo Tech), Toshihisa Tanaka (TUAT) EA2021-87 SIP2021-114 SP2021-72 |
Invasive brain-machine interfaces (BMIs) are a promising neurotechnology for achieving direct speech communication from ... |
EA2021-87 SIP2021-114 SP2021-72 pp.146-151 |
PRMU, IPSJ-CVIM, IPSJ-NL |
2021-05-21 10:45 |
Online |
Online (Online) |
3D human pose and shape reconstruction with differentiable renderer from body part segmentation Rintaro Sakurai, Kuniaki Uto, Koichi Shinoda (Tokyo Tech) PRMU2021-6 |
(To be available after the conference date) |
PRMU2021-6 pp.31-36 |
EA, US, SP, SIP, IPSJ-SLP |
2021-03-03 14:05 |
Online |
Online (Online) |
[Poster Presentation]
Noise-robust time-domain speech separation with basis signals for noise Kohei Ozamoto (Tokyo Tech), Koji Iwano (TCU), Kuniaki Uto, Koichi Shinoda (Tokyo Tech) EA2020-70 SIP2020-101 SP2020-35 |
Recently, speech separation using deep learning has been extensively studied. TasNet, a time-domain method that directly... |
EA2020-70 SIP2020-101 SP2020-35 pp.63-67 |
EA, US, SP, SIP, IPSJ-SLP |
2021-03-04 16:10 |
Online |
Online (Online) |
Estimation of imagined speech from electrocorticogram with an encoder-decoder model Kotaro Hayashi, Shuji Komeiji (TUAT), Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano (Juntendo Univ.), Koichi Shinoda (TokyoTech), Toshihisa Tanaka (TUAT) EA2020-87 SIP2020-118 SP2020-52 |
Recent advances in signal processing and machine learning technologies have made it possible to estimate and reconstruct... |
EA2020-87 SIP2020-118 SP2020-52 pp.164-169 |
PRMU |
2020-12-17 16:20 |
Online |
Online (Online) |
[Short Paper]
Few-Shot Incremental Learning by Unifying with Variational Autoencoder Keita Takayama, Kuniaki Uto, Koichi Shinoda (TokyoTech) PRMU2020-48 |
We propose a few-shot incremental learning method using a variational autoencoder for deep learning. In incremental lear... |
PRMU2020-48 pp.58-62 |
SP, EA, SIP |
2020-03-03 09:00 |
Okinawa |
Okinawa Industry Support Center (Okinawa) (Cancelled but technical report was issued) |
[Poster Presentation]
A Comparison of Language Models for a Design of Reduced Phoneme Set Shuji Komeiji, Toshihisa Tanaka (TUAT), Koichi Shinoda (titech) EA2019-152 SIP2019-154 SP2019-101 |
Language models for the design of a reduced phoneme set are compared with each other.
The reduction of the phoneme set improves ... |
EA2019-152 SIP2019-154 SP2019-101 pp.295-300 |
PRMU, IPSJ-CVIM |
2019-05-31 09:40 |
Tokyo |
(Tokyo) |
Study on feature extraction from leaf-scale plant images Kuniaki Uto (Tokyo Tech), Mauro Dalla Mura, Jocelyn Chanussot (Grenoble INP), Koichi Shinoda (Tokyo Tech) PRMU2019-7 |
With the advent of an unmanned aerial vehicle (UAV) and sensing technologies, it is possible to acquire leaf-scale aeria... |
PRMU2019-7 pp.259-264 |
EA, SIP, SP |
2019-03-14 13:30 |
Nagasaki |
i+Land nagasaki (Nagasaki-shi) (Nagasaki) |
[Poster Presentation]
A robust algorithm of phase recovery for speech enhancement Dongxiao Wang, Koichi Shinoda (TokyoTech), Hirokazu Kameoka (NTT) EA2018-122 SIP2018-128 SP2018-84 |
|
EA2018-122 SIP2018-128 SP2018-84 pp.137-142 |
PRMU |
2018-12-14 11:00 |
Miyagi |
(Miyagi) |
[Short Paper]
Skeleton-based Human Action Recognition with Fine-to-Coarse Convolutional Neural Network Thao Minh Le, Nakamasa Inoue, Koichi Shinoda (TokyoTech) PRMU2018-86 |
This work introduces a new framework for skeleton-based human action recognition. Existing approaches using Convolutiona... |
PRMU2018-86 pp.61-64 |
PRMU, SP |
2018-06-29 13:00 |
Nagano |
(Nagano) |
[Invited Talk]
Koichi Shinoda (TokyoTech) PRMU2018-33 SP2018-13 |
(To be available after the conference date) |
PRMU2018-33 SP2018-13 p.65 |
PRMU |
2017-12-17 09:30 |
Kanagawa |
(Kanagawa) |
Action Sequence Recognition in Videos by Combining a CTC Network with a Statistical Language Model Mengxi Lin, Nakamasa Inoue, Koichi Shinoda (Tokyo Tech) PRMU2017-101 |
Action sequence recognition aims to recognize what actions occur in a video and their temporal order. In this paper, we ... |
PRMU2017-101 pp.1-6 |
SP |
2016-08-25 11:10 |
Kyoto |
ACCMS, Kyoto Univ. (Kyoto) |
SP2016-37 |
(To be available after the conference date) |
SP2016-37 pp.53-58 |
PRMU, BioX |
2016-03-25 13:00 |
Tokyo |
(Tokyo) |
Boredom state estimation from spontaneous behaviors in multiparty conversations including a robot agent Yasuhiro Shibasaki (Tokyo Tech), Kotaro Funakoshi (HRI-JP), Koichi Shinoda (Tokyo Tech) BioX2015-61 PRMU2015-184 |
Conversation systems need to have the abilities to grasp the human internal information such as boredom state to becom... |
BioX2015-61 PRMU2015-184 pp.119-124 |