Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA |
2024-05-22 16:50 |
Online |
Online |
[Invited Talk]
Fundamentals of Diffusion-based Generative Models and their Application to Speech Enhancement and Separation Scheibler Robin (LY Corp.) EA2024-9 |
(To be available after the conference date) [more] |
EA2024-9 p.38 |
NLP, MSS |
2024-03-13 17:20 |
Misc. |
Kikai-Shinko-Kaikan Bldg. |
Application of Data Augmentation in Japanese Foundation Models Kazuki Era, Hidehiro Nakano (Tokyo City Univ.) MSS2023-84 NLP2023-136 |
One of the recent topics is data augmentation. Data augmentation is a method of augmenting training data to improve the ... [more] |
MSS2023-84 NLP2023-136 pp.66-69 |
SS |
2024-03-07 17:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
For evaluating the effectiveness of CodeT5 transfer learning in refactoring recommendations. Yuto Nakajima, Kenji Fujiwara (Tokyo City University) SS2023-62 |
Refactoring is "the process of restructuring the internal architecture of software to make it easier to understand and m... [more] |
SS2023-62 pp.79-84 |
SIP, SP, EA, IPSJ-SLP [detail] |
2024-02-29 15:45 |
Okinawa |
(Primary: On-site, Secondary: Online) |
|
We have developed automatic speech recognition and dialect identification techniques by using COJADS, a corpus of Japane... [more] |
|
SIP, SP, EA, IPSJ-SLP [detail] |
2024-03-01 10:40 |
Okinawa |
(Primary: On-site, Secondary: Online) |
Substitution of Implicit Linguistic Information in Beam Search Decoding Using CTC-based Speech Recognition Models Tatsunari Takagi, Yukoh Wakabayashi (TUT), Atsunori Ogawa (NTT), Norihide Kitaoka (TUT) EA2023-106 SIP2023-153 SP2023-88 |
The rise of neural networks in the field of automatic speech recognition has notably improved the accuracy of speech rec... [more] |
EA2023-106 SIP2023-153 SP2023-88 pp.268-273 |
ITS, IE, ITE-MMS, ITE-ME, ITE-AIT [detail] |
2024-02-20 10:15 |
Hokkaido |
Hokkaido Univ. |
Image Attractiveness Analysis with Explanation using Vision-Language Model Shun Yoshida, Kaede Shiohara, Toshihiko Yamasaki (UTokyo) ITS2023-61 IE2023-50 |
There has been research on making machines analyze the image attractiveness, and in recent years, further progress has b... [more] |
ITS2023-61 IE2023-50 pp.82-87 |
ITS, IE, ITE-MMS, ITE-ME, ITE-AIT [detail] |
2024-02-20 11:00 |
Hokkaido |
Hokkaido Univ. |
Image Generation Modification with Diffusion Model through Sketch Guidance Sandra Zhang Ding, Jiafeng Mao, Kiyoharu Aizawa (UTokyo) ITS2023-64 IE2023-53 |
Large-scale image generation models have demonstrated their remarkable ability to generate diverse, high-quality images.... [more] |
ITS2023-64 IE2023-53 pp.95-99 |
HCGSYMPO (2nd) |
2023-12-11 - 2023-12-13 |
Fukuoka |
Asia pacific Import Mart (Kitakyushu) (Primary: On-site, Secondary: Online) |
Compact Emotional Space Simulating Human Percieve of Emotion Based on Crossmodal Contrastive Learning with Softlabel Seiichi Harata, Takuto Sakuma, Shohei Kato (NITech) |
This study aims to explore data-driven emotion modeling by extracting the latent space of emotions from human emotion ex... [more] |
|
PRMU, IPSJ-CVIM, IPSJ-DCC, IPSJ-CGVI |
2023-11-17 14:40 |
Tottori |
(Primary: On-site, Secondary: Online) |
Diffusion-based Geometric Unwarping and Illumination Correction for Document Images Sota Imahayashi, Guoqing Hao, Satoshi Iizuka, Kazuhiro Fukui (Univ. of Tsukuba) PRMU2023-36 |
This study proposes a method to improve the visibility of document images by correcting distortions and re-illuminating ... [more] |
PRMU2023-36 pp.113-118 |
BioX |
2023-10-13 10:20 |
Okinawa |
Nobumoto Ohama Memorial Hall |
Discrimination between Real and Generated Gestures of Speakers
-- An Attempt to Improve Generalization Performance in Unseen Generation Methods through Self-Supervised Learning -- Geng Mu (AGU), Naoshi Kaneko (TDU), Kazuhiko Sumi (AGU) BioX2023-67 |
Currently, discerning artificially generated misinformation is a critical societal challenge, with research progressing ... [more] |
BioX2023-67 pp.44-49 |
MIKA (3rd) |
2023-10-10 15:35 |
Okinawa |
Okinawa Jichikaikan (Primary: On-site, Secondary: Online) |
[Poster Presentation]
An Evaluation of the Generalizability of Influencer Prediction Models between Social Networks in Different Domains Kota Tahara, Sho Tsugawa (ITF) |
Identifying influencers on social media is one of the important research issues. Various methods for identifying influen... [more] |
|
DE, IPSJ-DBS, IPSJ-IFAT [detail] |
2023-09-21 15:50 |
Fukuoka |
Kitakyushu International Conference Center |
BoxPlotQA: Visual Question Answering for Measuring Five-Number Summary and Comparison Performance with Box Plot Yusuke Tozaki, Hisashi Miyamori (Kyoto Sangyo Univ.) DE2023-16 |
Recently, visual question and answer (VQA) research on document and chart images, as well as natural images, has attract... [more] |
DE2023-16 pp.31-36 |
DE, IPSJ-DBS, IPSJ-IFAT [detail] |
2023-09-22 10:30 |
Fukuoka |
Kitakyushu International Conference Center |
Probing the ability to accurately understand and utilize the ordinal numbers by visual language models Ryuto Masuda, Hisashi Miyamori (Kyoto Sangyo Univ.) DE2023-20 |
In this paper, we investigate the extent to which visual language models have the ability to accurately grasp and utiliz... [more] |
DE2023-20 pp.54-59 |
AI |
2023-09-12 16:50 |
Hokkaido |
|
Koki Miyauchi, Ryohei Orihara, Yuichi Sei, Yasuyuki Tahara, Akihiko Ohsuga (UEC) AI2023-34 |
Icons are indispensable elements for websites and smartphone applications. For website developers, the creation of icons... [more] |
AI2023-34 pp.201-206 |
NLC |
2023-09-07 15:55 |
Osaka |
Osaka Metropolitan University. Nakamozu Campus. (Primary: On-site, Secondary: Online) |
Sentiment-based correction and interpretation of the diffusion index Wanwan Zheng (Nagoya Univ.) NLC2023-12 |
Reading economic indices is common when assessing economic conditions. In recent years, however, many studies have been ... [more] |
NLC2023-12 pp.63-68 |
CQ, MIKA (Joint) |
2023-08-31 15:55 |
Fukushima |
Tenjin-Misaki Sports Park |
Semantic Communication with Masked Autoencoders: Enhancing Efficiency in Image Transmission Jiale Wu, Zhaoyang Du, Celimuge Wu, Tsutomu Yoshinaga (UEC) CQ2023-29 |
Semantic communication, a promising candidate for 6G technology, has become a research hot spot. However, existing studi... [more] |
CQ2023-29 pp.20-25 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-23 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Poster Presentation]
Generation of colored subtitle images based on emotional information of speech utterances Fumiya Nakamura (Kobe Univ.), Ryo Aihara (Mitsubishi Electric), Ryoichi Takashima, Tetsuya Takiguchi (Kobe Univ.), Yusuke Itani (Mitsubishi Electric) SP2023-11 |
Conventional automatic subtitle generation systems based on speech recognition do not take into account paralinguistic i... [more] |
SP2023-11 pp.54-59 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
[Short Paper]
SBERT-based Musical Components Estimation from Lyrics Trained with Imbalanced "Orpheus" Data Mastuti Puspitasari, Takuya Takahashi (UEC), Gen Hori (AU), Shigeki Sagayama, Toru Nakashika (UEC) SP2023-18 |
This research was done to develop neural models that are capable of estimating appropriate musical components based on l... [more] |
SP2023-18 pp.86-90 |
SP, IPSJ-MUS, IPSJ-SLP [detail] |
2023-06-24 13:50 |
Tokyo |
(Primary: On-site, Secondary: Online) |
Domain adaptation of speech recognition models based on self-supervised learning using target domain speech Takahiro Kinouchi (TUT), Atsunori Ogawa (NTT), Yuko Wakabayashi, Norihide Kitaoka (TUT) SP2023-19 |
In this study, we propose a domain adaptation method using only speech data in the target domain without using transcrib... [more] |
SP2023-19 pp.91-96 |
ET, IPSJ-CLE |
2023-06-17 16:05 |
Tokyo |
Tokyo Metropolitan University (Primary: On-site, Secondary: Online) |
Construction Support System of Behavioral Model of Opponent in Strategy Board Game Ryoma Kayano, Tomoko Kojiri (Kansai Univ.) ET2023-6 |
In a strategy board game such as Othello, it is important to determine one’s own actions by estimating the actions of th... [more] |
ET2023-6 pp.11-17 |