Online edition: ISSN 2432-6380
[TOP] | [2018] | [2019] | [2020] | [2021] | [2022] | [2023] | [2024] | [Japanese] / [English]
ITS2024-51
Online Continual Learning on a Synthetic Contaminated Datastream
Maorong Wang, Nicolas Michel, Jiafeng Mao, Toshihiko Yamasaki (UTokyo)
pp. 1 - 6
ITS2024-52
Llava-Planner: Enhancing Spatial Awareness of LLaVA for Cost-Effective Path Planning
Ling Xiao, Hiromasa Yamanishi, Toshihiko Yamasaki (UTokyo)
pp. 7 - 12
ITS2024-53
Dataset Construction and Quality Classification Model Development for Automating Agricultural Crop Sorting Tasks
Koyu Mizutani, Toshihiko Yamasaki (UTokyo)
pp. 13 - 18
ITS2024-54
Learned Image Compression with State Space Model
Shimon Murai (Waseda Univ.), Heming Sun (YNU), Jiro Katto (Waseda Univ.)
pp. 19 - 23
ITS2024-55
Deep learning study of the relationship between aesthetic evaluations by category
Akihiro Tsuchiya, Shunsuke Haga, Yoshiaki Yasumura (SIT)
pp. 24 - 29
ITS2024-56
Leveraging Multimodal Large Language Models for Japanese Recipe Generation from Food Photos
Yuki Imajuku, Yoko Yamakata, Kiyoharu Aizawa (Univ. of Tokyo)
pp. 30 - 35
ITS2024-57
A Benchmark and Evaluation for Real-World Out-of-Distribution Detection Using Vision-Language Model
Shiho Noda, Atsuyuki Miyai, Qing Yu (UTokyo), Go Irie (TUS), Kiyoharu Aizawa (UTokyo)
pp. 36 - 41
ITS2024-58
Pseudo-anomaly data generation method based on SVDD for anomaly detection
Bourouis mouad, Tsuchio shuta, Kitamura takuya (NIT)
pp. 42 - 46
ITS2024-59
Recipe suggestion system with generative AI based on flyer image
Koga Tanaka, Takuya Kitamura (NIT)
pp. 47 - 52
ITS2024-60
Towards optimized remote operation of driverless buses
-- Prelimanry study on evaluating remote monitoring performance using VR --
Koki Masuda (AIST/TUS), Yanbin Wu, Naohisa Hashimoto (AIST)
pp. 53 - 57
ITS2024-61
Research on a system for identifying pedestrians' intention to cross the road by posture estimation
Daina Fujii (TUS), Shin Kato (AIST)
pp. 58 - 63
ITS2024-62
Impainting of Preceding Vehicles from In-vehicle Camera Images based on Spatio-Temporal Image Processing
-- Aiming for Real Video Driving Simulator --
Shintaro Ono, Kazuki Matsumoto, Yudai Inoue, Da Li (Fukuoka Univ.)
pp. 64 - 70
ITS2024-63
Stress Reduction Method for Self-Driving Car by AR Information in Obstacle Avoidance
Ryosuke Shigeto, Isidro Butaslac, Taishi Sawabe (NAIST), Masayuki Kanbara (NAIST/Konan Univ), Hirokazu Kato (NAIST)
pp. 71 - 74
ITS2024-64
Difficulty-aware Generation for Image Classification
Zerun Wang (UTokyo), Jiafeng Mao, Xueting Wang (CyberAgent), Toshihiko Yamasaki (UTokyo)
pp. 75 - 79
ITS2024-65
Momentum Knowledge Distillation for Enhanced Online Continual Learning
Nicolas Michel, Wang Maorong, Ling Xiao, Toshihiko Yamasaki (UTokyo)
pp. 80 - 85
ITS2024-66
Exploiting LLM Quantization
Kazuki Egashira (Univ. of Tokyo), Mark Vero, Robin Staab, Jingxuan He, Martin Vechev (ETHZ)
pp. 86 - 91
ITS2024-67
Proposal of an object detection model based on intermediate feature fusion of visible light and infrared light images
Keito Sasamori (MIT), Kousuke Hirasawa (KONICA MINOLTA), Satoshi Kondo (MIT)
pp. 92 - 96
ITS2024-68
Automatic Scoring of Sports Based on Scene Segmentation and Motion Feature Learning Using Video Analysis
Shun Tateuchi, Satoshi Kondo (Muroran Institute of Tech.)
pp. 97 - 100
ITS2024-69
TourLMM: Task-Adaptive Retrieval-Augmented Multimodal Model for Tourism Domain
Hiromasa Yamanishi, Xiao Ling, Toshihiko Yamasaki (UTokyo)
pp. 101 - 106
ITS2024-70
StyleRec: Multimodal Recommendation Using Visual-Language Features Containing Sentimental Styles
Chi Zhang, Luwei Zhang, Toshihiko Yamasaki (UTokyo)
pp. 107 - 111
ITS2024-71
Feature Point Matching Accuracy Using Hyperspectral Images for High-performance 3D Reconstruction
Tomohiro Yuguchi, Kaito Houjho, Ryota Fuse, Terumasa Aoki (TUT)
pp. 112 - 117
ITS2024-72
Research on Lightweight Models for Speeding up Human Pose Estimation
Akihiro Mizuki, Kazuki Ikeda, Shoto Morishita, Qiu Chen (Kogakuin Univ.)
pp. 118 - 122
ITS2024-73
Adversarial Attack-Based Method for Protecting Facial Images from Generative AI
Tomohiro Isono, Seishu Matsui, Terumasa Aoki (TUT)
pp. 123 - 128
ITS2024-74
Explainable Image Aesthetic Assessment Leveraging Vision-Language Models
Supatta Viriyavisuthisakul (PIM), Shun Yoshida, Kaede Shiohara, Ling Xiao, Toshihiko Yamasaki (UTokyo)
pp. 129 - 133
ITS2024-75
Application of Adversarial Defense Methods for High-Accuracy Underwater Object Detection
Seishu Matsui, Terumasa Aoki (Tokyo University of Technology)
pp. 134 - 139
ITS2024-76
Vision Transformer Construction Method Suitable for Medical Image
Fumito Shimizu, Terumasa Aoki (TUT)
pp. 140 - 145
ITS2024-77
Distinction Between Real and AI-Generated Images Based on Absolute Error Images
Sora Kawai, Terumasa Aoki (TUT)
pp. 146 - 151
ITS2024-78
A Study of Spectrally-varying PSFs-based Compressive Spectral Imaging with Structured Illumination
Ryo Shirakawa, Yoko Sogabe, Shoichiro Saito, Masaki Kitahara (NTT)
pp. 152 - 157
ITS2024-79
Communication-Efficient Real-Time External Abnormality Detection for AVs Using MLLMs
Kiminobu Makino, Naoya Kaneko, Toru Furusawa, Haruka Futatsubashi, Shota Mishima (TMC)
pp. 158 - 163
ITS2024-80
A preliminary evaluation of a Lighting HMI to Improve Passengers' Safety and Comfort in Driverless Buses.
Satoru Ogino, Kosei Ido (AIST/TUS), Yanbin Wu, Toru Kumagai, Takahiro Miura, Naohisa Hashimoto (AIST)
pp. 164 - 169
ITS2024-81
On the Effectiveness of Information Screens Applied Nudegs for Decreasing Passage Time of Airport Security Checkpoint
Taisei Ichimura (Saitama Univ.), Yuichiro Ishikawa, Naoya Kawase, Hisakazu Mizuno, Yuichiro Hirata (CJIAC), Hideo Ideguchi, Hiroyuki Uchiyama (AGP), Tetsuya Manabe (Saitama Univ.)
pp. 170 - 175
ITS2024-82
View Generation of the Subject from Free Viewpoint Images and Its Evaluation
Sota Yamazaki, Kaito Houjho, Aoki Terumasa (TUT)
pp. 176 - 181
ITS2024-83
Skin Cancer Classification Using Hyperspectral Images Generated from RGB Images
Ryota Fuse, Koki Shimizu, Terumasa Aoki (TUT)
pp. 182 - 187
ITS2024-84
Performance Enhancement Free Viewpoint Image Methods Using for Deep-Learning Super-Resolution Techniques
Kaito Houjho, Ren Nakasato, Terumasa Aoki (TUT)
pp. 188 - 193
ITS2024-85
Optimal Bit Allocation in Multilayer Coding
Yuichi Kondo, Yuichi Kusakabe, Atsuro Ichigaya (NHK)
pp. 194 - 199
ITS2024-86
Effect of image feature localization by patching on subjective evaluation of coding distortion by binary decision
Soichiro Honda, Kohei Hayashi, Hirokazu Kamei (NITech), Yoshihiro Maeda (SIT), Norishige Fukushima (NITech)
pp. 200 - 205
ITS2024-87
Image Quality Enhancement of 3D Vascular Images Using Diffusion Models
Masahiro Maruichi, Yasumura Yoshiaki (SIT)
pp. 206 - 211
ITS2024-88
Multi-Stage Context Model for 3D Point Cloud Compression
Hayato Shimizu, Ao LUO (Waseda), Fangzheng Lin (Science Tokyo), Jiro Katto (Waseda)
pp. 212 - 217
ITS2024-89
A Study of Frequency Domain Image Super-Resolution Using Discrete Cosine Transform
Junichi Nakajima, Takayuki Kurozumi (NTT)
pp. 218 - 223
ITS2024-90
Automatic Classification of Fundamental Sports Movements in Children's Developmental Stages Using a Depth Camera
Kousei Yoshino (Muroran Institute of Tec.), Hajime Toda (Sapporo Medical Univ.), Satoshi Kondo (Muroran Institute of Tec.)
pp. 224 - 227
ITS2024-91
Evaluation of Doctor Proficiency Using Behavioral Quality Assessments from Surgical Videos
Manato Yatabe, Satoshi Kondo (MIT)
pp. 228 - 230
ITS2024-92
Bowing Motion Analysis and Skill Evaluation Using Violin-Mounted Camera and Image Recognition AI
Kosei Sabato, Nobutaka Kuroki, Masahiro Numa (Kobe Univ.)
pp. 231 - 236
ITS2024-93
Research on Handwritten Character Beautification Using Generative Adversarial Networks
Kota Fukushima, Akihiro Ito, Qiu Chen (Kogakuin Univ.)
pp. 237 - 241
ITS2024-94
Amodal Human Segmentation Using Human Pose Estimation
Taiki Sugiura, Toru Tamaki (Nitech)
pp. 242 - 247
ITS2024-95
Report generation of chest X-ray images by LLM using RAG
Ayato Ota (MIT), Chiharu Kai, Satoshi Kasai (NUHW), Satoshi Kondo (MIT)
pp. 248 - 251
ITS2024-96
Alignment of mammograms using a registration model
Masahiro Shirahata (MIT), Chiharu Kai, Satoshi Kasai (NUHW), Satoshi Kondo (MIT)
pp. 252 - 257
ITS2024-97
Development of a 360-Degree Video-Based Virtual Indoor Exploration System and Its Application to Train Stations
Zhuofan Sun (The Univ. of Tokyo), Naoki Sugimoto (MMMakerSugi), Leslie Woehler (The Univ. of Tokyo), Satoshi Ikehata (NII), Kiyoharu Aizawa (The Univ. of Tokyo)
pp. 258 - 263
ITS2024-98
Construction of a Perception-Based Facial Image Similarity Scale and Its Application to Face Anonymization
Haruka Kumagai, Leslie Wohler (UTokyo), Satoshi Ikehata (NII), Kiyoharu Aizawa (UTokyo)
pp. 264 - 269
Note: Each article is a technical report without peer review, and its polished version will be published elsewhere.