About Me
I am Tomoya Yoshida, a Ph.D. student in the Graduate School of Informatics at Kyoto University. My research interests include vision-and-language, egocentric vision, and robot learning.
Experiences
Research Assistant
Feb 2024 - Present · NII LLMC
Research Assistant
Aug 2023 - Present · RIKEN AIP
Software Engineer Intern
Aug 2022 - Oct 2022 · Morpho, Inc.
Education
Ph.D. in Informatics
2024 - Present · Kyoto University
M.S. in Informatics
2022 - 2024 · Kyoto University
B.S. in Emerging Multi-Interdisciplinary Engineering
2018 - 2022 · The University of Electro-Communications
Publications
Preprint
2025. Tomohiro Nishimoto, Taichi Nishimura, Koki Yamamoto, Keisuke Shirai, Hirotaka Kameko, Yuto Haneji, Tomoya Yoshida, Keiya Kajimura, Taiyu Cui, Chihiro Nishiwaki, Eriko Daikoku, Natsuko Okuda, Fumihito Ono, and Shinsuke Mori. BioVL-QR: Egocentric Biochemical Vision-and-Language Dataset using Micro QR Codes. arXiv:2404.03161.
2025. Yuto Haneji, Taichi Nishimura, Hirotaka Kameko, Keisuke Shirai, Tomoya Yoshida, Keiya Kajimura, Koki Yamamoto, Taiyu Cui, Tomohiro Nishimoto, and Shinsuke Mori. EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos with Procedural Texts. arXiv:2410.05343.
2024. Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, and Shinsuke Mori. Text-driven Affordance Learning from Egocentric Vision. arXiv:2404.02523.
International Journal, Conference, and Workshop
2025. Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, and Shinsuke Mori. Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision. CVPR.
Domestic Journal, Conference, and Workshop
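2024. Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, and Shinsuke Mori. Text-driven Affordance and Trajectory Learning from Egocentric Vision (in Japanese). The 30th Annual Meeting of the Association for Natural Language Processing (NLP2024).
2023. Tomoya Yoshida, Taichi Nishimura, Hirotaka Kameko, and Shinsuke Mori. A Study of Image Caption Generation Using Data Augmentation Based on Hierarchical Relations between Words (in Japanese). The 29th Annual Meeting of the Association for Natural Language Processing (NLP2023).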