Tomoya Yoshida

Tomoya Yoshida

Ph.D. student

About Me

I am Tomoya Yoshida, a Ph.D. student in the Graduate School of Informatics at Kyoto University. My interests: Vision-and-Language, Egocentric vision, and Robot learning.

Experiences

Research Assistant

Feb 2024 - Present · NII LLMC

Research Assistant

Aug 2023 - Present · RIKEN AIP

Software Engineer Intern

Aug 2022 - Oct 2022 · Morpho, Inc.

Education

Ph.D in Informatics

2024 - Present · Kyoto University

M.S. in Informatics

2022 - 2024 · Kyoto University

B.S. in Emerging Multi-Interdisciplinary Engineering

2018 - 2022 · The University of Electro-Communications

Publications

Preprint

2025. Tomohiro Nishimoto, Taichi Nishimura, Koki Yamamoto, Keisuke Shirai, Hirotaka Kameko, Yuto Haneji, Tomoya Yoshida, Keiya Kajimura, Taiyu Cui, Chihiro Nishiwaki, Eriko Daikoku, Natsuko Okuda, Fumihito Ono, and Shinsuke Mori. BioVL-QR: Egocentric Biochemical Vision-and-Language Dataset using Micro QR Codes. Arxiv:2404.03161.

2025. Yuto Haneji, Taichi Nishimura, Hirotaka Kameko, Keisuke Shirai, Tomoya Yoshida, Keiya Kajimura, Koki Yamamoto, Taiyu Cui, Tomohiro Nishimoto, and Shinsuke Mori. EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos with Procedural Texts. Arxiv:2410.05343.

2024. Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, and Shinsuke Mori. Text-driven Affordance Learning from Egocentric Vision. Arxiv:2404.02523.

International Journal, Conference, and Workshop

2025. Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, and Shinsuke Mori. Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision. CVPR.

Domestic Journal, Conference, and Workshop

2023. 吉田智哉, 西村太一, 亀甲博貴, 森信介. 単語の階層関係に基づくデータ拡張を利用した画像キャプション生成の検討. 言語処理学会第29回年次大会 (NLP2023).

2024. 吉田智哉, 栗田修平, 西村太一, 森信介. 一人称視点に基づくテキスト駆動型アフォーダンス及び軌跡の学習. 言語処理学会第30回年次大会 (NLP2024).