論文 深掘り arXiv 発表: 2026-05-25

Prism: A Plug-in Reproducible Infrastructure for Scalable Multimodal Continual Instruction Tuning

Prism: A Plug-in Reproducible Infrastructure for Scalable Multimodal Continual Instruction Tuning

著者: Jun-Tao Tang, Yu-Cheng Shi, Zhen-Hao Xie, Da-Wei Zhou

要約

Multimodal Large Language Models (MLLMs) achieve versatility by reformulating diverse tasks into a unified instruction-following framework via instruction tuning. However, real-world deployment requires continuous adaptation to emerging tasks, motivating Multimodal Continual Instruction Tuning (MCIT…

#llm#multimodal#fine-tuning

同じカテゴリの記事