MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
Abstract
Long-term agent memory is increasingly multimodal, yet existing evaluations rarely test whether agents preserve the visual evidence needed for later reasoning. In prior work, many visually grounded questions can be answered using only captions or textual traces, allowing answers to be inferred witho…