Paper Hugging Face Published: 2026-05-18 HF ↑11

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

Authors: Yujie Wei, Yujin Han, Zhekai Chen, Yongming Li, Kaixun Jiang, and 18 others

Abstract

Video generation is rapidly evolving from single-shot synthesis to complex multi-shot audio-video (MSAV) narratives to meet real-world demands. However, evaluating such frontier models remains a fundamental challenge. Existing benchmarks are limited in scope and data diversity, and rely on rigid eva…

#benchmark #agent #alignment
