論文 深掘り Hugging Face 発表: 2026-05-18 HF ↑31

CogOmniControl: Reasoning-Driven Controllable Video Generation via Creative Intent Cognition

CogOmniControl: Reasoning-Driven Controllable Video Generation via Creative Intent Cognition

著者: Hongji Yang, Songlian Li, Yucheng Zhou, Xiaotong Zhao, Alan Zhao ほか2名

要約

Recent diffusion models achieve strong photorealism and fluency in video generation, yet remain fragile under abstract, sparse or complex conditions, leading to poor performance in professional production workflows such as storyboard sketches and clay render conditions. Existing video generation mod…

#multimodal#diffusion#rl#benchmark

同じカテゴリの記事