LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation
LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation
要約
Autoregressive (AR) video diffusion enables variable-length synthesis, but long-horizon generation often suffers from accumulated errors and identity drift. For efficiency, existing methods commonly adopt sliding-window attention during generation. This creates an irreversible generation trajectory:…