World Action Models: The Next Frontier in Embodied AI
World Action Models: The Next Frontier in Embodied AI
要約
Vision-Language-Action (VLA) models have achieved strong semantic generalization for embodied policy learning, yet they learn reactive observation-to-action mappings without explicitly modeling how the physical world evolves under intervention. A growing body of work addresses this limitation by int…