VLA从“能跑基准”转向“补齐部署短板”
当天最强主线是 VLA 进入“部署修补期”。多篇工作不再追求更大模型,而是直接补真实使用中的脆弱点:语言约束失效、相机视角变化、长时程技能表示不足。共同特征是少改模型,更多在推理时或数据组织上做增强。
Representative sources
- Restoring Linguistic Grounding in VLA Models via Train-Free Attention Recalibration — Ninghao Zhang; Bin Zhu; Shijie Zhou; Jingjing Chen
- AnyCamVLA: Zero-Shot Camera Adaptation for Viewpoint Robust Vision-Language-Action Models — Hyeongjun Heo; Seungyeon Woo; Sang Min Kim; Junho Kim; Junho Lee; Yonghyeon Lee; …
- Hierarchical Latent Action Model — Hanjung Kim; Lerrel Pinto; Seon Joo Kim