---
kind: trend
trend_doc_id: 67
granularity: day
period_start: '2026-03-06T00:00:00'
period_end: '2026-03-07T00:00:00'
topics:
- vla
- robotics
- embodiment-transfer
- long-horizon-control
- industrial-readiness
run_id: materialize-outputs
aliases:
- recoleta-trend-67
tags:
- recoleta/trend
- topic/vla
- topic/robotics
- topic/embodiment-transfer
- topic/long-horizon-control
- topic/industrial-readiness
language_code: zh-CN
---

# VLA部署短板修补加速：语言服从、视角鲁棒与真实采摘落地

## Overview
今天的论文集中在一个很明确的方向：让机器人基础模型更能在真实环境里工作。重点不是再做更大模型，而是补语言理解、视角变化、长任务控制和部署评估这些短板。主要观察-语言约束开始被单独诊断。RestoringLinguisticGroundinginVLAModelsviaTrain-Free…指出VLA存在“语言失明”，即看到场景后会忽视矛盾指令。

## Clusters

### VLA从“能跑基准”转向“补齐部署短板”

当天最强主线是 VLA 进入“部署修补期”。多篇工作不再追求更大模型，而是直接补真实使用中的脆弱点：语言约束失效、相机视角变化、长时程技能表示不足。共同特征是少改模型，更多在推理时或数据组织上做增强。

#### Representative sources
- [Restoring Linguistic Grounding in VLA Models via Train-Free Attention Recalibration](../Inbox/2026-03-06--restoring-linguistic-grounding-in-vla-models-via-train-free-attention-recalibration.md) — Ninghao Zhang; Bin Zhu; Shijie Zhou; Jingjing Chen
- [AnyCamVLA: Zero-Shot Camera Adaptation for Viewpoint Robust Vision-Language-Action Models](../Inbox/2026-03-06--anycamvla-zero-shot-camera-adaptation-for-viewpoint-robust-vision-language-action-models.md) — Hyeongjun Heo; Seungyeon Woo; Sang Min Kim; Junho Kim; Junho Lee; Yonghyeon Lee; …
- [Hierarchical Latent Action Model](../Inbox/2026-03-06--hierarchical-latent-action-model.md) — Hanjung Kim; Lerrel Pinto; Seon Joo Kim


### 泛化改进开始依赖数据结构与层级表示

两篇工作都说明，机器人泛化不只靠更多数据。跨具身迁移里，带对应关系的配对演示比单纯堆异构数据更有效；长时程学习里，从无标签视频抽取分层技能，可明显提高数据效率。这表明“数据结构”和“时序抽象”正在成为新的杠杆。

#### Representative sources
- [Data Analogies Enable Efficient Cross-Embodiment Transfer](../Inbox/2026-03-06--data-analogies-enable-efficient-cross-embodiment-transfer.md) — Jonathan Yang; Chelsea Finn; Dorsa Sadigh
- [Hierarchical Latent Action Model](../Inbox/2026-03-06--hierarchical-latent-action-model.md) — Hanjung Kim; Lerrel Pinto; Seon Joo Kim


### 真实应用扩展，但产业成熟度仍偏早期

真实场景落地开始从实验室任务走向农业与工业评估。一篇论文把开源 VLA 微调到温室草莓采摘，给出成功率、周期时间和损伤率；另一篇综述则指出，工业级成熟度仍明显不足。两者合起来看，领域化落地已有进展，但距离标准化工业部署还有很长一段路。

#### Representative sources
- [HarvestFlex: Strawberry Harvesting via Vision-Language-Action Policy Adaptation in the Wild](../Inbox/2026-03-06--harvestflex-strawberry-harvesting-via-vision-language-action-policy-adaptation-in-the-wild.md) — Ziyang Zhao; Shuheng Wang; Zhonghua Miao; Ya Xiong
- [Robotic Foundation Models for Industrial Control: A Comprehensive Survey and Readiness Assessment Framework](../Inbox/2026-03-06--robotic-foundation-models-for-industrial-control-a-comprehensive-survey-and-readiness-assessment-framework.md) — David Kube; Simon Hadwiger; Tobias Meisen


### 效率优化与数据采集工具同步推进

除了操作，VLA 也在向实时导航和高质量示教接口延伸。导航侧通过免训练 token 剪枝减轻推理负担；示教侧通过低成本力反馈手套提高灵巧操作数据质量。这些工作都在补“系统链路”中的非模型环节。

#### Representative sources
- [History-Conditioned Spatio-Temporal Visual Token Pruning for Efficient Vision-Language Navigation](../Inbox/2026-03-06--history-conditioned-spatio-temporal-visual-token-pruning-for-efficient-vision-language-navigation.md) — Qitong Wang; Yijun Liang; Ming Li; Tianyi Zhou; Christopher Rasmussen
- [CDF-Glove: A Cable-Driven Force Feedback Glove for Dexterous Teleoperation](../Inbox/2026-03-06--cdf-glove-a-cable-driven-force-feedback-glove-for-dexterous-teleoperation.md) — Huayue Liang; Ruochong Li; Yaodong Yang; Long Zeng; Yuanpei Chen; Xueqian Wang
