Robotic visuomotor policies generalize markedly better across spatial configurations when proprioceptive state inputs are removed. The resulting state-free approach pairs a relative end-effector action space with comprehensive egocentric vision, yielding robust task performance in varied spatial layouts while also improving data efficiency and cross-embodiment adaptation.
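The input/output signature of such a policy can be illustrated with a minimal sketch: the network below consumes only egocentric RGB views (no joint angles or end-effector pose) and emits a relative end-effector action. All names here (`StateFreePolicy`, the encoder layout, the 7-dimensional action) are hypothetical stand-ins for illustration, not the authors' implementation.

```python
# Minimal sketch of a vision-only ("state-free") visuomotor policy in PyTorch.
# Hypothetical architecture: the observation contains egocentric camera views
# only, and the output is a relative end-effector delta rather than an
# absolute target, so no proprioceptive state is needed as input.
import torch
import torch.nn as nn


class StateFreePolicy(nn.Module):
    def __init__(self, num_views: int = 2, action_dim: int = 7, embed_dim: int = 256):
        super().__init__()
        # Shared CNN encoder applied to each egocentric view.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=5, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, embed_dim),
        )
        # Action head: relative end-effector motion (translation, rotation,
        # gripper), applied in the current end-effector frame by the controller.
        self.head = nn.Sequential(
            nn.Linear(num_views * embed_dim, 256), nn.ReLU(),
            nn.Linear(256, action_dim),
        )

    def forward(self, views: torch.Tensor) -> torch.Tensor:
        # views: (batch, num_views, 3, H, W) -- vision-only observation.
        b, v, c, h, w = views.shape
        feats = self.encoder(views.reshape(b * v, c, h, w)).reshape(b, -1)
        return self.head(feats)  # (batch, action_dim) relative EE action


# Example rollout step: the policy never sees the robot's joint or pose state.
policy = StateFreePolicy()
obs = torch.randn(1, 2, 3, 128, 128)  # two egocentric camera views
delta_action = policy(obs)
```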
Researchers from Tsinghua University and collaborating institutions introduce OneTwoVLA, a unified vision-language-action model that dynamically switches between reasoning and acting modes for robotic control. Combined with a scalable pipeline for synthesizing embodied reasoning data, the model improves performance on long-horizon tasks while maintaining real-time responsiveness.
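A hedged sketch of what such mode switching could look like at the control-loop level is below; the interface (`UnifiedReasonActModel`, `StepOutput`, the reasoning trigger) is invented for illustration and is not OneTwoVLA's actual API.

```python
# Illustrative sketch (not OneTwoVLA's implementation) of a single model that
# decides, step by step, whether to emit reasoning text or a motor action.
# All class and method names here are hypothetical.
from dataclasses import dataclass


@dataclass
class StepOutput:
    mode: str           # "reason" or "act"
    text: str = ""      # plan / reasoning update when in reasoning mode
    action: tuple = ()  # low-level robot command when in acting mode


class UnifiedReasonActModel:
    """Stand-in for a unified VLA model; real inference code would go here."""

    def step(self, image, instruction, scratchpad):
        # The model itself chooses the mode, e.g. reasoning at sub-task
        # boundaries and acting otherwise (toy trigger for illustration).
        if not scratchpad:
            return StepOutput(mode="reason", text=f"Plan for: {instruction}")
        return StepOutput(mode="act", action=(0.0,) * 7)


def control_loop(model, camera, robot, instruction, max_steps=200):
    scratchpad = []
    for _ in range(max_steps):
        out = model.step(camera.read(), instruction, scratchpad)
        if out.mode == "reason":
            # A reasoning step updates the plan without sending a motor
            # command; acting steps keep running at control frequency.
            scratchpad.append(out.text)
        else:
            robot.send_action(out.action)


class FakeCamera:
    def read(self):
        return None  # stands in for an RGB frame


class FakeRobot:
    def send_action(self, action):
        pass  # stands in for a low-level controller


control_loop(UnifiedReasonActModel(), FakeCamera(), FakeRobot(), "tidy the table")
```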