LightVLA introduces a differentiable token pruning framework that simultaneously boosts task success rates and reduces computational overhead in Vision-Language-Action (VLA) models, making them more efficient for deployment on resource-constrained platforms. The framework achieved a 2.6% improvement in task success rate and a 59.1% reduction in total FLOPs on the LIBERO benchmark, relative to its foundation model, OpenVLA-OFT.
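The core idea of differentiable token pruning can be sketched as scoring visual tokens against a task embedding, keeping only the top-scoring tokens, and weighting the survivors by soft scores so gradients can flow through the selection during training. This is a minimal illustration under assumed shapes and names (`soft_token_prune`, `keep_ratio`, `tau` are hypothetical), not the LightVLA implementation:

```python
import numpy as np

def soft_token_prune(tokens, query, keep_ratio=0.5, tau=1.0):
    """Illustrative differentiable token pruning sketch (not LightVLA's code).

    tokens: (N, D) visual token embeddings
    query:  (D,) task/instruction embedding
    """
    # Relevance of each visual token to the task query.
    scores = tokens @ query / np.sqrt(tokens.shape[1])
    # Soft, differentiable keep-weights via a temperature softmax.
    weights = np.exp(scores / tau)
    weights /= weights.sum()
    # Hard top-k selection of the most relevant tokens.
    k = max(1, int(keep_ratio * len(tokens)))
    keep = np.argsort(weights)[-k:]
    # Surviving tokens are modulated by their soft weights, so in an
    # autodiff framework gradients would flow through the scorer.
    return tokens[keep] * weights[keep, None] * len(tokens)

rng = np.random.default_rng(0)
visual_tokens = rng.standard_normal((8, 4))
task_query = rng.standard_normal(4)
pruned = soft_token_prune(visual_tokens, task_query, keep_ratio=0.5)
print(pruned.shape)  # (4, 4): half the tokens remain
```

Dropping half the visual tokens before the language-model backbone is what drives the FLOPs reduction: attention and MLP cost scale with sequence length.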
Researchers from LiAuto Inc. developed the AVA-VLA framework, reformulating Vision-Language-Action models from a Partially Observable Markov Decision Process perspective, which allows for dynamic visual attention based on historical context. The system achieves state-of-the-art success rates on robot manipulation benchmarks and demonstrates robust real-world performance on a dual-arm robot.
The COPO framework enhances Large Language Models' reasoning capabilities by resolving vanishing gradients in Group-Relative Policy Optimization (GRPO). It integrates local and global optimization strategies, ensuring all training samples contribute meaningful learning signals, leading to superior performance on mathematical reasoning tasks.
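The vanishing-gradient issue in GRPO is easy to see from its group-relative advantage: each sampled response's reward is normalized against the group's mean and standard deviation, so when every response in a group earns the same reward (all correct or all wrong), every advantage is zero and the group contributes no learning signal. A minimal sketch (illustrative only, not the COPO implementation):

```python
import numpy as np

def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantage: normalize rewards within a sampled group.
    Illustrative sketch; `eps` avoids division by zero."""
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + eps)

# Mixed outcomes yield informative, nonzero advantages.
print(group_relative_advantages([1.0, 0.0, 1.0, 0.0]))  # [ 1. -1.  1. -1.]

# Uniform outcomes yield all-zero advantages: the policy gradient
# for this group vanishes, which is the failure mode COPO targets.
print(group_relative_advantages([1.0, 1.0, 1.0, 1.0]))  # [0. 0. 0. 0.]
```

COPO's local/global combination is described as keeping such uniform-reward groups from being wasted, so every training sample still contributes a learning signal.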