Ask or search anything...

History

Events

Watch Recordings

AI for Law01/09 · Joel Niklaus · Hugging Face

Papers Benchmarks

Hot

Maitrix

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

27 Jun 2025

Husky Doge (Husky)

Chuanyang Jin

University of Michigan

University of California, San Diego

A new atomic evaluation framework, WM-ABench, systematically assesses Vision-Language Models' capabilities as internal world models, revealing significant limitations in their spatial, temporal, and dynamic scene understanding, as well as their ability to perform causal, transitive, and compositional predictions, falling substantially short of human performance.

View blog

#computer-science #artificial-intelligence #computation-and-language

Resources

510

There are no more papers matching your filters at the moment.

alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Browser Extension

Dark mode

Ask or search anything...

Events