Simple Automation
OpenGVL introduces an open-source benchmark and tool that evaluates Vision-Language Models (VLMs) on their ability to predict robotic task progress from visual observations, aiding in automated data curation. The system quantifies a significant performance disparity between open-source and proprietary VLMs while effectively identifying various data quality issues in diverse robotics datasets.
There are no more papers matching your filters at the moment.