alphaXiv

Smartesting

02 Apr 2025

computer-science software-engineering

Are Autonomous Web Agents Good Testers?

LaBRI Smartesting

Researchers from Smartesting and Université de Bordeaux evaluated how well autonomous web agents execute natural-language test cases, introducing a dedicated benchmark and two open-source agent implementations. Their more advanced agent, PinATA, achieved a True Accuracy 50% higher than that of a baseline, though a qualitative analysis revealed five categories of persistent limitations.
