OphNet is introduced as a large-scale video benchmark for ophthalmic surgical workflow understanding, comprising 2,278 videos (285 hours) with hierarchical, expert-annotated classifications for 66 surgery types, 102 phases, and 150 operations. This dataset addresses the scarcity of high-quality data in surgical AI, enabling advanced tasks like temporal localization and phase anticipation, with baseline experiments demonstrating strong performance, such as 66.1% Top-1 accuracy for phase classification using ViFi-CLIP.
View blogA conceptual framework introduces 'Agent for Science' (Agent4S) as the Fifth Scientific Paradigm, distinct from 'AI for Science' (AI4S), positing that LLM-driven agents can automate the entire scientific research workflow to overcome current productivity limitations. The paper defines a five-level hierarchy for Agent4S, progressing from single-tool automation to autonomous multi-laboratory collaboration, aiming to accelerate discovery.
View blog