DataPerf introduces an open-source benchmark suite designed to reorient machine learning research by fostering innovation in data-centric AI. This suite, managed by MLCommons, provides a standardized platform for evaluating data curation, selection, cleaning, and acquisition techniques, with initial baselines across diverse tasks demonstrating considerable scope for improvement in data quality.
Researchers from Stanford University and DeepLearning.AI developed STARC-9, a large-scale, high-quality dataset for multi-class tissue classification in colorectal cancer (CRC) histopathology, alongside DeepCluster++, a semi-automated framework for its construction. Models trained on STARC-9 achieved up to 99% accuracy in tissue classification and significantly improved tumor segmentation Dice scores by 14-35% compared to existing public datasets.
There are no more papers matching your filters at the moment.