This study from the National Research Council of Canada and McGill University comprehensively evaluates text classification methods in the LLM era, focusing on data scarcity and multilingual performance. It demonstrates that zero-shot LLMs excel in sentiment analysis, few-shot fine-tuning improves performance for complex tasks, and synthetic data can serve as an effective alternative to labeled data across eight languages.
View blogThe Euclid Collaboration developed a strong lensing discovery engine combining machine learning, citizen science, and expert assessment, leading to the identification of 497 strong gravitational lens candidates from the Euclid Quick Data Release 1. This includes 243 previously unpublished high-confidence candidates and demonstrates a detection rate of 20.3 lens candidates per square degree, with a significant number having small Einstein radii below 1 arcsecond.
View blogThe mdCATH dataset provides extensive all-atom molecular dynamics simulations for over 5,000 protein domains from the CATH classification system, capturing protein dynamics across five temperatures and five independent replicas. This unique resource includes instantaneous atomic forces alongside coordinates, enabling the training of next-generation machine learning potentials for computational biophysics and facilitating proteome-wide statistical analyses of protein unfolding.
View blogThis research from the University of Pisa and Sant’Anna School of Advanced Studies introduces a Continual Policy Distillation framework that enables a single soft robotic controller to learn dexterous in-hand manipulation of various objects sequentially. The framework effectively consolidates knowledge from multiple object-specific expert policies, mitigating catastrophic forgetting and achieving high manipulation performance on diverse objects, with replay-based strategies like Reward Prioritized Experience Replay showing performance comparable to training on all data simultaneously.
View blogA deep learning approach for outdoor environment reconstruction, developed at Yerevan State University and CNR Pisa, exclusively leverages ambient radio frequency signals, demonstrating a viable alternative to vision-based methods. The transformer-based model achieved an IoU of 42.2% and a Chamfer distance of 18.3m on a synthetic dataset, showing improved resilience and scalability for mapping solutions.
View blog