Ask or search anything...

History

Events

Watch Recordings

AI for Law01/09 · Joel Niklaus · Hugging Face

Papers Benchmarks

Hot

Ring Team

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

18 Jun 2025

x jw

Inclusion AI Ring Team

Ring-lite, developed by Inclusion AI's Ling Team, presents an open-source multi-domain reasoning model based on a Mixture-of-Experts (MoE) architecture, achieving strong performance while activating fewer parameters. It introduces C3PO, an algorithm-system co-design, to stabilize reinforcement learning (RL) training and mitigate 'reward collapse' in LLMs.

View blog

#computer-science #artificial-intelligence #computation-and-language

Resources 105

428

There are no more papers matching your filters at the moment.

alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Browser Extension

Dark mode

Ask or search anything...

Events