Ring Team
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
18 Jun 2025

Ring-lite, developed by Inclusion AI's Ling Team, presents an open-source multi-domain reasoning model based on a Mixture-of-Experts (MoE) architecture, achieving strong performance while activating fewer parameters. It introduces C3PO, an algorithm-system co-design, to stabilize reinforcement learning (RL) training and mitigate 'reward collapse' in LLMs.

View blog
Resources105
There are no more papers matching your filters at the moment.