alphaXiv

Nanbeige LLM Lab

06 Dec 2025

chain-of-thought computer-science computation-and-language

Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models

The Nanbeige4-3B model family from the Nanbeige LLM Lab at Boss Zhipin introduces a 3-billion-parameter language model that consistently outperforms much larger open-source models, setting new state-of-the-art averages in mathematical and scientific reasoning. This performance is achieved through a multi-stage training pipeline incorporating advanced data filtering, a fine-grained learning rate scheduler, dual-level preference distillation, and multi-stage reinforcement learning.

There are no more papers matching your filters at the moment.

Events

AI for Law
Joel Niklaus· Hugging Face
01/09
Register
Watch recordings

Personalize Your Feed

Install Browser Extension

We're hiring

alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Dark mode

Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models

Events

AI for Law

Personalize Your Feed