alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Browser Extension

We're hiring
PaperBlogResources

FlowRL: Matching Reward Distributions for LLM Reasoning

BibTex
Copy
@misc{zhu2025flowrlmatchingreward,
      title={FlowRL: Matching Reward Distributions for LLM Reasoning},
      author={Xuekai Zhu and Daixuan Cheng and Dinghuai Zhang and Hengli Li and Kaiyan Zhang and Che Jiang and Youbang Sun and Ermo Hua and Yuxin Zuo and Xingtai Lv and Qizheng Zhang and Lin Chen and Fanghao Shao and Bo Xue and Yunchong Song and Zhenjie Yang and Ganqu Cui and Ning Ding and Jianfeng Gao and Xiaodong Liu and Bowen Zhou and Hongyuan Mei and Zhouhan Lin},
      year={2025},
      eprint={2509.15207},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2509.15207},
}
GitHub
FlowRL
92
HTTPS
https://github.com/Xuekai-Zhu/FlowRL
SSH
git@github.com:Xuekai-Zhu/FlowRL.git
CLI
gh repo clone Xuekai-Zhu/FlowRL
Transform this paper into an audio lecture
Get an engaging lecture and Q&A format to quickly understand the paper in minutes, perfect for learning on the go.
Audio lecture
Q&A format