alphaXiv

Chinese Academy of Sciences

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

08 Nov 2025

University of Illinois at Urbana-Champaign

University of California, Santa Barbara

A comprehensive survey formally defines Agentic Reinforcement Learning (RL) for Large Language Models (LLMs) as a Partially Observable Markov Decision Process (POMDP), distinct from conventional LLM-RL, and provides a two-tiered taxonomy of capabilities and task domains. The work consolidates open-source resources and outlines critical open challenges for the field.

#agentic-frameworks #agents #computer-science

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

25 Aug 2025

Huawei Noah’s Ark Lab

Chinese Academy of Sciences

Researchers from UCL AI Centre and Huawei Noah’s Ark Lab developed Memento, a memory-based learning framework enabling LLM agents to continually adapt and improve without fine-tuning their underlying large language models. The framework achieved top performance on complex benchmarks, including 87.88% Pass@3 on GAIA and 95.0% accuracy on SimpleQA, demonstrating efficient, robust adaptation and generalization.

#agentic-frameworks #agents #computer-science

Resources 1,566

A Survey of Context Engineering for Large Language Models

21 Jul 2025

Chinese Academy of Sciences

Tsinghua University

Mei et al. formalize "Context Engineering" as a systematic discipline for optimizing information supplied to Large Language Models, proposing a comprehensive taxonomy that unifies fragmented research domains. Their analysis identifies a critical "comprehension-generation asymmetry," where LLMs demonstrate strong understanding but limitations in generating equally sophisticated long-form outputs.

#computer-science #computation-and-language #human-ai-interaction

Resources 2,332

Ultrafast Dynamics of Bilayer and Trilayer Nickelate Superconductors

08 Mar 2024

Chinese Academy of Sciences

Tsinghua University

In addition to the pressurized high-temperature superconductivity, bilayer and trilayer nickelate superconductors La$_{n+1}$Ni$_n$O$_{3n+1}$ ($n = 2$ and $3$) exhibit many intriguing properties at ambient pressure, such as orbital-dependent electronic correlation, non-Fermi liquid behavior, and density-wave transitions. Here, using ultrafast reflectivity measurement, we observe a drastic difference between the ultrafast dynamics of the bilayer and trilayer nickelates at ambient pressure. Firstly, we observe a coherent phonon mode in La$_4$Ni$_3$O$_{10}$ involving the collective vibration of La, Ni, and O atoms, which is absent in La$_3$Ni$_2$O$_7$. Secondly, the temperature-dependent relaxation time diverges near the density-wave transition temperature of La$_4$Ni$_3$O$_{10}$, in drastic contrast to kink-like changes in La$_3$Ni$_2$O$_7$. Moreover, we estimate the electron-phonon coupling constants to be 0.05–0.07 and 0.12–0.16 for La$_3$Ni$_2$O$_7$ and La$_4$Ni$_3$O$_{10}$, respectively, suggesting a relatively minor role of electron-phonon coupling in the electronic properties of La$_{n+1}$Ni$_n$O$_{3n+1}$. Our work not only sheds light on the relevant microscopic interaction but also establishes a foundation for further studying the interplay between superconductivity and density-wave transitions in nickelate superconductors.

#strongly-correlated-electrons #superconductivity #physics

Driving factors behind multiple populations

31 Jan 2024

University of Toronto

Chinese Academy of Sciences

Star clusters were historically considered simple stellar populations, with all stars sharing the same age and initial chemical composition. However, the presence of chemical anomalies in globular clusters (GCs), called multiple stellar populations (MPs), has challenged star formation theories in dense environments. Literature studies show that mass, metallicity, and age are likely controlling parameters for the manifestation of MPs. Identifying the limit between clusters with/without MPs in physical parameter space is crucial to reveal the driving mechanism behind their presence. In this study, we look for MP signals in Whiting 1, traditionally considered a young GC. Using the Magellan telescope, we obtained low-resolution spectra within $\lambda\lambda = 3850$–$5500$ Å for eight giants of Whiting 1. We measured the C and N abundances from the CN and CH spectral indices. C and N abundances have variations comparable with their measurement errors ($\sim 0.1$ dex), suggesting that MPs are absent from Whiting 1. Combining these findings with literature studies, we propose a limit in the metallicity vs. cluster compactness index parameter space, which relatively clearly separates star clusters with/without MPs (GCs/open clusters). This limit is physically motivated. On a larger scale, the galactic environment determines cluster compactness and metallicity, leading to metal-rich, diffuse, old clusters formed ex situ. Our proposed limit also impacts our understanding of the formation of the Sagittarius dwarf galaxy: star clusters formed after the first starburst (age $\lesssim 8$–$10$ Gyr). These clusters are simple stellar populations because the enriched galactic environment is no longer suitable for MP formation.

#astrophysics-of-galaxies #physics

Deep Generative Demand Learning for Newsvendor and Pricing

13 Nov 2024

Chinese Academy of Sciences

University of Science and Technology of China

Researchers from the University of Science and Technology of China and the Chinese Academy of Sciences developed a framework utilizing conditional deep generative models (cDGMs) to learn demand distributions influenced by price and contextual features. This approach enables robust, data-driven optimization of inventory and pricing decisions, demonstrating superior profitability and asymptotic optimality compared to traditional methods in both simulations and a real-world case study.

#computer-science #machine-learning #deep-reinforcement-learning
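
The inventory half of the summary above can be illustrated with the classical newsvendor rule: when demand scenarios are available as samples, the order quantity that maximizes average profit is the empirical quantile of demand at the critical ratio (price - cost) / price. The sketch below is a minimal illustration of that rule only, not the paper's method. A plain list of numbers stands in for samples drawn from a conditional generative model, and the names `newsvendor_order` and `mean_profit`, as well as the price and cost values, are hypothetical.

```python
import math
import random


def newsvendor_order(demand_samples, price, unit_cost):
    """Return the sample-based newsvendor order quantity.

    This is the smallest sampled demand whose empirical CDF reaches the
    critical ratio (price - unit_cost) / price. Unsold stock is assumed
    to have no salvage value (an assumption of this sketch).
    """
    if not 0 < unit_cost < price:
        raise ValueError("need 0 < unit_cost < price")
    if not demand_samples:
        raise ValueError("need at least one demand sample")
    critical_ratio = (price - unit_cost) / price
    ordered = sorted(demand_samples)
    # Index of the empirical quantile at the critical ratio.
    k = max(math.ceil(critical_ratio * len(ordered)) - 1, 0)
    return ordered[k]


def mean_profit(order, demand_samples, price, unit_cost):
    """Average profit of a fixed order quantity over the demand samples."""
    total = sum(price * min(order, d) - unit_cost * order for d in demand_samples)
    return total / len(demand_samples)


if __name__ == "__main__":
    random.seed(0)
    # Hypothetical stand-in for generative-model output: demand scenarios
    # for one fixed price and context, truncated at zero.
    scenarios = [max(0.0, random.gauss(100.0, 20.0)) for _ in range(5000)]
    q = newsvendor_order(scenarios, price=10.0, unit_cost=4.0)
    print("order quantity:", round(q, 1))
    print("mean profit:", round(mean_profit(q, scenarios, 10.0, 4.0), 1))
```

In a pricing setting one would repeat this for each candidate price, drawing fresh scenarios conditioned on that price, and keep the price and order pair with the highest average profit; that outer loop is omitted here.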

Spatially Randomized Designs Can Enhance Policy Evaluation

18 Mar 2024

Chinese Academy of Sciences

Peking University

This article studies the benefits of using spatially randomized experimental designs which partition the experimental area into distinct, non-overlapping units with treatments assigned randomly. Such designs offer improved policy evaluation in online experiments by providing more precise policy value estimators and more effective A/B testing algorithms than traditional global designs, which apply the same treatment across all units simultaneously. We examine both parametric and nonparametric methods for estimating and inferring policy values based on this randomized approach. Our analysis includes evaluating the mean squared error of the treatment effect estimator and the statistical power of the associated tests. Additionally, we extend our findings to experiments with spatio-temporal dependencies, where treatments are allocated sequentially over time, and account for potential temporal carryover effects. Our theoretical insights are supported by comprehensive numerical experiments.

Automatic State Machine Inference for Binary Protocol Reverse Engineering

03 Dec 2024

Chinese Academy of Sciences

Researchers at the Institute of Information Engineering, Chinese Academy of Sciences, developed an automatic framework to infer Protocol State Machines (PSMs) for unknown network protocols directly from mixed traffic environments. This framework accurately clusters protocol formats and sessions, then reconstructs PSMs for both binary and text-based protocols like TLSv1.2 and SMTP, achieving high matching coefficients of 1.0 and 0.91/0.86 respectively.

#clustering-algorithms #computer-science #cryptography-and-security

CSST Strong Lensing Preparation: Forecasting the galaxy-galaxy strong lensing population for the China Space Station Telescope

30 Jul 2024

Chinese Academy of Sciences

Beijing Normal University

Galaxy-galaxy strong gravitational lensing (GGSL) is a powerful probe for the formation and evolution of galaxies and cosmology, while the sample size of GGSLs leads to considerable uncertainties and potential bias. The China Space Station Telescope (CSST, to be launched in late 2026) will conduct observations across 17,500 square degrees of the sky, capturing images in the $ugriz$ bands with a spatial resolution comparable to that of the Hubble Space Telescope. We ran a set of Monte Carlo simulations to predict that the CSST's wide-field survey will observe $\sim$160,000 galaxy-galaxy strong lenses over its lifespan, increasing the number of existing galaxy-galaxy strong lens samples by three orders of magnitude. This is comparable to the capabilities of the Euclid telescope but with the added benefit of additional color information. Specifically, the CSST can detect strong lenses with Einstein radii about $0.64 \pm 0.42''$, corresponding to the velocity dispersions of $217.19 \pm 50.55 \, \text{km/s}$. These lenses exhibit a median magnification of $\sim$5. The apparent magnitude of the unlensed sources in the g-band is $25.87 \pm 1.19$. The signal-to-noise ratio of the lensed images covers a range of $\sim 20$ to $\sim 1000$, allowing us to determine the Einstein radius with an accuracy ranging from $\sim 1\%$ to $\sim 0.1\%$, ignoring various modeling systematics. Our estimates indicate that CSST can observe rare systems like double source-plane and spiral galaxy lenses. The above selection functions of the CSST strong lensing observation help optimize the strategy of finding and modeling GGSLs.

#cosmology-and-nongalactic-astrophysics #astrophysics-of-galaxies #physics

An Analysis for Image-to-Image Translation and Style Transfer

12 Aug 2024

Chinese Academy of Sciences

Researchers from the Chinese Academy of Sciences provide a systematic analysis differentiating image-to-image translation (I2I) and style transfer (ST), two foundational generative AI techniques. The analysis clarifies their distinct concepts, forms, training modes, and evaluation processes, offering a framework to address existing confusion in the research community.

#computer-science #computer-vision-and-pattern-recognition #image-and-video-processing

BEACON: JWST NIRCam Pure-parallel Imaging Survey. I. Survey Design and Initial Results

05 Dec 2024

Tohoku University

California Institute of Technology

We introduce the Bias-free Extragalactic Analysis for Cosmic Origins with NIRCam (BEACON) survey, a JWST Cycle 2 program allocated up to 600 pure-parallel hours of observations. BEACON explores high-latitude areas of the sky with JWST/NIRCam over $\sim 100$ independent sightlines, totaling $\sim 0.3$ deg$^2$, reaching a median F444W depth of $\approx 28.2$ AB mag ($5\sigma$). Based on existing JWST observations in legacy fields, we estimate that BEACON will photometrically identify 25–150 galaxies at $z > 10$ and 500–1000 at $z \sim 7$–$10$, uniquely enabled by an efficient multiple filter configuration spanning $0.9$–$5.0\,\mu$m. The expected sample size of $z > 10$ galaxies will allow us to obtain robust number density estimates and to discriminate between different models of early star formation. In this paper, we present an overview of the survey design and initial results using the first 19 fields. We present 129 galaxy candidates at $z > 7$ identified in those fields, including 11 galaxies at $z > 10$ and several UV-luminous ($M_{\rm UV} < -21$ mag) galaxies at $z \sim 8$. The number densities of $z < 13$ galaxies inferred from the initial fields are overall consistent with those in the literature. Despite reaching a considerably large volume ($\sim 10^5$ Mpc$^3$), however, we find no galaxy candidates at $z > 13$, providing us with a complementary insight into early galaxy evolution with minimal cosmic variance. We publish imaging and catalog data products for these initial fields. Upon survey completion, all BEACON data will be coherently processed and distributed to the community along with catalogs for redshift and other physical quantities.

#astrophysics-of-galaxies #physics

Quantum Compiling with Reinforcement Learning on a Superconducting Processor

18 Jun 2024

Wuhan University

Chinese Academy of Sciences

Researchers developed and experimentally validated a reinforcement learning-based quantum compiler on a 9-qubit superconducting processor, demonstrating its ability to find shorter, hardware-optimized quantum circuits. This approach achieved superior experimental fidelities on noisy intermediate-scale quantum (NISQ) devices compared to conventional compilation methods, notably reducing the 3-qubit Quantum Fourier Transform to just seven CZ gates.

#computer-science #machine-learning #optimization-methods

Celestial CFT from CHY Formalism: Center Charge and Finite Size Effect

22 Feb 2024

Chinese Academy of Sciences

Scattering amplitudes in gauge theories can be calculated either by bulk theories in 4d Minkowski space-time ($\mathrm{Mink}_4$), or perceived as the correlation functions in celestial CFT (CCFT) living in the celestial sphere at null infinity, where an infinite-dimensional asymptotic symmetry, the BMS group, resides. Another well-developed method is the CHY formalism, which formulates the scattering amplitude in terms of the correlation functions on a 2d world sheet, on which an ambitwistor string theory is defined. The relationship between CHY theory and CCFT is encoded in scattering equations, which are algebraic equations that lack analytical solutions in general. So we start from the CHY formalism, take the collinear limit, then find a nice operator formalism for the CCFT. In particular, the center charge $c$ is calculated to be 36 for the CCFT related to the 4d Yang-Mills theory. It then follows that the 4d cosmological constant naturally arises as the finite size effect in 2d CCFT, which is calculated by the method of the $T\bar{T}$-perturbed CCFT.

#high-energy-physics-theory #physics

Why Do MLLMs Struggle with Spatial Understanding? A Systematic Analysis from Data to Architecture

02 Sep 2025

Chinese Academy of Sciences

A systematic analysis from researchers at Chinese Academy of Sciences and Tsinghua University investigates why Multimodal Large Language Models struggle with spatial understanding, introducing a novel multi-view spatial understanding benchmark (MulSeT). The study reveals that architectural design limitations, particularly in visual encoder positional encodings, are more constraining than insufficient training data for complex spatial reasoning.

#attention-mechanisms #computer-science #computer-vision-and-pattern-recognition

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

27 Mar 2025

Junyu Luo

University of Washington

Harvard University

A comprehensive survey from an international research consortium led by Peking University examines Large Language Model (LLM) agents through a methodology-centered taxonomy, analyzing their construction, collaboration mechanisms, and evolution while providing a unified architectural framework for understanding agent systems across different application domains.

#agentic-frameworks #agents #computer-science

A Survey on LLM-as-a-Judge

19 Oct 2025

Chinese Academy of Sciences

Imperial College London

Researchers present a comprehensive survey of the "LLM-as-a-Judge" paradigm, providing formal definitions, a unified framework, and an empirical meta-evaluation to assess the reliability of large language models in evaluative roles. The work identifies effective strategies and highlights persistent biases and the need for advanced benchmarks.

#computer-science #artificial-intelligence #computation-and-language

A Survey of Vibe Coding with Large Language Models

14 Oct 2025

Chinese Academy of Sciences

Peking University

Researchers from ICT, CAS and collaborating institutions present the first comprehensive survey of Vibe Coding, a novel LLM-powered software development methodology, formalizing its processes and outlining five distinct development models. The work thoroughly analyzes the ecosystem's infrastructure, revealing critical challenges in human-AI collaboration and a shift in developer roles.

#agentic-frameworks #agents #computer-science

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving

14 Oct 2025

Chinese Academy of Sciences

DriveVLA-W0 integrates world modeling into Vision-Language-Action (VLA) models for autonomous driving, utilizing future image prediction as a dense self-supervision signal. This framework amplifies data scaling laws, enabling VLAs to achieve state-of-the-art performance and enhanced generalization by learning robust environmental representations.

#autonomous-vehicles #computer-science #artificial-intelligence

SpikingBrain: Spiking Brain-inspired Large Models

01 Dec 2025

Chinese Academy of Sciences

Beihang University

Mainstream Transformer-based large language models face major efficiency bottlenecks: training computation scales quadratically with sequence length, and inference memory grows linearly, limiting long-context processing. Building large models on non-NVIDIA platforms also poses challenges for stable and efficient training. To address this, we introduce SpikingBrain, a family of brain-inspired models designed for efficient long-context training and inference. SpikingBrain leverages the MetaX GPU cluster and focuses on three aspects: (1) Model Architecture: linear and hybrid-linear attention architectures with adaptive spiking neurons; (2) Algorithmic Optimizations: an efficient, conversion-based training pipeline and a dedicated spike coding framework; (3) System Engineering: customized training frameworks, operator libraries, and parallelism strategies tailored to MetaX hardware. Using these techniques, we develop two models: SpikingBrain-7B, a linear LLM, and SpikingBrain-76B, a hybrid-linear MoE LLM. These models demonstrate the feasibility of large-scale LLM development on non-NVIDIA platforms, and training remains stable for weeks on hundreds of MetaX GPUs with Model FLOPs Utilization at expected levels. SpikingBrain achieves performance comparable to open-source Transformer baselines while using only about 150B tokens for continual pre-training. Our models also significantly improve long-context efficiency and deliver inference with (partially) constant memory and event-driven spiking behavior. For example, SpikingBrain-7B attains over 100x speedup in Time to First Token for 4M-token sequences. Furthermore, the proposed spiking scheme achieves 69.15 percent sparsity, enabling low-power operation. Overall, this work demonstrates the potential of brain-inspired mechanisms to drive the next generation of efficient and scalable large model design.

#computer-science #artificial-intelligence #computation-and-language

Resources 1,022

From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence

06 Dec 2025

Monash University CSIRO

A comprehensive synthesis of Large Language Models for automated software development covers the entire model lifecycle, from data curation to autonomous agents, and offers practical guidance derived from empirical experiments on pre-training, fine-tuning, and reinforcement learning, alongside a detailed analysis of challenges and future directions.

#agentic-frameworks #agents #ai-for-cybersecurity
