Researchers at Harvard University introduced a predictive framework for Reinforcement Learning (RL) in Large Language Models (LLMs) using a sigmoidal compute-performance curve, enabling performance extrapolation from smaller runs. Their ScaleRL recipe, demonstrated over 100,000 GPU-hours, achieves an asymptotic reward of 0.61 on verifiable math problems, outperforming established methods while exhibiting predictable scaling across model size, generation length, and multi-task settings.
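A rough picture of how such extrapolation works: fit a saturating curve to reward-versus-compute points from small runs and read off the asymptote. The minimal sketch below assumes a generic sigmoid-style parameterization and made-up data points; it is not the paper's exact functional form or recipe.

```python
# Minimal sketch: fit a saturating compute-performance curve to small-run RL
# results and extrapolate the asymptotic reward. The parameterization
# R(C) = R0 + (A - R0) / (1 + (C_mid / C)**B) is an illustrative assumption.
import numpy as np
from scipy.optimize import curve_fit

def sigmoid_reward(compute, A, B, C_mid, R0):
    """Saturating reward curve in compute C; approaches asymptote A."""
    return R0 + (A - R0) / (1.0 + (C_mid / compute) ** B)

# Hypothetical (GPU-hours, mean reward) pairs from small runs.
compute = np.array([500, 1_000, 2_000, 4_000, 8_000, 16_000], dtype=float)
reward = np.array([0.22, 0.30, 0.38, 0.45, 0.50, 0.54])

params, _ = curve_fit(sigmoid_reward, compute, reward,
                      p0=[0.6, 1.0, 4_000.0, 0.1], maxfev=10_000)
A, B, C_mid, R0 = params
print(f"Estimated asymptotic reward A ~ {A:.2f}")
print(f"Predicted reward at 100k GPU-hours ~ {sigmoid_reward(1e5, *params):.2f}")
```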
Researchers at Harvard University developed power sampling, a training-free method leveraging the Metropolis-Hastings algorithm to sample from a sharpened distribution of a base large language model. This technique unlocks latent reasoning capabilities, achieving single-shot performance comparable to or exceeding reinforcement learning post-training methods across various tasks, while also preserving generation diversity.
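To illustrate the mechanism, the sketch below runs independence Metropolis-Hastings with the base model as the proposal and the sharpened distribution p(x)^alpha as the target, in which case the acceptance ratio reduces to (p(x')/p(x))^(alpha-1). The toy bigram "language model", vocabulary size, and alpha value are stand-ins, not the paper's actual setup.

```python
# Minimal sketch of power sampling via independence Metropolis-Hastings:
# propose from the base model p, target pi(x) proportional to p(x)**alpha.
# With proposal q = p, the acceptance ratio is (p(x') / p(x))**(alpha - 1).
import numpy as np

rng = np.random.default_rng(0)
VOCAB, LENGTH, ALPHA = 8, 10, 4.0
# Random row-stochastic bigram transition matrix as the toy base model.
T = rng.dirichlet(np.ones(VOCAB), size=VOCAB)

def sample_sequence():
    seq = [rng.integers(VOCAB)]
    for _ in range(LENGTH - 1):
        seq.append(rng.choice(VOCAB, p=T[seq[-1]]))
    return seq

def log_p(seq):
    return sum(np.log(T[a, b]) for a, b in zip(seq, seq[1:]))

current = sample_sequence()
accepted = 0
for _ in range(2_000):
    proposal = sample_sequence()
    log_accept = (ALPHA - 1.0) * (log_p(proposal) - log_p(current))
    if np.log(rng.random()) < log_accept:
        current, accepted = proposal, accepted + 1
print(f"acceptance rate: {accepted / 2_000:.2%}, final logp: {log_p(current):.2f}")
```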
Researchers from the University of Massachusetts Amherst and Harvard University developed BOTS, a method combining Batch Bayesian Optimization with Extended Thompson Sampling, to optimize adaptive interventions in severely episode-limited reinforcement learning environments. It effectively overcomes the myopic nature of standard contextual bandits and achieves superior performance in mobile health simulations, even with very few trials.
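As a rough illustration of the batch, episode-limited setting, the sketch below runs plain batch Thompson sampling over three candidate intervention policies with Beta-Bernoulli posteriors; BOTS's extended Thompson sampling and Bayesian-optimization surrogate are more involved than this, and the batch size and outcome model here are illustrative assumptions.

```python
# Minimal sketch of batch Thompson sampling over candidate intervention
# policies in a severely episode-limited regime (5 batches of 4 deployments).
import numpy as np

rng = np.random.default_rng(1)
true_success = np.array([0.35, 0.50, 0.65])   # hidden quality of 3 policies
alpha = np.ones(3)                            # Beta(1, 1) priors
beta = np.ones(3)

for _ in range(5):
    # Thompson step: sample plausible values per policy, pick per-deployment argmaxes.
    draws = rng.beta(alpha, beta, size=(4, 3))
    chosen = draws.argmax(axis=1)             # batch of 4 deployments
    outcomes = rng.random(4) < true_success[chosen]
    # Posterior update from the whole batch at once.
    for arm, success in zip(chosen, outcomes):
        alpha[arm] += success
        beta[arm] += 1 - success

print("posterior means:", (alpha / (alpha + beta)).round(2))
```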
A computational tool, AcrosticSleuth, identifies and probabilistically ranks acrostics in multilingual corpora, achieving F1 scores up to 0.66 on known Russian acrostics. This tool, developed by researchers from Tufts, UW-Madison, UT Austin, and Harvard, also led to the discovery of previously unrecognized acrostics in significant historical texts, including one in Thomas Hobbes' *The Elements of Law*.
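The core detection idea fits in a few lines: concatenate line-initial characters and test whether any window spells a word. The wordlist check below is only an illustrative stand-in for AcrosticSleuth's probabilistic ranking.

```python
# Minimal sketch of acrostic detection: read line-initial characters and flag
# windows whose concatenation matches a dictionary word.
text = """Surely some lines hide a word
Eager readers scan the start
Coded letters, one per row
Reading down reveals the key
Every acrostic starts this way
Till the final line is done"""

WORDLIST = {"secret", "hidden", "cipher"}
initials = "".join(line.lstrip()[0].lower() for line in text.splitlines() if line.strip())

hits = []
for start in range(len(initials)):
    for end in range(start + 4, len(initials) + 1):   # ignore very short matches
        if initials[start:end] in WORDLIST:
            hits.append((start, initials[start:end]))
print("initials:", initials)
print("acrostics found:", hits)
```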
A comprehensive survey from an international research consortium led by Peking University examines Large Language Model (LLM) agents through a methodology-centered taxonomy, analyzing their construction, collaboration mechanisms, and evolution while providing a unified architectural framework for understanding agent systems across different application domains.
Researchers at Harvard University, Google DeepMind, and collaborating institutions reverse-engineered successful Implicit Chain-of-Thought (ICoT) Transformers to understand why standard models fail at multi-digit multiplication. They discovered that ICoT models establish long-range dependencies through attention trees for partial product caching and represent digits using Fourier bases, findings that led to a simple auxiliary loss intervention enabling a standard Transformer to achieve 99% accuracy on 4-digit-by-4-digit multiplication.
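The reported Fourier digit representation is easy to picture: each base-10 digit maps to sine and cosine features at a few frequencies, which makes modular, carry-like structure linearly accessible. The frequencies below are an illustrative choice, not the ones recovered from the trained models.

```python
# Minimal sketch of a Fourier-basis encoding of base-10 digits:
# digit d -> [cos(2*pi*k*d/10), sin(2*pi*k*d/10)] over a few frequencies k.
import numpy as np

def fourier_digit_embedding(digit: int, freqs=(1, 2, 5)) -> np.ndarray:
    angles = 2 * np.pi * np.array(freqs) * digit / 10.0
    return np.concatenate([np.cos(angles), np.sin(angles)])

# Digits become distinct points on a few circles indexed by frequency.
emb = np.stack([fourier_digit_embedding(d) for d in range(10)])
print(emb.round(2))
```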
Equilibrium Matching (EqM) introduces a generative modeling framework that learns a time-invariant equilibrium gradient of an implicit energy landscape, enabling high-fidelity image generation without explicit time-conditioning. The method achieves an FID of 1.90 on ImageNet 256x256, outperforming leading diffusion and flow-based models, and supports flexible optimization-based sampling and intrinsic capabilities like out-of-distribution detection and image composition.
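The sampling procedure can be sketched as plain gradient descent against a time-invariant gradient field, with no time variable anywhere in the loop. The quadratic toy field below stands in for the trained network and is only meant to show the shape of optimization-based sampling.

```python
# Minimal sketch of optimization-based sampling with a time-invariant gradient
# field: start from noise and descend the learned equilibrium gradient until
# the sample settles. The quadratic field is a toy stand-in for a trained model.
import numpy as np

rng = np.random.default_rng(0)
data_mode = np.array([2.0, -1.0])                 # pretend data concentrates here

def equilibrium_gradient(x):
    """Toy stand-in for the learned gradient of an implicit energy."""
    return x - data_mode                          # grad of 0.5 * ||x - mode||^2

x = rng.standard_normal(2)                        # initialize from noise
step_size = 0.1
for _ in range(200):                              # no time-conditioning anywhere
    x = x - step_size * equilibrium_gradient(x)
print("sample after descent:", x.round(3))        # settles near data_mode
```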
OpenAI researchers and collaborators evaluate GPT-5's utility in accelerating scientific research across diverse fields, demonstrating its capacity to rediscover known results, accelerate literature search, support collaborative problem-solving, and generate novel scientific findings. The model was shown to compress research timelines from months to hours and provided verifiable new insights in mathematics, physics, and biology.
This paper introduces Energy-Based Transformers (EBTs), a new class of models that enable scalable System 2 thinking through unsupervised learning by reframing prediction as an optimization process over a learned energy function. EBTs demonstrate superior scaling rates compared to standard Transformers in language and video, improve performance by up to 29% with increased inference-time computation, and achieve better generalization on out-of-distribution data across diverse modalities.
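The System 2 angle comes from treating inference as energy minimization over the prediction, so extra compute means extra refinement steps. The bilinear toy energy and fixed weights below are assumptions standing in for a trained EBT.

```python
# Minimal sketch of an EBT-style inference loop: minimize a learned energy
# E(context, y) over the candidate prediction y; more steps = more "thinking".
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((4, 4)) * 0.1            # "learned" interaction weights
context = rng.standard_normal(4)

def energy(context, y):
    return 0.5 * np.sum((y - context @ W) ** 2)  # toy verifier-style energy

def grad_y(context, y):
    return y - context @ W                       # analytic gradient of the toy energy

y = np.zeros(4)                                  # initial guess for the prediction
for _ in range(50):                              # inference-time refinement steps
    y -= 0.2 * grad_y(context, y)
print("energy after refinement:", round(float(energy(context, y)), 6))
```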
Representation Entanglement for Generation (REG) introduces an image-class denoising paradigm, achieving up to 63x faster training for Diffusion Transformers while setting a new FID record of 1.8 on ImageNet 256x256 by structurally integrating a high-level class token with image latents.
TOOLUNIVERSE establishes an open-source ecosystem that standardizes AI-tool interaction and empowers AI scientists to autonomously discover, create, optimize, and compose scientific tools. The platform successfully demonstrated its utility in a therapeutic discovery case study, identifying a novel drug candidate for hypercholesterolemia with validated properties.
A theoretical framework, supported by empirical simulations, clarifies how reinforcement learning (RL) applied after next-token prediction facilitates reasoning in Large Language Models (LLMs). The work shows RL effectively up-samples rare, high-quality chain-of-thought demonstrations, leading to rapid generalization and a concurrent increase in response length.
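The up-sampling mechanism can be reproduced in a toy simulation: start a policy with most mass on a short, unrewarded response and a sliver of mass on a longer, correct chain-of-thought, then apply plain REINFORCE updates. The rewards, lengths, and learning rate below are illustrative, not taken from the paper.

```python
# Toy simulation: REINFORCE-style updates up-sample a rare, rewarded, longer
# chain-of-thought, so accuracy and mean response length rise together.
import numpy as np

rng = np.random.default_rng(0)
logits = np.array([3.0, 0.0])       # [short guess, long correct CoT]
rewards = np.array([0.0, 1.0])
lengths = np.array([5, 40])
lr = 0.5

for _ in range(60):
    probs = np.exp(logits) / np.exp(logits).sum()
    action = rng.choice(2, p=probs)
    # REINFORCE: grad of log pi(action) w.r.t. logits is one_hot(action) - probs.
    logits += lr * rewards[action] * (np.eye(2)[action] - probs)

probs = np.exp(logits) / np.exp(logits).sum()
print("P(correct CoT):", probs[1].round(3))
print("expected response length:", (probs @ lengths).round(1))
```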
FlexMDMs enable discrete diffusion models to generate sequences of variable lengths and perform token insertions by extending the stochastic interpolant framework with a novel joint interpolant for continuous-time Markov chains. The approach more accurately models length distributions, boosts planning task success rates by nearly 60%, and improves performance on math and code infilling tasks by up to 13% after efficient retrofitting of existing large-scale MDMs.
Researchers at the University of Washington, Google Research, and Harvard University developed Matryoshka Representation Learning, a method that encodes multi-fidelity information within a single embedding so that nested prefixes of increasing length serve as progressively finer-grained representations. This approach enables substantial efficiency gains in large-scale classification and retrieval, achieving accuracy comparable to full-dimensional models with significantly reduced computational cost and memory footprint.
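At inference time the idea reduces to prefix truncation: one full embedding yields nested coarse-to-fine representations. The random vectors below stand in for embeddings from an MRL-trained encoder, which is what actually makes short prefixes informative; the demo just shows how truncated-prefix retrieval is wired up.

```python
# Minimal sketch of Matryoshka-style usage: truncate one embedding to nested
# prefix lengths and run cosine-similarity retrieval at each fidelity.
import numpy as np

rng = np.random.default_rng(0)
DIM, NESTED_DIMS = 256, (16, 64, 256)
corpus = rng.standard_normal((1_000, DIM))            # stand-in document embeddings
query = corpus[42] + 0.1 * rng.standard_normal(DIM)   # near-duplicate of item 42

def top1(query, corpus, d):
    q, C = query[:d], corpus[:, :d]                   # truncate to the prefix
    scores = (C @ q) / (np.linalg.norm(C, axis=1) * np.linalg.norm(q))
    return int(scores.argmax())

for d in NESTED_DIMS:
    print(f"dim {d:3d}: top-1 = {top1(query, corpus, d)}")
```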