Kolmogorov–Arnold Networks (KANs) are introduced as a novel neural network architecture, leveraging the Kolmogorov–Arnold Representation Theorem by placing learnable univariate activation functions on network edges. This design achieves superior accuracy and parameter efficiency compared to Multi-Layer Perceptrons on various function approximation and scientific tasks, while providing intrinsic interpretability for discovering underlying mathematical laws.
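The core idea is that every scalar weight in an MLP is replaced by a small learnable function of one variable, living on the edge between two neurons. The sketch below illustrates that design in PyTorch, using a fixed Gaussian-RBF basis for the edge functions plus a SiLU residual term; the original work parameterizes the edges with B-splines, so the basis choice, layer sizes, and initialization here are illustrative assumptions rather than the authors' implementation.

```python
# Minimal sketch of a KAN layer. Edge functions use a fixed Gaussian RBF basis
# (the paper uses B-splines); hyperparameters here are illustrative only.
import torch
import torch.nn as nn
import torch.nn.functional as F

class KANLayer(nn.Module):
    def __init__(self, in_dim: int, out_dim: int, num_basis: int = 8):
        super().__init__()
        # Fixed RBF centers on [-2, 2]; each edge (i, j) learns its own
        # coefficients over this basis, so the activation lives on the edge.
        self.register_buffer("centers", torch.linspace(-2.0, 2.0, num_basis))
        self.coef = nn.Parameter(torch.randn(in_dim, out_dim, num_basis) * 0.1)
        self.base = nn.Parameter(torch.randn(in_dim, out_dim) * 0.1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, in_dim) -> per-input RBF features: (batch, in_dim, num_basis)
        phi = torch.exp(-((x.unsqueeze(-1) - self.centers) ** 2))
        # Node j sums its learned univariate edge functions phi_ij(x_i) over i.
        spline = torch.einsum("bik,iok->bo", phi, self.coef)
        residual = torch.einsum("bi,io->bo", F.silu(x), self.base)
        return spline + residual

# A two-layer KAN for a toy regression input.
model = nn.Sequential(KANLayer(2, 5), KANLayer(5, 1))
x = torch.randn(16, 2)
print(model(x).shape)  # torch.Size([16, 1])
```

Summing learned per-edge functions at each node is what distinguishes this from an MLP layer, where a fixed nonlinearity is applied only after a linear mix of the inputs.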
SegFormer presents an efficient and robust Transformer-based framework for semantic segmentation, outperforming prior methods in accuracy while significantly reducing model size and computational cost. The model achieves state-of-the-art results on ADE20K, Cityscapes, and COCO-Stuff, showcasing superior efficiency and robustness to common corruptions.
A comprehensive survey by researchers from Shanghai AI Lab and various global institutions outlines the intricate relationship between scientific large language models (Sci-LLMs) and their data foundations, tracing their evolution towards autonomous agents for scientific discovery. The paper establishes a taxonomy for scientific data and knowledge, meticulously reviews over 270 datasets and 190 benchmarks, and identifies critical data challenges alongside future paradigms.
DREAMGEN introduces a pipeline that repurposes video world models as scalable synthetic data generators for robot learning, reducing the reliance on extensive human teleoperation. This approach allows robot policies to generalize to 22 novel behaviors and operate successfully in 10 previously unseen environments, starting from a minimal initial real-world dataset.
Microsoft COCO introduces a large-scale dataset of images featuring common objects in diverse, cluttered contexts, precisely annotated with per-instance segmentation masks. This resource was designed to advance object recognition, contextual reasoning, and fine-grained localization beyond previous benchmarks, driving the development of more robust computer vision models.
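For readers who want to work with those per-instance masks directly, the snippet below shows the standard pycocotools workflow for loading annotations and rasterizing masks; the annotation-file path is a placeholder for a local copy of the dataset.

```python
# Sketch of reading COCO per-instance masks with the pycocotools API;
# the annotation-file path below is a placeholder for a local COCO download.
from pycocotools.coco import COCO

coco = COCO("annotations/instances_val2017.json")  # placeholder path

img_id = coco.getImgIds()[0]                       # first image in the split
ann_ids = coco.getAnnIds(imgIds=img_id, iscrowd=None)
anns = coco.loadAnns(ann_ids)

for ann in anns:
    mask = coco.annToMask(ann)                     # (H, W) binary instance mask
    category = coco.loadCats(ann["category_id"])[0]["name"]
    print(category, mask.sum(), "foreground pixels")
```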
A comprehensive survey systematically reviews advancements in feed-forward 3D reconstruction and view synthesis since 2020, categorizing methods by underlying scene representations such as NeRF, pointmaps, and 3D Gaussian Splatting. It details how deep learning has enabled significantly faster and more generalizable 3D vision, highlighting diverse applications and critical open research challenges.
Researchers from Texas A&M University, Stanford University, and others developed 4KAgent, a multi-agent AI system that universally upscales any image to 4K resolution, adapting to diverse degradations and domains without retraining. The agentic framework achieved state-of-the-art perceptual and fidelity metrics across 26 distinct benchmarks, including natural, AI-generated, and various scientific imaging types.
A research team from Stanford University's Hazy Research lab developed "Cartridges," a method to create lightweight, general-purpose representations of long text corpora for Large Language Models. This approach reduces memory usage by 38.6x and boosts inference throughput by 26.4x compared to standard in-context learning, while maintaining or improving performance on long-context tasks.
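A cartridge is described as a trained, compact stand-in for a long context's key/value cache, a mechanism that resembles prefix tuning: instead of recomputing a huge KV cache for the corpus at every request, a short block of key/value vectors is trained once offline and attended to at inference. The sketch below shows that prefix-style attention pattern in PyTorch as a generic stand-in; the shapes, names, and the omitted offline training loop that would fit `cart_k`/`cart_v` are assumptions, not the authors' code.

```python
# Sketch of the core idea: replace a long document's huge KV cache with a short
# block of *trained* key/value vectors that queries attend to. Generic
# prefix-tuning-style stand-in, not the Cartridges implementation.
import torch
import torch.nn.functional as F

d, prefix_len, seq_len = 64, 32, 8

# Trainable cartridge: a compact substitute for the corpus's full KV cache.
cart_k = torch.nn.Parameter(torch.randn(prefix_len, d) * 0.02)
cart_v = torch.nn.Parameter(torch.randn(prefix_len, d) * 0.02)

def attend_with_cartridge(q, k, v):
    """Single-head attention with the cartridge prepended to keys/values."""
    k_all = torch.cat([cart_k, k], dim=0)   # (prefix_len + seq, d)
    v_all = torch.cat([cart_v, v], dim=0)
    scores = q @ k_all.T / d ** 0.5
    return F.softmax(scores, dim=-1) @ v_all

q, k, v = (torch.randn(seq_len, d) for _ in range(3))
print(attend_with_cartridge(q, k, v).shape)  # torch.Size([8, 64])
```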
A collaboration between AMAP, Alibaba Group, NVIDIA, and Caltech presents EPG, a self-supervised pre-training framework that advances end-to-end pixel-space generative modeling. EPG outperforms prior pixel-space methods in image quality and efficiency, rivals latent-space models, and enables the first VAE-free consistency-model training on high-resolution images.
EUREKA, developed by NVIDIA and academic partners, is an algorithm that leverages Large Language Models to automatically design executable reward functions for reinforcement learning agents. It successfully generates human-level reward code that outperforms manually engineered rewards across various robotic tasks, including complex dexterous manipulation like pen spinning, significantly reducing the bottleneck of reward engineering.
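In outline, EUREKA runs an evolutionary search driven by the LLM: sample several candidate reward functions as executable code, score each by training a policy with it, and feed the best candidate's training statistics back into the next prompt. The sketch below captures that loop; `llm_generate` and `train_and_evaluate` are hypothetical stubs standing in for the actual model call and RL training run.

```python
# Sketch of an EUREKA-style evolutionary loop: an LLM drafts reward functions
# as code, each is scored by training a policy, and feedback from the best one
# seeds the next round. The stubs below are placeholders, not NVIDIA's code.

def llm_generate(prompt: str) -> str:
    """Placeholder for an LLM call that returns reward-function source code."""
    return "def reward(state, action):\n    return -abs(state[0])"

def train_and_evaluate(reward_code: str) -> float:
    """Placeholder: train an RL policy with this reward, return task success."""
    return 0.5

def eureka_loop(task_description: str, iterations: int = 3, samples: int = 4):
    feedback = ""
    best_code, best_score = None, float("-inf")
    for _ in range(iterations):
        prompt = f"Task: {task_description}\n{feedback}\nWrite a reward function."
        candidates = [llm_generate(prompt) for _ in range(samples)]
        score, code = max((train_and_evaluate(c), c) for c in candidates)
        if score > best_score:
            best_score, best_code = score, code
        # Reflect training results back into the next prompt ("reward reflection").
        feedback = f"Previous best reward scored {score:.2f}; improve it."
    return best_code, best_score

print(eureka_loop("spin a pen with a dexterous hand"))
```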
VLASH introduces a future-state-aware asynchronous inference framework that enables Vision-Language-Action (VLA) models to achieve real-time, smooth, and fast-reaction control in robotics. It allows VLAs to successfully perform dynamic tasks like playing ping-pong with a human, providing significant speedups and improved accuracy without modifying existing model architectures.
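The key trick is latency hiding: while the robot executes the current action chunk, the model already infers the next chunk, conditioned on the state the robot is expected to reach by the time inference completes. The toy loop below illustrates that pattern; the policy, state extrapolation, and environment step are hypothetical placeholders, not the VLASH implementation.

```python
# Sketch of future-state-aware asynchronous inference: while the robot executes
# chunk t, the policy predicts chunk t+1 against the state expected at hand-off
# time. All functions are hypothetical stand-ins for the real components.
import threading

def predict_chunk(policy, expected_state):
    """Placeholder VLA call: returns a list of low-level actions."""
    return policy(expected_state)

def extrapolate_state(state, chunk, latency_steps):
    """Placeholder: roll the current plan forward to the expected hand-off state."""
    return state + sum(chunk[:latency_steps])

def async_control_loop(policy, state, steps=5, latency_steps=2):
    chunk = predict_chunk(policy, state)
    for _ in range(steps):
        # Kick off inference for the *next* chunk against the future state.
        future_state = extrapolate_state(state, chunk, latency_steps)
        result = {}
        worker = threading.Thread(
            target=lambda: result.update(chunk=predict_chunk(policy, future_state)))
        worker.start()
        for action in chunk:          # execute current chunk while inferring
            state = state + action    # placeholder environment step
        worker.join()
        chunk = result["chunk"]
    return state

toy_policy = lambda s: [0.1, 0.1, 0.1]   # constant toy action chunk
print(async_control_loop(toy_policy, state=0.0))
```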