LIP, Department of Physics, University of Coimbra
We develop a framework for learning properties of quantum states beyond the assumption of independent and identically distributed (i.i.d.) input states. We prove that, given any learning problem (under reasonable assumptions), an algorithm designed for i.i.d. input states can be adapted to handle input states of any nature, albeit at the expense of a polynomial increase in training data size (aka sample complexity). Importantly, this polynomial increase in sample complexity can be substantially improved to polylogarithmic if the learning algorithm in question only requires non-adaptive, single-copy measurements. Among other applications, this allows us to generalize the classical shadow framework to the non-i.i.d. setting while only incurring a comparatively small loss in sample efficiency. We use rigorous quantum information theory to prove our main results. In particular, we leverage permutation invariance and randomized single-copy measurements to derive a new quantum de Finetti theorem that mainly addresses measurement outcome statistics and, in turn, scales much more favorably in Hilbert space dimension.
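For context, the following is a minimal sketch of the standard i.i.d. classical shadow estimator that the abstract refers to, using random single-qubit Pauli-basis measurements (non-adaptive, single-copy). It is not taken from the paper and does not implement the non-i.i.d. adaptation; the function names and the toy state are illustrative assumptions.

```python
import numpy as np

# Rotations that map each Pauli eigenbasis onto the computational (Z) basis.
PAULI_BASES = {
    "X": np.array([[1, 1], [1, -1]]) / np.sqrt(2),     # Hadamard
    "Y": np.array([[1, -1j], [1, 1j]]) / np.sqrt(2),
    "Z": np.eye(2, dtype=complex),
}
I2 = np.eye(2)

def shadow_snapshot(rho, rng):
    """One classical-shadow snapshot of an n-qubit state `rho` from a
    non-adaptive, single-copy measurement in a random Pauli basis per qubit."""
    n = int(np.log2(rho.shape[0]))
    bases = rng.choice(list(PAULI_BASES), size=n)
    U = np.array([[1.0]])
    for b in bases:                                    # tensor product of rotations
        U = np.kron(U, PAULI_BASES[b])
    probs = np.real(np.diag(U @ rho @ U.conj().T)).clip(min=0)
    outcome = rng.choice(2 ** n, p=probs / probs.sum())
    bits = [(outcome >> (n - 1 - k)) & 1 for k in range(n)]
    # Inverse of the measurement channel, qubit by qubit: 3 U^dag |b><b| U - I.
    snapshot = np.array([[1.0]])
    for b, bit in zip(bases, bits):
        ket = PAULI_BASES[b].conj().T[:, bit].reshape(2, 1)
        snapshot = np.kron(snapshot, 3 * (ket @ ket.conj().T) - I2)
    return snapshot

rng = np.random.default_rng(0)
plus = np.array([[0.5, 0.5], [0.5, 0.5]])              # toy single-qubit state |+><+|
estimate = np.mean([shadow_snapshot(plus, rng) for _ in range(2000)], axis=0)
print(np.round(estimate.real, 2))                      # approaches |+><+| on average
```

Averaging many such snapshots gives an unbiased estimate of the state under the i.i.d. assumption; the paper's contribution is bounding how much extra data is needed when that assumption is dropped.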
LLMail-Inject introduces a public challenge and dataset designed to evaluate indirect prompt injection attacks against an LLM-based email assistant in a realistic, end-to-end setting. The project collected over 200,000 unique attack prompts, demonstrating that end-to-end attacks are challenging to execute against layered defenses and providing insights into effective defense strategies.
Autonomous systems require robust Multi-Object Tracking (MOT) capabilities to operate reliably in dynamic environments. MOT ensures consistent object identity assignment and precise spatial delineation. Recent advances in foundation models, such as SAM2, have demonstrated strong zero-shot generalization for video segmentation, but their direct application to MOTS (MOT+Segmentation) remains limited by insufficient identity management and memory efficiency. This work introduces Seg2Track-SAM2, a framework that integrates pre-trained object detectors with SAM2 and a novel Seg2Track module to address track initialization, track management, and reinforcement. The proposed approach requires no fine-tuning and remains detector-agnostic. Experimental results on KITTI MOT and KITTI MOTS benchmarks show that Seg2Track-SAM2 achieves state-of-the-art (SOTA) performance, ranking fourth overall in both car and pedestrian classes on KITTI MOTS, while establishing a new benchmark in association accuracy (AssA). Furthermore, a sliding-window memory strategy reduces memory usage by up to 75% with negligible performance degradation, supporting deployment under resource constraints. These results confirm that Seg2Track-SAM2 advances MOTS by combining robust zero-shot tracking, enhanced identity preservation, and efficient memory utilization. The code is available at this https URL
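The paper's sliding-window memory strategy is not specified in the abstract; the sketch below only illustrates the general idea of capping per-track memory to the most recent frames, with all names and the window size as assumptions.

```python
from collections import deque

class SlidingWindowMemory:
    """Hypothetical sketch of a sliding-window memory bank: only the most
    recent `window` frame features per track are retained, bounding memory
    growth over long sequences (Seg2Track-SAM2's actual mechanism may differ)."""

    def __init__(self, window: int = 8):
        self.window = window
        self.banks = {}  # track_id -> deque of frame features

    def update(self, track_id, frame_feature):
        bank = self.banks.setdefault(track_id, deque(maxlen=self.window))
        bank.append(frame_feature)  # oldest entry is evicted automatically

    def memory_for(self, track_id):
        return list(self.banks.get(track_id, []))

# Usage: push per-frame mask features, read back a bounded memory for propagation.
mem = SlidingWindowMemory(window=4)
for t in range(10):
    mem.update(track_id=1, frame_feature=f"feat_frame_{t}")
print(mem.memory_for(1))  # only the last 4 frames are retained
```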
We propose a primal heuristic for quadratic mixed-integer problems. Our method extends the Boscia framework -- originally a mixed-integer convex solver leveraging a Frank-Wolfe-based branch-and-bound approach -- to address nonconvex quadratic objectives and constraints. We reformulate nonlinear constraints, introduce preprocessing steps, and add a suite of heuristics, including rounding strategies, gradient-guided selection, and large neighborhood search techniques that exploit integer-feasible vertices generated during the Frank-Wolfe iterations. Computational results demonstrate the effectiveness of our method in solving challenging MIQCQPs, achieving improvements on QPLIB instances within minutes and winning first place in the Land-Doig MIP Computational Competition 2025.
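As a rough illustration of the rounding idea (not the Boscia implementation), the sketch below scans vertices collected during Frank-Wolfe iterations, rounds their integer coordinates, and keeps the best feasible point; the toy problem and all names are assumptions.

```python
import numpy as np

def rounding_heuristic(vertices, objective, is_feasible, integer_idx):
    """Hedged sketch: round the integer coordinates of candidate vertices
    gathered during Frank-Wolfe iterations and return the best feasible point."""
    best_x, best_val = None, np.inf
    for v in vertices:
        x = np.array(v, dtype=float)
        x[integer_idx] = np.round(x[integer_idx])   # enforce integrality
        if is_feasible(x) and objective(x) < best_val:
            best_x, best_val = x, objective(x)
    return best_x, best_val

# Toy usage: minimize an indefinite (nonconvex) quadratic over [0, 3]^2 with x[0] integer.
Q = np.array([[1.0, -2.0], [-2.0, 1.0]])
obj = lambda x: x @ Q @ x
feas = lambda x: np.all((x >= 0) & (x <= 3))
verts = [np.array([0.4, 2.9]), np.array([2.6, 0.1]), np.array([1.5, 1.5])]
print(rounding_heuristic(verts, obj, feas, integer_idx=[0]))
```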
This research developed a GPU-accelerated decoder for Quantum Low-Density Parity-Check (QLDPC) codes, achieving real-time decoding latencies below the 63 μs threshold set by current quantum processors. The decoder processed a [[784, 24, 24]] QLDPC code in 43.7 μs on an RTX 4090, showcasing the practical viability of these scalable codes for fault-tolerant quantum computing.
Stroke is a leading cause of long-term disability and the second most common cause of death worldwide. Although acute treatments have advanced, recovery remains challenging and limited. Brain-computer interfaces (BCIs) have emerged as a promising tool for post-stroke rehabilitation by promoting neuroplasticity. However, clinical outcomes remain variable, and optimal protocols have yet to be established. This study explores strategies to optimize BCI-based rehabilitation by comparing motor imagery of affected hand movement versus rest, instead of the conventional left-versus-right motor imagery. This alternative aims to simplify the task and address the weak contralateral activation commonly observed in stroke patients. Two datasets, one from healthy individuals and one from stroke patients, were used to evaluate the proposed approach. The results showed improved performance using both FBCSP and EEGNet. Additionally, we investigated the impact of session duration and found that shorter training sessions produced better BCI performance than longer sessions.
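To make the binary affected-hand-versus-rest setup concrete, here is a minimal decoding sketch using simple band-power features and LDA, a crude stand-in for the FBCSP and EEGNet models evaluated in the study; the synthetic data, sampling rate, and feature choices are all illustrative assumptions.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

FS = 250  # sampling rate in Hz (assumed)

def log_bandpower(X, band=(8, 30)):
    """Log power per channel in the mu/beta band via an FFT periodogram,
    a simplified stand-in for FBCSP-style spectral features."""
    freqs = np.fft.rfftfreq(X.shape[-1], d=1 / FS)
    psd = np.abs(np.fft.rfft(X, axis=-1)) ** 2
    mask = (freqs >= band[0]) & (freqs <= band[1])
    return np.log(psd[..., mask].mean(axis=-1) + 1e-12)

# Synthetic placeholder data: 80 trials x 8 channels x 2 s of EEG;
# label 1 = motor imagery of the affected hand, 0 = rest.
rng = np.random.default_rng(0)
X = rng.standard_normal((80, 8, 2 * FS))
y = rng.integers(0, 2, size=80)

features = log_bandpower(X)                             # shape (80, 8)
clf = make_pipeline(StandardScaler(), LinearDiscriminantAnalysis())
print(cross_val_score(clf, features, y, cv=5).mean())   # ~chance on random data
```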
The BANG method enables billion-scale Approximate Nearest Neighbour Search (ANNS) with high recall and throughput using a single GPU. It demonstrates 50x to 400x higher throughput on 1-billion point datasets compared to competing methods while maintaining high accuracy.
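The recall figure reported for ANNS methods is typically recall@k; the snippet below shows that metric in its standard form (illustrative only, not BANG's evaluation code).

```python
import numpy as np

def recall_at_k(approx_ids, exact_ids, k=10):
    """Fraction of true k-nearest neighbours recovered by the approximate
    search, averaged over queries."""
    hits = [len(set(a[:k]) & set(e[:k])) / k
            for a, e in zip(approx_ids, exact_ids)]
    return float(np.mean(hits))

# Toy usage with two queries and k = 3.
approx = [[5, 9, 2], [1, 4, 7]]
exact  = [[5, 2, 8], [4, 1, 7]]
print(recall_at_k(approx, exact, k=3))  # 0.833...
```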
We report on a search for weakly interacting massive particle (WIMP) dark matter (DM) via elastic DM-xenon-nucleus interactions in the XENONnT experiment. We combine datasets from the first and second science campaigns resulting in a total exposure of $3.1\;\text{tonne}\times\text{year}$. In a blind analysis of nuclear recoil events with energies above $3.8\,\mathrm{keV_{NR}}$, we find no significant excess above background. We set new upper limits on the spin-independent WIMP-nucleon scattering cross-section for WIMP masses above $10\,\mathrm{GeV}/c^2$ with a minimum of $1.7\times10^{-47}\,\mathrm{cm^2}$ at $90\,\%$ confidence level for a WIMP mass of $30\,\mathrm{GeV}/c^2$. We achieve a best median sensitivity of $1.4\times10^{-47}\,\mathrm{cm^2}$ for a $41\,\mathrm{GeV}/c^2$ WIMP. Compared to the result from the first XENONnT science dataset, we improve our sensitivity by a factor of up to 1.8.
The featured dataset, the Event-based Dataset of Assembly Tasks (EDAT24), showcases a selection of manufacturing primitive tasks (idle, pick, place, and screw), which are basic actions performed by human operators in any manufacturing assembly. The data were captured using a DAVIS240C event camera, an asynchronous vision sensor that registers events when changes in light intensity occur. Events are a lightweight data format for conveying visual information and are well suited for real-time detection and analysis of human motion. Each manufacturing primitive has 100 recorded samples of DAVIS240C data, including events and greyscale frames, for a total of 400 samples. In the dataset, the user interacts with objects from the open-source CT-Benchmark in front of the static DAVIS event camera. All data are made available in raw form (.aedat) and in pre-processed form (.npy). Custom-built Python code is made available together with the dataset to help researchers add new manufacturing primitives or extend the dataset with more samples.
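A minimal sketch of loading one pre-processed sample follows; the file name and the (timestamp, x, y, polarity) column layout are assumptions based on the standard DAVIS event format, so the dataset's own code should be consulted for the real layout.

```python
import numpy as np

# Hypothetical file name; assumed layout: one event per row as (t, x, y, polarity).
events = np.load("pick_sample_000.npy")
t, x, y, polarity = events.T

duration_s = (t[-1] - t[0]) / 1e6                 # DAVIS timestamps are in microseconds
print(f"{len(events)} events over {duration_s:.2f} s")
print(f"ON-event fraction: {(polarity > 0).mean():.2f}")

# Accumulate events into a simple 2D count image (the DAVIS240C sensor is 240x180 pixels).
frame = np.zeros((180, 240))
np.add.at(frame, (y.astype(int), x.astype(int)), 1)
```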
We show that approximating the trace norm contraction coefficient of a quantum channel within a constant factor is NP-hard. Equivalently, this shows that determining the optimal success probability for encoding a bit in a quantum system undergoing noise is NP-hard. This contrasts with the classical analogue of this problem that can clearly be solved efficiently. We also establish the NP-hardness of deciding if the contraction coefficient is equal to 1, i.e., the channel can perfectly preserve a bit. As a consequence, deciding if a non-commutative graph has an independence number of at least 2 is NP-hard. In addition, we establish a converging hierarchy of semidefinite programming upper bounds on the contraction coefficient.
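For readers unfamiliar with the quantity, the following is the standard definition assumed here (a sketch, not quoted from the paper), together with the Helstrom-based link to the bit-encoding success probability mentioned in the abstract.

```latex
% Trace norm contraction coefficient of a channel \mathcal{N} (standard definition):
\[
  \eta_{\mathrm{tr}}(\mathcal{N})
    \;=\; \sup_{\rho \neq \sigma}
      \frac{\left\| \mathcal{N}(\rho) - \mathcal{N}(\sigma) \right\|_1}
           {\left\| \rho - \sigma \right\|_1} .
\]
% Encoding a bit into states \rho_0, \rho_1 and decoding after the noise with the
% optimal (Helstrom) measurement succeeds with probability
% 1/2 + ||\mathcal{N}(\rho_0) - \mathcal{N}(\rho_1)||_1 / 4; since the supremum above
% is attained on orthogonal inputs (||\rho_0 - \rho_1||_1 = 2), the optimal success
% probability is
\[
  p_{\mathrm{succ}}(\mathcal{N})
    \;=\; \tfrac{1}{2}\bigl(1 + \eta_{\mathrm{tr}}(\mathcal{N})\bigr),
\]
% which is why approximating the two quantities is the same problem.
```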
We demonstrate that Gaia's detection of stars on wide orbits around black holes opens a new observational window on dark matter structures -- such as scalar clouds and dark matter spikes -- predicted in a range of theoretical scenarios. Using precise radial velocity measurements of these systems, we derive state-of-the-art constraints on dark matter density profiles and particle masses in previously unexplored regions of parameter space. We also test the black hole hypothesis against the alternative of a boson star composed of light scalar fields.
Transformers exhibit in-context learning (ICL): the ability to use novel information presented in the context without additional weight updates. Recent work shows that ICL emerges when models are trained on a sufficiently diverse set of tasks and the transition from memorization to generalization is sharp with increasing task diversity. One interpretation is that a network's limited capacity to memorize favors generalization. Here, we examine the mechanistic underpinnings of this transition using a small transformer applied to a synthetic ICL task. Using theory and experiment, we show that the sub-circuits that memorize and generalize can be viewed as largely independent. The relative rates at which these sub-circuits learn explains the transition from memorization to generalization, rather than capacity constraints. We uncover a memorization scaling law, which determines the task diversity threshold at which the network generalizes. The theory quantitatively explains a variety of other ICL-related phenomena, including the long-tailed distribution of when ICL is acquired, the bimodal behavior of solutions close to the task diversity threshold, the influence of contextual and data distributional statistics on ICL, and the transient nature of ICL.
We report on a blinded search for dark matter with single- and few-electron signals in the first science run of XENONnT relying on a novel detector response framework that is physics-model-dependent. We derive 90\% confidence upper limits for dark matter-electron interactions. Heavy and light mediator cases are considered for the standard halo model and dark matter up-scattered in the Sun. We set stringent new limits on dark matter-electron scattering via a heavy mediator with a mass within $10$-$20\,\mathrm{MeV}/c^2$ and on electron absorption of axion-like particles and dark photons for $m_\chi$ below $0.186\,\mathrm{keV}/c^2$.
We present the first measurement of nuclear recoils from solar $^8$B neutrinos via coherent elastic neutrino-nucleus scattering with the XENONnT dark matter experiment. The central detector of XENONnT is a low-background, two-phase time projection chamber with a 5.9 t sensitive liquid xenon target. A blind analysis with an exposure of 3.51 t$\times$yr resulted in 37 observed events above 0.5 keV, with $26.4^{+1.4}_{-1.3}$ events expected from backgrounds. The background-only hypothesis is rejected with a statistical significance of 2.73 $\sigma$. The measured $^8$B solar neutrino flux of $(4.7^{+3.6}_{-2.3})\times 10^6\,\mathrm{cm}^{-2}\,\mathrm{s}^{-1}$ is consistent with results from the Sudbury Neutrino Observatory. The measured neutrino flux-weighted CE$\nu$NS cross section on Xe of $(1.1^{+0.8}_{-0.5})\times10^{-39}\,\mathrm{cm^2}$ is consistent with the Standard Model prediction. This is the first direct measurement of nuclear recoils from solar neutrinos with a dark matter detector.
An Evolutionary Data-Centric AutoML (EDCA) framework automates the creation of efficient machine learning pipelines by integrating dynamic data preprocessing and reduction with evolutionary algorithms. It achieves predictive performance comparable to leading AutoML tools while consistently using substantially less data across various classification datasets.
Transits in the planetary system WASP-4 were recently found to occur 80 s earlier than expected in observations from the TESS satellite. We present 22 new times of mid-transit that confirm the existence of transit timing variations, and are well fitted by a quadratic ephemeris with period decay dP/dt = -9.2 +/- 1.1 ms/yr. We rule out instrumental issues, stellar activity and the Applegate mechanism as possible causes. The light-time effect is also not favoured due to the non-detection of changes in the systemic velocity. Orbital decay and apsidal precession are plausible but unproven. WASP-4b is only the third hot Jupiter known to show transit timing variations to high confidence. We discuss a variety of observations of this and other planetary systems that would be useful in improving our understanding of WASP-4 in particular and orbital decay in general.
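The quadratic ephemeris referred to above has the standard form sketched below (assumed rather than quoted from the paper); it shows how a constant period derivative translates into transits arriving progressively earlier than a linear ephemeris predicts.

```latex
% Mid-transit time at epoch number E under a quadratic ephemeris:
\[
  T_{\mathrm{mid}}(E) \;=\; T_0 \;+\; P\,E \;+\; \frac{1}{2}\,\frac{dP}{dE}\,E^{2},
\]
% where the period-change rate quoted in the abstract is
% dP/dt = (dP/dE)/P = -9.2 +/- 1.1 ms/yr, i.e. the quadratic term accumulates as a
% growing offset relative to a constant-period fit.
```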
The LUX-ZEPLIN (LZ) experiment is searching for dark matter interactions in a liquid xenon time projection chamber (LXe-TPC). This article demonstrates how control of the flow state in the LXe-TPC enables the identification of pairs of sequential alpha-decays, which are used to map fluid flow and ion drift in the liquid target. The resulting transport model is used to tag $^{214}$Pb beta-decays, a leading background to dark matter signals in LZ. Temporally evolving volume selections, at a cost of 9.0% of exposure, target the decay of each $^{214}$Pb atom up to 81 minutes after production, resulting in $(63 \pm 6_{\mathrm{stat}} \pm 7_{\mathrm{sys}})\%$ identification of $^{214}$Pb decays to ground state. We also demonstrate how flow-based tagging techniques enable a novel calibration side band that is concurrent with science data.
Group fairness in machine learning is an important area of research focused on achieving equitable outcomes across different groups defined by sensitive attributes such as race or gender. Federated Learning, a decentralized approach to training machine learning models across multiple clients, amplifies the need for fairness methodologies due to its inherent heterogeneous data distributions that can exacerbate biases. The intersection of Federated Learning and group fairness has attracted significant interest, with 48 research works specifically dedicated to addressing this issue. However, no comprehensive survey has specifically focused on group fairness in Federated Learning. In this work, we analyze the key challenges of this topic, propose practices for its identification and benchmarking, and create a novel taxonomy based on criteria such as data partitioning, location, and strategy. Furthermore, we analyze broader concerns, review how different approaches handle the complexities of various sensitive attributes, examine common datasets and applications, and discuss the ethical, legal, and policy implications of group fairness in FL. We conclude by highlighting key areas for future research, emphasizing the need for more methods to address the complexities of achieving group fairness in federated systems.
Living systems exhibit a range of fundamental characteristics: they are active, self-referential, self-modifying systems. This paper explores how these characteristics create challenges for conventional scientific approaches and why they require new theoretical and formal frameworks. We introduce a distinction between 'natural time', the continuing present of physical processes, and 'representational time', with its framework of past, present and future that emerges with life itself. Representational time enables memory, learning and prediction, functions of living systems essential for their survival. Through examples from evolution, embryogenesis and metamorphosis we show how living systems navigate the apparent contradictions arising from self-reference as natural time unwinds self-referential loops into developmental spirals. Conventional mathematical and computational formalisms struggle to model self-referential and self-modifying systems without running into paradox. We identify promising new directions for modelling self-referential systems, including domain theory, co-algebra, genetic programming, and self-modifying algorithms. There are broad implications for biology, cognitive science and social sciences, because self-reference and self-modification are not problems to be avoided but core features of living systems that must be modelled to understand life's open-ended creativity.