Free University of Berlin
Laser pulses are known to induce symmetric demagnetization: an equal loss of magnetic moments in the identical sublattices of antiferromagnets and ferromagnets at ultrashort timescales, owing to their identical local electronic structures as dictated by the underlying symmetries. Using time-dependent density functional theory, we demonstrate that laser pulses can drive asymmetric demagnetization dynamics of the identical sublattices in the d-wave compensated altermagnet RuO2, resulting in a photo-induced ferrimagnetic state with a net moment of ~0.2 μ_B per unit cell. This metastable magnetization is highly controllable, depending on the direction of the linearly polarized laser. We identify the underlying mechanism as an anisotropic optically induced intersite spin transfer (a-OISTR) effect, originating from the momentum-dependent spin splitting unique to altermagnets. The a-OISTR effect enables the polarization of light to drive direction-selective transient spin-dependent currents between sublattices, leading to a controllable ultrafast magnetic state transition. These findings uncover novel laser-driven pathways to control magnetic order in altermagnets, enabling a phase transition from the altermagnetic to a ferrimagnetic state.
Researchers demonstrate that the quantum harmonic oscillator, a foundational system, exhibits a previously unrecognized bosonic anomaly and deep topological structures, identifying its internal energy with the Hirzebruch L-genus and its partition function with the Chern character of a "physical sheaf."
Causal Machine Learning has emerged as a powerful tool for flexibly estimating causal effects from observational data in both industry and academia. However, causal inference from observational data relies on untestable assumptions about the data-generating process, such as the absence of unobserved confounders. When these assumptions are violated, causal effect estimates may become biased, undermining the validity of research findings. In these contexts, sensitivity analysis plays a crucial role by enabling data scientists to assess the robustness of their findings to plausible violations of unconfoundedness. This paper introduces sensitivity analysis and demonstrates its practical relevance through a (simulated) data example based on a use case at this http URL. We focus our presentation on a recently proposed method by Chernozhukov et al. (2023), which derives general non-parametric bounds on biases due to omitted variables and is fully compatible with (though not limited to) modern inferential tools of Causal Machine Learning. By presenting this use case, we aim to raise awareness of sensitivity analysis and highlight its importance in real-world scenarios.
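As a minimal illustration of why unconfoundedness matters, the toy simulation below shows how an omitted confounder biases a naive effect estimate; this is exactly the bias that sensitivity analysis seeks to bound. The sketch is purely hypothetical (our own simulation, not the Chernozhukov et al. (2023) procedure), and all parameter values are illustrative.

```python
# Hypothetical simulation of omitted-variable bias (not the method itself).
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
theta = 1.0                       # true causal effect of D on Y

u = rng.normal(size=n)            # unobserved confounder
d = 0.8 * u + rng.normal(size=n)  # treatment depends on U
y = theta * d + 0.8 * u + rng.normal(size=n)

# Naive estimate: regress Y on D only -- biased because U is omitted.
naive = np.polyfit(d, y, 1)[0]

# Oracle estimate: adjust for U (possible here only because we simulated U).
X = np.column_stack([d, u, np.ones(n)])
oracle = np.linalg.lstsq(X, y, rcond=None)[0][0]

print(f"naive:  {naive:.3f}")   # ~1.39: biased upward
print(f"oracle: {oracle:.3f}")  # ~1.00: recovers theta
```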
We study uncertainty quantification for partial differential equations subject to domain uncertainty. We parameterize the random domain using the model recently considered by Chernov and Le (2024) as well as Harbrecht, Schmidlin, and Schwab (2024), in which the input random field is assumed to belong to a Gevrey smoothness class. This approach has the advantage of being substantially more general than models that assume a particular parametric representation of the input random field, such as a Karhunen–Loève series expansion. We consider both the Poisson equation and the heat equation, and design randomly shifted lattice quasi-Monte Carlo (QMC) cubature rules for the computation of the expected solution under domain uncertainty. We show that these QMC rules exhibit dimension-independent, essentially linear cubature convergence rates in this framework. In addition, we complete the error analysis by taking into account the approximation errors incurred by dimension truncation of the random input field and finite element discretization. Numerical experiments are presented to confirm the theoretical rates.
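For readers unfamiliar with the cubature rules in question, the sketch below shows a generic randomly shifted rank-1 lattice rule. The generating vector here is a placeholder assumption; in practice it would be constructed (e.g., component-by-component) for the weighted function space at hand.

```python
# Generic randomly shifted rank-1 lattice rule (illustrative sketch).
import numpy as np

def shifted_lattice_points(n, z, shift):
    """Points t_k = frac(k*z/n + shift), k = 0..n-1."""
    k = np.arange(n)[:, None]
    return np.mod(k * z[None, :] / n + shift[None, :], 1.0)

def qmc_estimate(f, n, z, n_shifts=16, seed=0):
    """Average over i.i.d. random shifts; the spread across shifts gives
    a practical error indicator for the cubature value."""
    rng = np.random.default_rng(seed)
    estimates = []
    for _ in range(n_shifts):
        pts = shifted_lattice_points(n, z, rng.random(z.size))
        estimates.append(f(pts).mean())
    return np.mean(estimates), np.std(estimates) / np.sqrt(n_shifts)

# Example: approximate E[f] over the unit cube for a smooth integrand.
z = np.array([1, 182667, 469891, 498753, 110745])  # placeholder vector
f = lambda x: np.exp(x.sum(axis=1) / x.shape[1])
mean, err = qmc_estimate(f, n=2**13, z=z)
print(mean, err)
```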
In this paper, we propose a hybrid MPC local planner that uses a learning-based approximation of a time-varying safe set, derived from local observations and applied as the MPC terminal constraint. This set can be represented as a zero-superlevel set of the value function computed via Hamilton-Jacobi (HJ) reachability analysis, which is infeasible to compute in real time. We exploit the property that the HJ value function can be expressed as the difference of the corresponding signed distance function (SDF) and a non-negative residual function. The residual component is modeled as a neural network with non-negative output and subtracted from the computed SDF, resulting in a real-time value function estimate that is at least as safe as the SDF by design. Additionally, we parametrize the neural residual by a hypernetwork to improve real-time performance and generalization properties. The proposed method is compared with three state-of-the-art methods in simulations and hardware experiments, achieving up to 30% higher success rates compared to the best baseline while requiring a similar computational effort and producing high-quality (low travel-time) solutions.
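A minimal PyTorch sketch of the value-function construction described above follows; names, dimensions, and architecture are illustrative assumptions, not the authors' implementation.

```python
# Sketch of the "SDF minus non-negative residual" value-function estimate.
import torch
import torch.nn as nn

class ResidualValueNet(nn.Module):
    """Estimates V(x) = SDF(x) - r(x), with r >= 0 enforced by a softplus,
    so {V >= 0} is contained in {SDF >= 0}: the learned safe set is never
    larger than the SDF-safe set, i.e., conservative by construction."""

    def __init__(self, dim=3, hidden=64):  # dim covers state (+ time)
        super().__init__()
        self.residual = nn.Sequential(
            nn.Linear(dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1), nn.Softplus(),  # non-negative output
        )

    def forward(self, x, sdf_values):
        # sdf_values: precomputed signed distances at x (cheap to evaluate)
        return sdf_values - self.residual(x).squeeze(-1)

net = ResidualValueNet(dim=3)
x = torch.randn(8, 3)
sdf = torch.randn(8)            # stand-in for an SDF query
v = net(x, sdf)
assert torch.all(v <= sdf)      # safety is structural, not learned
```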
Explainable Artificial Intelligence (XAI) plays a crucial role in fostering transparency and trust in AI systems. Traditional XAI approaches typically offer one level of abstraction for explanations, often in the form of heatmaps highlighting single or multiple input features. However, we ask whether abstract reasoning or a model's problem-solving strategies may also be relevant, as these align more closely with how humans approach solutions to problems. We propose a framework, called Symbolic XAI, that attributes relevance to symbolic queries expressing logical relationships between input features, thereby capturing the abstract reasoning behind a model's predictions. The methodology is built upon a simple yet general multi-order decomposition of model predictions. This decomposition can be specified using higher-order propagation-based relevance methods, such as GNN-LRP, or perturbation-based explanation methods commonly used in XAI. The effectiveness of our framework is demonstrated in the domains of natural language processing (NLP), vision, and quantum chemistry (QC), where abstract symbolic domain knowledge is abundant and of significant interest to users. The Symbolic XAI framework provides an understanding of the model's decision-making process that is both flexible for customization by the user and human-readable through logical formulas.
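To make the idea of attributing relevance to a symbolic query concrete, here is a toy perturbation-based sketch (our illustration under simplifying assumptions, not the paper's GNN-LRP-based implementation) that scores the query "feature i AND feature j" as the second-order term of a masking decomposition.

```python
# Toy perturbation-based relevance for a symbolic AND-query.
import numpy as np

def masked_output(model, x, baseline, keep):
    """Evaluate the model with only the features in `keep` taken from x;
    all other features are replaced by baseline values."""
    x_masked = baseline.copy()
    x_masked[list(keep)] = x[list(keep)]
    return model(x_masked)

def relevance_and(model, x, baseline, i, j):
    """Relevance of 'feature i AND feature j': the interaction term of a
    second-order decomposition of the prediction."""
    f = lambda keep: masked_output(model, x, baseline, keep)
    return f({i, j}) - f({i}) - f({j}) + f(set())

# Toy model with a genuine interaction between features 0 and 1:
model = lambda x: x[0] * x[1] + 0.5 * x[2]
x = np.array([2.0, 3.0, 1.0])
baseline = np.zeros(3)
print(relevance_and(model, x, baseline, 0, 1))  # 6.0: the x0*x1 interaction
```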
Complex time-varying networks are prominent models for a wide variety of spatiotemporal phenomena. The functioning of networks depends crucially on their connectivity, yet reliable techniques for learning communities in time-evolving networks remain elusive. We adapt successful spectral techniques from continuous-time dynamics on manifolds to the graph setting to fill this gap. We consider the supra-Laplacian for graphs and develop a spectral theory to underpin the corresponding algorithmic realisations. We develop spectral clustering approaches for both multiplex and non-multiplex networks, based on the eigenvectors of the supra-Laplacian and specialised Sparse EigenBasis Approximation (SEBA) post-processing of these eigenvectors. We demonstrate that our approach can outperform the Leiden algorithm applied both in spacetime and layer-by-layer, and we analyse voting data from the US Senate (where senators come and go as congresses evolve) to quantify increasing polarisation in time.
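The following sketch illustrates one standard way to assemble a supra-Laplacian for a small two-layer multiplex network; the coupling strength and graphs are made up for illustration, and the SEBA post-processing step is omitted.

```python
# Supra-Laplacian of a toy two-layer multiplex network (illustrative).
import numpy as np
from scipy.linalg import block_diag, eigh

def supra_laplacian(layers, omega):
    """layers: list of (n x n) adjacency matrices sharing one node set;
    omega: inter-layer coupling strength between copies of each node."""
    T, n = len(layers), layers[0].shape[0]
    supra_adj = block_diag(*layers)
    # Couple each node to its own copy in adjacent layers.
    for t in range(T - 1):
        a, b = t * n, (t + 1) * n
        supra_adj[a:a + n, b:b + n] += omega * np.eye(n)
        supra_adj[b:b + n, a:a + n] += omega * np.eye(n)
    deg = np.diag(supra_adj.sum(axis=1))
    return deg - supra_adj

# Two layers over 4 shared nodes; the leading nontrivial eigenvectors
# would feed a spectral clustering step.
A1 = np.array([[0,1,0,0],[1,0,1,0],[0,1,0,1],[0,0,1,0]], float)
A2 = np.array([[0,1,1,0],[1,0,0,0],[1,0,0,1],[0,0,1,0]], float)
L = supra_laplacian([A1, A2], omega=0.5)
vals, vecs = eigh(L)
print(vals[:3])  # smallest eigenvalues of the supra-Laplacian
```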
Artificial Intelligence is rapidly advancing and radically impacting everyday life, driven by the increasing availability of computing power. Despite this trend, the adoption of AI in real-world healthcare is still limited. One of the main reasons is the trustworthiness of AI models and the potential hesitation of domain experts to rely on model predictions. Explainable Artificial Intelligence (XAI) techniques aim to address these issues. However, explainability can mean different things to people with different backgrounds, expertise, and goals. To address a target audience with diverse needs, we develop storytelling XAI. In this research, we have developed an approach that combines multi-task distillation with interpretability techniques to enable audience-centric explainability. Multi-task distillation allows the model to exploit the relationships between tasks, potentially improving interpretability as each task supports the others, leading to enhanced interpretability from the perspective of a domain expert. The distillation process also allows us to extend this research to large, highly complex deep models. We focus on both model-agnostic and model-specific methods of interpretability, supported by textual justification of the results in healthcare through our use case. Our methods increase the trust of both domain experts and machine learning experts, enabling responsible AI.
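As a sketch of the multi-task distillation component, the following is a generic formulation under our own assumptions (standard temperature-scaled distillation per task), not the authors' exact loss.

```python
# Generic multi-task knowledge distillation loss (illustrative sketch).
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Per-task loss: soft-target KL at temperature T plus hard-label CE."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * T * T
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

def multi_task_loss(student_out, teacher_out, labels, task_weights):
    """Sum per-task distillation losses so the student's shared
    representation is shaped by the relationships between tasks."""
    return sum(
        w * distillation_loss(student_out[t], teacher_out[t], labels[t])
        for t, w in task_weights.items()
    )

# e.g., two tasks with 3-class outputs on a batch of 4:
s = {t: torch.randn(4, 3) for t in ("task_a", "task_b")}
tch = {t: torch.randn(4, 3) for t in ("task_a", "task_b")}
lbl = {t: torch.randint(0, 3, (4,)) for t in ("task_a", "task_b")}
print(multi_task_loss(s, tch, lbl, {"task_a": 1.0, "task_b": 0.5}))
```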
The application of variational principles for analyzing problems in the physical sciences is widespread. Cantilever-like problems, where one end is fixed and the other end is free, have received less attention in terms of their stability despite their prevalence. In this article, we establish stability conditions for these problems by examining the second variation of the energy functional through the generalized Jacobi condition. This requires computing conjugate points determined by solving a set of initial value problems from the linearized equilibrium equations. We apply these conditions to investigate the nonlinear stability of intrinsically curved elastic cantilevers subject to an end load. The rod deformations are modelled using Kirchhoff rod theory. The role of intrinsic curvature in inducing complex nonlinear phenomena, such as snap-back instability, is particularly emphasized. The numerical examples highlight its dependence on the system parameters. These examples illustrate potential applications in the design of flexible soft robot arms and innovative mechanisms.
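The conjugate-point test generalizes a classical computation. The toy example below uses a textbook functional, not the Kirchhoff-rod system of the paper, to show the basic recipe: integrate the Jacobi (linearized equilibrium) equation as an initial value problem and locate its first zero.

```python
# Locating a conjugate point via an initial value problem (toy example).
import numpy as np
from scipy.integrate import solve_ivp

# For J[y] = int_0^L (y'^2 - y^2) dx, the Jacobi equation is u'' + u = 0
# with u(0) = 0, u'(0) = 1. The first zero of u past the left end is the
# conjugate point (here x = pi); an extremal longer than that is unstable.
rhs = lambda x, s: [s[1], -s[0]]
sol = solve_ivp(rhs, (0.0, 6.0), [0.0, 1.0], dense_output=True, max_step=0.01)

x = np.linspace(1e-3, 6.0, 60_000)
u = sol.sol(x)[0]
crossing = x[np.argmax(u < 0.0)]                   # first sign change
print(f"conjugate point near x = {crossing:.4f}")  # ~3.1416
```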
Quantum many-body scarring (QMBS) is an intriguing mechanism of weak ergodicity breaking that has recently spurred significant attention. Particularly prominent in Abelian lattice gauge theories (LGTs), an open question is whether QMBS nontrivially arises in non-Abelian LGTs. Here, we present evidence of robust QMBS in a non-Abelian SU(2) LGT with dynamical matter. Starting in product states that require little experimental overhead, we show that prominent QMBS arises for certain quenches, facilitated through meson and baryon-antibaryon excitations, highlighting its non-Abelian nature. The uncovered scarred dynamics manifests as long-lived coherent oscillations in experimentally accessible local observables as well as prominent revivals in the state fidelity. Our findings bring QMBS to the realm of non-Abelian LGTs, highlighting the intimate connection between scarring and gauge symmetry, and are amenable for observation in a recently proposed trapped-ion qudit quantum computer.
Recent work has demonstrated the utility of introducing non-linearity through repeat-until-success (RUS) sub-routines into quantum circuits for generative modeling. As a follow-up to this work, we investigate two questions of relevance to the quantum algorithms and machine learning communities: Does introducing this form of non-linearity make the learning model classically simulatable due to the deferred measurement principle? And does introducing this form of non-linearity make the overall model's training more unstable? With respect to the first question, we demonstrate that the RUS sub-routines do not allow us to trivially map this quantum model to a classical one, whereas a model without RUS sub-circuits containing mid-circuit measurements could be mapped to a classical Bayesian network due to the deferred measurement principle of quantum mechanics. This strongly suggests that the proposed form of non-linearity makes the model classically inefficient to simulate. In pursuit of the second question, we train larger models than previously shown on three different probability distributions, one continuous and two discrete, and compare the training performance across multiple random trials. We see that while the model is able to perform exceptionally well in some trials, the variance across trials with certain datasets quantifies its relatively poor training stability.
The field of health informatics has been profoundly influenced by the development of random forest models, which have led to significant advances in the interpretability of feature interactions. These models are characterized by their robustness to overfitting and their parallelizability, making them particularly useful in this domain. However, the increasing number of features and estimators in random forests can prevent domain experts from accurately interpreting global feature interactions, thereby compromising trust and regulatory compliance. A method called the surrogate interpretability graph has been developed to address this issue. It uses graphs and mixed-integer linear programming to analyze and visualize feature interactions, improving interpretability by visualizing feature usage in a decision-feature-interaction table and the most dominant hierarchical decision-feature interactions for predictions. The surrogate interpretability graph thus enhances global interpretability, which is critical for such a high-stakes domain.
The Bayesian perspective on inverse problems has attracted much mathematical attention in recent years. Particular attention has been paid to Bayesian inverse problems (BIPs) in which the parameter to be inferred lies in an infinite-dimensional space, a typical example being a scalar or tensor field coupled to some observed data via an ODE or PDE. This article gives an introduction to the framework of well-posed BIPs in infinite-dimensional parameter spaces, as advocated by Stuart (Acta Numer. 19:451–559, 2010) and others. This framework has the advantage of ensuring uniformly well-posed inference problems independently of the finite-dimensional discretisation used for numerical solution. Recently, this framework has been extended to the case of a heavy-tailed prior measure in the family of stable distributions, such as an infinite-dimensional Cauchy distribution, for which polynomial moments are infinite or undefined. It is shown that analogues of the Karhunen–Loève expansion for square-integrable random variables can be used to sample such measures on quasi-Banach spaces. Furthermore, under weaker regularity assumptions than those used to date, the Bayesian posterior measure is shown to depend Lipschitz continuously in the Hellinger and total variation metrics upon perturbations of the misfit function and observed data.
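In representative form (the precise assumptions are in the article), the heavy-tailed analogue of the Karhunen–Loève expansion builds a draw from the prior as a weighted series with i.i.d. stable coefficients, e.g., in the Cauchy case:

```latex
% Representative form only; (\psi_k) is a fixed basis of the quasi-Banach
% space and (\gamma_k) a suitably summable sequence of weights.
u \;=\; \sum_{k=1}^{\infty} \gamma_k \,\xi_k\, \psi_k,
\qquad \xi_k \overset{\mathrm{i.i.d.}}{\sim} \mathrm{Cauchy}(0,1).
```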
Artificial intelligence (AI) comes with great opportunities but can also pose significant risks. Automatically generated explanations for decisions can increase transparency and foster trust, especially for systems based on automated predictions by AI models. However, given, e.g., economic incentives to create dishonest AI, to what extent can we trust explanations? To address this issue, our work investigates how AI models (i.e., deep learning) and existing instruments to increase transparency regarding AI decisions can be used to create and detect deceptive explanations. As an empirical evaluation, we focus on text classification and alter the explanations generated by GradCAM, a well-established explanation technique for neural networks. Then, we evaluate the effect of deceptive explanations on users in an experiment with 200 participants. Our findings confirm that deceptive explanations can indeed fool humans. However, one can deploy machine learning (ML) methods to detect seemingly minor deception attempts with accuracy exceeding 80% given sufficient domain knowledge. Without domain knowledge, one can still infer inconsistencies in the explanations in an unsupervised manner, given basic knowledge of the predictive model under scrutiny.
This work establishes a dynamical Gibbs variational principle and an attractor property for general irreversible interacting particle systems, rigorously demonstrating that weak limit points of time-evolved translation-invariant measures are Gibbs measures. It extends the applicability of relative entropy methods to complex stochastic systems with finite local state spaces and arbitrary finite-region updates.
We consider the use of randomised forward models and log-likelihoods within the Bayesian approach to inverse problems. Such random approximations to the exact forward model or log-likelihood arise naturally when a computationally expensive model is approximated using a cheaper stochastic surrogate, as in Gaussian process emulation (kriging), or in the field of probabilistic numerical methods. We show that the Hellinger distance between the exact and approximate Bayesian posteriors is bounded by moments of the difference between the true and approximate log-likelihoods. Example applications of these stability results are given for randomised misfit models in large data applications and the probabilistic solution of ordinary differential equations.
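The flavour of the stability results is captured by bounds of the following representative form (constants and moment orders depend on the paper's assumptions), where Φ is the exact negative log-likelihood, Φ^N its random approximation, and μ, μ^N the corresponding posteriors:

```latex
% Representative form only: the Hellinger distance between posteriors is
% controlled by a mean-square error of the approximate log-likelihood.
d_{\mathrm{H}}\!\left(\mu, \mu^{N}\right)
\;\le\; C \left( \mathbb{E}\!\left[ \bigl|\Phi(u) - \Phi^{N}(u)\bigr|^{2} \right] \right)^{1/2}.
```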
Recent advances in out-of-distribution (OOD) detection on image data show that pre-trained neural network classifiers can separate in-distribution (ID) from OOD data well, leveraging the class-discriminative ability of the model itself. Methods have been proposed that either use logit information directly or that process the model's penultimate layer activations. With "WeiPer", we introduce perturbations of the class projections in the final fully connected layer, creating a richer representation of the input. We show that this simple trick can improve the OOD detection performance of a variety of methods, and we additionally propose a distance-based method that leverages the properties of the augmented WeiPer space. We achieve state-of-the-art OOD detection results across multiple benchmarks of the OpenOOD framework, especially pronounced in difficult settings in which OOD samples are positioned close to the training set distribution. We support our findings with theoretical motivations and empirical observations, and run extensive ablations to provide insights into why WeiPer works.
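A minimal sketch of the perturbation step as we read it from the description above (hyperparameters, scaling, and function names are our assumptions, not the WeiPer implementation):

```python
# Sketch: perturb the class projections of the final layer and concatenate
# the resulting logits into a richer representation for OOD scoring.
import torch

def weiper_features(penultimate, W, n_perturb=8, scale=0.1, seed=0):
    """penultimate: (batch, d) activations; W: (classes, d) final-layer
    weights. Returns logits under n_perturb randomly perturbed copies of W,
    concatenated into a (batch, n_perturb * classes) representation."""
    g = torch.Generator().manual_seed(seed)
    feats = []
    for _ in range(n_perturb):
        noise = torch.randn(W.shape, generator=g) * scale * W.norm(dim=1, keepdim=True)
        feats.append(penultimate @ (W + noise).T)
    return torch.cat(feats, dim=-1)

h = torch.randn(4, 512)             # stand-in penultimate activations
W = torch.randn(10, 512)            # stand-in class projections
print(weiper_features(h, W).shape)  # torch.Size([4, 80])
```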
Network analyses, such as of gene co-expression networks, metabolic networks, and ecological networks, have become a central approach for the systems-level study of biological data. Several software packages exist for generating and analyzing such networks, either from correlation scores or from the absolute value of a transformed score called weighted topological overlap (wTO). However, since gene regulatory processes can up- or down-regulate genes, it is of great interest to explicitly consider both positive and negative correlations when constructing a gene co-expression network. Here, we present an R package for calculating the wTO that, in contrast to existing packages, explicitly addresses the sign of the wTO values and is thus especially valuable for the analysis of gene regulatory networks. The package includes the calculation of p-values (raw and adjusted) for each pairwise gene score. Our package also allows the calculation of networks from time series (without replicates). Since networks from independent datasets (biological repeats or related studies) are not the same due to technical and biological noise in the data, we additionally incorporated a novel method for calculating a consensus network (CN) from two or more networks into our R package. We compare our new wTO package to state-of-the-art packages and demonstrate the application of the wTO and CN functions using three independently derived datasets from healthy human pre-frontal cortex samples. To showcase an example of the time series application, we utilized a metagenomics dataset. In this work, we developed a software package that allows the computation of wTO networks and CNs, together with a visualization tool, in the R statistical environment. It is publicly available on CRAN under the GPL-2 Open Source License (this https URL).
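For illustration, a signed wTO score can be computed directly from a signed correlation matrix along the following lines; this is a sketch of the quantity itself under the standard wTO formula, not the R package's API.

```python
# Signed weighted topological overlap from a signed correlation matrix.
import numpy as np

def signed_wto(corr):
    """corr: (n x n) signed correlation matrix with zero diagonal.
    Keeping the sign of the correlations lets positive (co-activation) and
    negative (repression-like) relationships offset each other, instead of
    being conflated by taking absolute values."""
    A = corr.copy()
    np.fill_diagonal(A, 0.0)
    k = np.abs(A).sum(axis=1)         # node connectivity
    num = A @ A + A                   # sum_u a_iu * a_uj  +  a_ij
    den = np.minimum.outer(k, k) + 1.0 - np.abs(A)
    W = num / den
    np.fill_diagonal(W, 1.0)
    return W

corr = np.array([[ 0.0,  0.8, -0.5],
                 [ 0.8,  0.0, -0.6],
                 [-0.5, -0.6,  0.0]])
print(signed_wto(corr).round(3))
```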
Large language models (LLMs) are now widely accessible, reaching learners at all educational levels. This development has raised concerns that their use may circumvent essential learning processes and compromise the integrity of established assessment formats. In physics education, where problem solving plays a central role in instruction and assessment, it is therefore essential to understand the physics-specific problem-solving capabilities of LLMs. Such understanding is key to informing responsible and pedagogically sound approaches to integrating LLMs into instruction and assessment. This study therefore compares the problem-solving performance of a general-purpose LLM (GPT-4o, using varying prompting techniques) and a reasoning-optimized model (o1-preview) with that of participants of the German Physics Olympiad, based on a set of well-defined Olympiad problems. In addition to evaluating the correctness of the generated solutions, the study analyzes characteristic strengths and limitations of LLM-generated solutions. The findings of this study indicate that both tested LLMs (GPT-4o and o1-preview) demonstrate advanced problem-solving capabilities on Olympiad-type physics problems, on average outperforming the human participants. Prompting techniques had little effect on GPT-4o's performance, while o1-preview almost consistently outperformed both GPT-4o and the human benchmark. Based on these findings, the study discusses implications for the design of summative and formative assessment in physics education, including how to uphold assessment integrity and support students in critically engaging with LLMs.
Analysing patterns of engagement among citizen science participants can provide important insights into the organisation and practice of individual citizen science projects. In particular, methods from statistics and network science can be used to understand different types of user behaviour and user interactions to help the further implementation and organisation of community efforts. Using publicly available data from the iNaturalist community and their yearly City Nature Challenges (CNC) from 2017 to 2020 as an example, we showcase computational methods to explore the spatio-temporal evolution of this citizen science community, which typically interacts in a hybrid offline-online way. In particular, we investigate the user types present in the community along with their interactions, finding significant differences in usage behaviour at both the level of engagement and the types of community tasks/roles, and in how users interact with the network of contributors. We expect that these computational analysis strategies will be useful for gaining further understanding of other citizen science communities and projects.