University of Münster
We study the dynamics of an optoelectronic circuit composed of an excitable nanoscale resonant-tunneling diode (RTD) driving a nanolaser diode (LD) coupled via time-delayed feedback. Using a combination of numerical path-continuation methods and time simulations, we demonstrate that this RTD-LD system can serve as an artificial neuron, generating pulses in the form of temporal localized states (TLSs) that can be employed as memory for neuromorphic computing. In particular, our findings reveal that the prototypical delayed FitzHugh-Nagumo model previously employed to model the RTD-LD resembles our more realistic model only in the limit of a slow RTD. We show that the RTD time scale plays a critical role in memory capacity as it governs a shift in pulse interaction from repulsive to attractive, leading to a transition from stable to unstable multi-pulse TLSs. Our theoretical analysis uncovers features and challenges previously unknown for the RTD-LD system, including the multistability of TLSs and attractive interaction forces, stemming from the previously neglected intrinsic dynamics of the laser. These effects are crucial to consider since they define the memory properties of the RTD-LD.
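For orientation, a generic delayed FitzHugh-Nagumo system of the kind referenced above (a schematic textbook form with a delayed feedback term of strength $\eta$ and delay $\tau$; the abstract does not spell out the exact model used):

\[
  \varepsilon \dot{u}(t) = u - \frac{u^3}{3} - v + \eta\, u(t-\tau), \qquad
  \dot{v}(t) = u + a ,
\]

where $u$ is the fast (voltage-like) variable, $v$ the slow recovery variable, $\varepsilon \ll 1$ the time-scale separation, and $a$ sets the distance to the excitability threshold.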
The XLZD collaboration is developing a two-phase xenon time projection chamber with an active mass of 60 to 80 t, capable of probing the remaining WIMP-nucleon interaction parameter space down to the so-called neutrino fog. In this work we show that, based on the performance of currently operating detectors using the same technology and a realistic reduction of radioactivity in detector materials, such an experiment will also be able to competitively search for neutrinoless double beta decay in $^{136}$Xe using a natural-abundance xenon target. XLZD can reach a $3\sigma$ discovery potential half-life of $5.7\times10^{27}$ yr (and a 90% CL exclusion of $1.3\times10^{28}$ yr) with 10 years of data taking, corresponding to a Majorana mass range of 7.3-31.3 meV (4.8-20.5 meV). XLZD will thus exclude the inverted neutrino mass ordering parameter space and will start to probe the normal ordering region for most of the nuclear matrix elements commonly considered by the community.
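For reference, the quoted Majorana mass ranges follow from the standard relation between the $0\nu\beta\beta$ half-life and the effective Majorana mass (a textbook formula, not spelled out in the abstract; the spread in the nuclear matrix element $M^{0\nu}$ produces the quoted ranges):

\[
  \left[ T^{0\nu}_{1/2} \right]^{-1} = G^{0\nu}\, g_A^4\, \bigl| M^{0\nu} \bigr|^2\, \frac{\langle m_{\beta\beta} \rangle^2}{m_e^2},
\]

with $G^{0\nu}$ the phase-space factor, $g_A$ the axial-vector coupling, and $m_e$ the electron mass.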
Neutrinos are the most abundant fundamental matter particles in the Universe and play a crucial role in particle physics and cosmology. Neutrino oscillation, discovered about 25 years ago, reveals that the three known species mix with each other. Anomalous results from reactor and radioactive-source experiments suggest a possible fourth neutrino state, the sterile neutrino, which does not interact via the weak force. The KATRIN experiment, primarily designed to measure the neutrino mass via tritium $\beta$-decay, also searches for sterile neutrinos suggested by these anomalies. A sterile-neutrino signal would appear as a distortion of the $\beta$-decay energy spectrum, characterized by a discontinuity in curvature (kink) at an energy set by the sterile-neutrino mass. This signature, which depends only on the shape of the spectrum rather than its absolute normalization, offers a robust approach complementary to reactor experiments. KATRIN examined the energy spectrum of 36 million tritium $\beta$-decay electrons, recorded over 259 measurement days, within the last 40 electronvolts below the endpoint. The results exclude a substantial part of the parameter space suggested by the gallium anomaly and challenge the Neutrino-4 claim. Together with other neutrino-disappearance experiments, KATRIN probes sterile-to-active mass splittings from a fraction of an electronvolt squared to several hundred electronvolts squared, excluding light sterile neutrinos with mixing angles above a few percent.
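Schematically (the standard parametrization in the sterile-neutrino literature, not given explicitly in the abstract), the measured spectrum is a superposition of an active and a sterile branch,

\[
  \frac{d\Gamma}{dE} = \cos^2\theta\, \frac{d\Gamma}{dE}\bigl(m_\nu\bigr) + \sin^2\theta\, \frac{d\Gamma}{dE}\bigl(m_4\bigr),
\]

where $\theta$ is the active-sterile mixing angle and $m_4$ the sterile-neutrino mass; the sterile branch opens up at an energy $m_4$ below the endpoint, producing the curvature kink.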
Deep learning algorithms -- typically consisting of a class of deep artificial neural networks (ANNs) trained by a stochastic gradient descent (SGD) optimization method -- are nowadays an integral part of many areas of science, industry, and also our day-to-day life. Roughly speaking, in their most basic form, ANNs can be regarded as functions consisting of a series of compositions of affine-linear functions with multidimensional versions of so-called activation functions. One of the most popular such activation functions is the rectified linear unit (ReLU) function $\mathbb{R} \ni x \mapsto \max\{ x, 0 \} \in \mathbb{R}$. The ReLU function is, however, not differentiable and, typically, this lack of regularity transfers to the cost function of the supervised learning problem under consideration. Regardless of this lack of differentiability, deep learning practitioners apply SGD methods based on suitably generalized gradients in standard deep learning libraries like {\sc TensorFlow} or {\sc PyTorch}. In this work we reveal an accurate and concise mathematical description of such generalized gradients in the training of deep fully-connected feedforward ANNs and we also study the resulting generalized gradient function analytically. Specifically, we provide an appropriate approximation procedure that uniquely describes the generalized gradient function, we prove that the generalized gradients are limiting Fréchet subgradients of the cost functional, and we conclude that the generalized gradients must coincide with the standard gradient of the cost functional on every open set on which the cost functional is continuously differentiable.
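A minimal sketch (assuming PyTorch is available) of the behaviour described above: at the non-differentiable kink $x = 0$, the library's generalized gradient of the ReLU function returns the value 0.

```python
import torch

# ReLU is not differentiable at 0; PyTorch's autograd nevertheless
# returns a generalized gradient there, namely the value 0.
x = torch.tensor([-1.0, 0.0, 1.0], requires_grad=True)
torch.relu(x).sum().backward()
print(x.grad)  # tensor([0., 0., 1.]) -- 0 is the chosen value at the kink
```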
Researchers from the University of Münster and CUHK-Shenzhen establish optimal convergence rates for the original Adam optimizer in stochastic optimization problems, demonstrating that Adam converges to the zeros of a newly identified "Adam vector field" rather than directly to the objective function's gradient zeros. They show this limit point approaches the true minimizer at a rate of M⁻¹ as mini-batch size M increases.
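A toy numerical sketch in the spirit of this result (not the paper's analysis): Adam with constant hyperparameters on the stochastic quadratic loss $(\theta - x)^2/2$ with skewed data settles near a limit point offset from the true minimizer, and the offset shrinks as the mini-batch size M grows. The data distribution, hyperparameters ($\beta_2 = 0.9$ chosen to make the offset visible), and step counts are illustrative choices, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def adam_limit(M, steps=300_000, lr=1e-3, b1=0.9, b2=0.9, eps=1e-8):
    theta, m, v, tail = 0.0, 0.0, 0.0, []
    for t in range(1, steps + 1):
        x = rng.exponential(1.0, size=M)  # skewed data; true minimizer is E[X] = 1
        g = theta - x.mean()              # mini-batch gradient of (theta - x)^2 / 2
        m = b1 * m + (1 - b1) * g
        v = b2 * v + (1 - b2) * g * g
        mhat, vhat = m / (1 - b1 ** t), v / (1 - b2 ** t)
        theta -= lr * mhat / (np.sqrt(vhat) + eps)
        if t > steps // 2:
            tail.append(theta)            # average the tail to suppress noise
    return np.mean(tail)

for M in (1, 4, 16, 64):
    print(M, adam_limit(M))  # the gap to the true minimizer 1.0 shrinks with M
```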
Researchers from the University of Münster, The Chinese University of Hong Kong, Shenzhen, and ETH Zurich present a mathematically rigorous overview of Denoising Diffusion Probabilistic Models (DDPMs), detailing their foundational stochastic processes and training objectives. The work formalizes concepts from basic DDPMs to advanced variants like Stable Diffusion, providing a comprehensive theoretical framework for generative artificial intelligence.
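A minimal sketch (assuming PyTorch; `eps_model` is a placeholder network, and the linear noise schedule is the common textbook choice, not necessarily the paper's) of the two basic DDPM ingredients such an overview formalizes: the closed-form forward noising $q(x_t \mid x_0) = \mathcal{N}(\sqrt{\bar{\alpha}_t}\, x_0, (1-\bar{\alpha}_t) I)$ and the noise-prediction training objective.

```python
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)     # common linear noise schedule
abar = torch.cumprod(1.0 - betas, dim=0)  # \bar{alpha}_t = prod_s (1 - beta_s)

def ddpm_loss(eps_model, x0):
    """Simple DDPM objective: predict the noise injected into x0."""
    t = torch.randint(0, T, (x0.shape[0],))
    eps = torch.randn_like(x0)
    a = abar[t].view(-1, *([1] * (x0.dim() - 1)))
    xt = a.sqrt() * x0 + (1.0 - a).sqrt() * eps  # sample from q(x_t | x_0)
    return ((eps - eps_model(xt, t)) ** 2).mean()
```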
Researchers from the XLZD Collaboration developed a model-independent, likelihood-free search pipeline for new physics in the proposed DARWIN experiment, utilizing semi-supervised deep learning on high-dimensional detector data. The pipeline achieved a median sensitivity of approximately 3σ to reject the background-only hypothesis for a benchmark WIMP signal after 200 ton-years of exposure, substantially outperforming traditional likelihood-based methods that reached only 1σ.
Deep learning methods - consisting of a class of deep neural networks (DNNs) trained by a stochastic gradient descent (SGD) optimization method - are nowadays key tools for solving data-driven supervised learning problems. Despite the great success of SGD methods in the training of DNNs, it remains a fundamental open problem of research to explain the success and the limitations of such methods in rigorous theoretical terms. In particular, even in the standard setup of data-driven supervised learning problems, it remained an open research problem to prove (or disprove) that SGD methods converge with high probability to global minimizers in the optimization landscape when training DNNs with the popular rectified linear unit (ReLU) activation function. In this work we answer this question negatively. Specifically, we prove for a large class of SGD methods that the considered optimizer does, with high probability, not converge to global minimizers of the optimization problem. It turns out that the probability of not converging to a global minimizer converges at least exponentially quickly to one as the width of the first hidden layer and the depth of the DNN, respectively, increase. The general non-convergence results of this work apply not only to the plain vanilla standard SGD method but also to a large class of accelerated and adaptive SGD methods such as the momentum SGD, the Nesterov accelerated SGD, the Adagrad, the RMSProp, the Adam, the Adamax, the AMSGrad, and the Nadam optimizers.
Despite the omnipresent use of stochastic gradient descent (SGD) optimization methods in the training of deep neural networks (DNNs), it remains, in basically all practically relevant scenarios, a fundamental open problem to provide a rigorous theoretical explanation for the success (and the limitations) of SGD optimization methods in deep learning. In particular, it remains an open question to prove or disprove convergence of the true risk of SGD optimization methods to the optimal true risk value in the training of DNNs. In one of the main results of this work we reveal for a general class of activations, loss functions, random initializations, and SGD optimization methods (including, for example, standard SGD, momentum SGD, Nesterov accelerated SGD, Adagrad, RMSprop, Adadelta, Adam, Adamax, Nadam, Nadamax, and AMSGrad) that in the training of any arbitrary fully-connected feedforward DNN it does not hold that the true risk of the considered optimizer converges in probability to the optimal true risk value. Nonetheless, the true risk of the considered SGD optimization method may very well converge to a strictly suboptimal true risk value.
Radiogenic neutrons emitted by detector materials are one of the most challenging backgrounds for the direct search for dark matter in the form of weakly interacting massive particles (WIMPs). To mitigate this background, the XENONnT experiment is equipped with a novel gadolinium-doped water Cherenkov detector, which encloses the xenon dual-phase time projection chamber (TPC). The neutron veto (NV) tags neutrons via their capture on gadolinium or hydrogen, which releases $\gamma$-rays that are subsequently detected as Cherenkov light. In this work, we present the key features and the first results of the XENONnT NV when operated with demineralized water in the initial phase of the experiment. Its efficiency for detecting neutrons is $(82\pm 1)\,\%$, the highest neutron detection efficiency achieved in a water Cherenkov detector. This enables a high efficiency of $(53\pm 3)\,\%$ for the tagging of WIMP-like neutron signals, inside a tagging time window of $250\,\mathrm{\mu s}$ between TPC and NV, leading to a livetime loss of $1.6\,\%$ during the first science run of XENONnT.
We study a continuous-time system that solves optimization problems over the set of orthonormal matrices, which is also known as the Stiefel manifold. The resulting optimization flow follows a path that is not always on the manifold but asymptotically lands on the manifold. We introduce a generalized Stiefel manifold to which we extend the canonical metric of the Stiefel manifold. We show that the vector field of the proposed flow can be interpreted as the sum of a Riemannian gradient on a generalized Stiefel manifold and a normal vector. Moreover, we prove that the proposed flow globally converges to the set of critical points, and any local minimum and isolated critical point is asymptotically stable.
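For background (the classical Edelman-Arias-Smith formulas for the ordinary Stiefel manifold; the paper's generalized manifold and extended metric go beyond this), the feasible set and the Riemannian gradient of a smooth $f$ with Euclidean gradient $G = \nabla f(X)$ under the canonical metric are

\[
  \mathrm{St}(n,p) = \bigl\{ X \in \mathbb{R}^{n \times p} : X^\top X = I_p \bigr\},
  \qquad
  \operatorname{grad} f(X) = G - X G^\top X .
\]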
Averaging techniques such as Ruppert--Polyak averaging and exponential moving averaging (EMA) are powerful approaches to accelerate stochastic gradient descent (SGD) optimization methods such as the popular ADAM optimizer. However, depending on the specific optimization problem under consideration, the type and the parameters of the averaging need to be adjusted to achieve the smallest optimization error. In this work we propose an averaging approach, which we refer to as parallel averaged ADAM (PADAM), in which we compute several averaged variants of ADAM in parallel and, during the training process, dynamically select the variant with the smallest optimization error. A central feature of this approach is that it requires no more gradient evaluations than the usual ADAM optimizer, as each of the averaged trajectories relies on the same underlying ADAM trajectory and thus on the same underlying gradients. We test the proposed PADAM optimizer on 13 stochastic optimization and deep neural network (DNN) learning problems and compare its performance with known optimizers from the literature such as standard SGD, momentum SGD, ADAM with and without EMA, and ADAMW. In particular, we apply the compared optimizers to physics-informed neural network, deep Galerkin, deep backward stochastic differential equation, and deep Kolmogorov approximations for boundary value partial differential equation problems from scientific machine learning, as well as to DNN approximations for optimal control and optimal stopping problems. In nearly all of the considered examples PADAM achieves, sometimes jointly with other optimizers and sometimes exclusively, essentially the smallest optimization error. This work thus strongly suggests considering PADAM for scientific machine learning problems and also motivates further research on adaptive averaging procedures within the training of DNNs.
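A minimal sketch (assuming PyTorch; the decay values and the selection rule are illustrative choices, not the paper's exact PADAM specification) of the core idea: maintain several exponential moving averages of a single ADAM trajectory in parallel -- reusing the same gradients, so no extra gradient evaluations -- and return whichever averaged copy currently attains the smallest estimated loss.

```python
import copy
import itertools
import torch

def train_padam(model, loss_fn, data_loader, decays=(0.0, 0.9, 0.99, 0.999), steps=1000):
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    averages = [copy.deepcopy(model) for _ in decays]  # decay 0.0 keeps the raw ADAM iterate
    data = itertools.cycle(data_loader)
    for _ in range(steps):
        x, y = next(data)
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()
        for avg, d in zip(averages, decays):
            for p_avg, p in zip(avg.parameters(), model.parameters()):
                p_avg.data.mul_(d).add_(p.data, alpha=1.0 - d)  # EMA of the iterates
    x, y = next(data)  # monitoring batch for the dynamic selection
    with torch.no_grad():
        return min(averages, key=lambda m: loss_fn(m(x), y).item())
```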
In the recent sixth data release (DR6) of the Atacama Cosmology Telescope (ACT) collaboration, the value of $n_{\rm s}=0.9743 \pm 0.0034$ for the scalar spectral index is reported, which excludes the Starobinsky and Higgs inflationary models at the $2\sigma$ level. In this paper, we perform a Bayesian inference of the parameters of the Starobinsky or Higgs inflationary model with non-instantaneous reheating using the Markov chain Monte Carlo method. For the analysis, we use observational data on the cosmic microwave background collected by the Planck and ACT collaborations and on baryonic acoustic oscillations from the DESI collaboration. The reheating stage is modelled by a single parameter $R_{\rm reh}$. Using the modified Boltzmann code CLASS and the cobaya software with the GetDist package, we perform a direct inference of the model parameter space and obtain posterior distributions of the parameters. Using the Kullback--Leibler divergence, we estimate the information gain from the data, yielding $2.52$ bits for the reheating parameter. Inclusion of the ACT DR6 data provides $75\%$ more information about the reheating stage compared to the analysis without ACT data. We draw constraints on the reheating temperature and the average equation of state. While the former can vary within $10$ orders of magnitude, values in the $95\%$ credible interval indicate a sufficiently low reheating temperature; for the latter there is a clear preference for values greater than $0.5$, which means that the conventional equations of state for dust ($\omega=0$) and relativistic matter ($\omega=1/3$) are excluded at more than the $2\sigma$ level of significance. However, there is still a large part of parameter space where the Starobinsky and Higgs inflationary models exhibit a high degree of consistency with the latest observational data, particularly from ACT DR6. Therefore, it is premature to reject these models.
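For context, the information gain quoted above is the Kullback--Leibler divergence of the posterior $p(\theta \mid d)$ from the prior $p(\theta)$, with the base-2 logarithm giving bits:

\[
  D_{\mathrm{KL}} = \int p(\theta \mid d)\, \log_2 \frac{p(\theta \mid d)}{p(\theta)}\, d\theta .
\]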
We report on the search for X-ray radiation as predicted from dynamical quantum collapse with low-energy electronic recoil data in the energy range of 1-140 keV from the first science run of the XENONnT dark matter detector. Spontaneous radiation is an unavoidable effect of dynamical collapse models, which were introduced as a possible solution to the long-standing measurement problem in quantum mechanics. The analysis utilizes a model that for the first time accounts for cancellation effects in the emitted spectrum, which arise in the X-ray range due to the opposing electron-proton charges in xenon atoms. New world-leading limits on the free parameters of the Markovian continuous spontaneous localization and Diósi-Penrose models are set, improving previous best constraints by two orders of magnitude and a factor of five, respectively. The original values proposed for the strength and the correlation length of the continuous spontaneous localization model are excluded experimentally for the first time.
High-order methods have shown great potential to overcome performance issues of simulations of partial differential equations (PDEs) on modern hardware, yet many users still stick to low-order, matrix-based simulations, in particular in porous media applications. Heterogeneous coefficients and low regularity of the solution are reasons not to employ high-order discretizations. We present a new approach for the simulation of instationary PDEs that partially mitigates these performance problems. By reformulating the original problem we derive a parallel-in-time integrator that increases the arithmetic intensity and introduces additional structure into the problem, thereby helping to accelerate matrix-based simulations on modern hardware architectures. Based on a system for multiple time steps, we formulate a matrix equation that can be solved using vectorised solvers like block Krylov methods. The structure of this approach makes it applicable to a wide range of linear and nonlinear problems. In our numerical experiments we present first results for three different PDEs: a linear convection-diffusion equation, a nonlinear diffusion-reaction equation, and a realistic example based on the Richards equation.
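As an illustration of this kind of reformulation (a generic all-at-once implicit Euler system with mass matrix $M$ and stiffness matrix $K$, not necessarily the paper's exact construction): collecting $s$ time steps as columns of $U = [u^1, \dots, u^s]$ turns the sequential recursion $(M + \Delta t\, K)\, u^n = M u^{n-1} + \Delta t\, f^n$ into the matrix equation

\[
  (M + \Delta t\, K)\, U - M U C = F ,
\]

where $C$ is the $s \times s$ shift matrix with ones on the first superdiagonal and $F$ collects the right-hand sides (with $M u^0$ folded into its first column). The $s$ coupled columns are exactly the structure that vectorised block Krylov solvers can exploit.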
It is known that the standard stochastic gradient descent (SGD) optimization method, as well as accelerated and adaptive SGD optimization methods such as the Adam optimizer, fail to converge if the learning rates do not converge to zero (as, for example, in the situation of constant learning rates). Numerical simulations often use human-tuned deterministic learning rate schedules or small constant learning rates. The default learning rate schedules for SGD optimization methods in machine learning implementation frameworks such as TensorFlow and PyTorch are constant learning rates. In this work we propose and study a learning-rate-adaptive approach for SGD optimization methods in which the learning rate is adjusted based on empirical estimates of the values of the objective function of the considered optimization problem (the function that one intends to minimize). In particular, we propose a learning-rate-adaptive variant of the Adam optimizer and apply it to several neural network learning problems, particularly in the context of deep learning approximation methods for partial differential equations such as deep Kolmogorov methods, physics-informed neural networks, and deep Ritz methods. In each of the presented learning problems the proposed learning-rate-adaptive variant of the Adam optimizer reduces the value of the objective function faster than the Adam optimizer with the default learning rate. For a simple class of quadratic minimization problems we also rigorously prove that a learning-rate-adaptive variant of the SGD optimization method converges to the minimizer of the considered minimization problem. Our convergence proof is based on an analysis of the laws of invariant measures of the SGD method as well as on a more general convergence analysis for SGD with random but predictable learning rates, which we develop in this work.
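A minimal sketch of one plausible instantiation of such a rule (the paper's precise adaptation scheme may differ; the halving factor, check interval, and monitoring batch are illustrative assumptions): periodically estimate the objective on a batch and reduce the Adam learning rate whenever the estimate stops improving.

```python
import itertools
import torch

def train_lr_adaptive(model, loss_fn, data_loader, lr=1e-3, steps=10_000, check_every=500):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    data = itertools.cycle(data_loader)
    best = float("inf")
    for step in range(1, steps + 1):
        x, y = next(data)
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()
        if step % check_every == 0:
            with torch.no_grad():  # empirical estimate of the objective value
                est = loss_fn(model(x), y).item()
            if est >= best:        # no improvement: shrink the learning rate
                for group in opt.param_groups:
                    group["lr"] *= 0.5
            best = min(best, est)
    return model
```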
The projected sensitivity of the effective electron neutrino-mass measurement with the KATRIN experiment is below 0.3 eV (90% CL) after five years of data acquisition. The sensitivity is affected by the increased rate of background electrons from KATRIN's main spectrometer. A special shifted-analysing-plane (SAP) configuration was developed to reduce this background by a factor of two. The complex layout of electromagnetic fields in the SAP configuration requires a robust method of estimating these fields. In this paper we present a dedicated calibration measurement of the fields using conversion electrons of gaseous $^{83\mathrm{m}}$Kr, which enables neutrino-mass measurements in the SAP configuration.
The goal of this whitepaper is to give a comprehensive overview of the rich field of forward physics. We discuss the occurrence of BFKL resummation effects in special final states, such as Mueller-Navelet jets, jet-gap-jet events, and heavy quarkonium production. The whitepaper further addresses TMD factorization at low x and the manifestation of a semi-hard saturation scale in (generalized) TMD PDFs. More theoretical aspects of low-x physics, probes of the quark-gluon plasma, the possibility to use photon-hadron collisions at the LHC to constrain hadronic structure at low x, and the resulting complementarity between the LHC and the EIC are also presented. We also briefly discuss diffraction at colliders as well as the possibility to further explore the electroweak theory in central exclusive events using the LHC as a photon-photon collider.
We investigate the nonlinear dynamics of vertically emitting Kerr microcavities under detuned optical injection, considering the impact of slow thermal effects. Our model integrates thermal detuning caused by refractive index shifts due to heating. Through numerical and analytical approaches, we uncover a rich spectrum of dynamical behaviors, including excitable thermo-optical pulses, mixed-mode oscillations, and chaotic spiking, governed by a higher-dimensional canard scenario. Introducing a long external feedback loop, with time delays comparable to the microcavity photon lifetime but shorter than thermal relaxation timescales, reveals how delay affects excitability and stabilizes temporal localized states. Our findings extend the understanding of excitable systems, demonstrating how thermal and feedback mechanisms interplay to shape nonlinear optical dynamics. Furthermore, our approach paves the way for the study of cavity stabilization and cavity cooling using an additional control beam.
Predictive business process monitoring (PBPM) is a class of techniques designed to predict behaviour, such as next activities, in running traces. PBPM techniques aim to improve process performance by providing predictions to process analysts, supporting them in their decision making. However, the limited predictive quality of PBPM techniques has been considered the essential obstacle to establishing such techniques in practice. With the use of deep neural networks (DNNs), the predictive quality of these techniques could be improved for tasks like next activity prediction. While DNNs achieve a promising predictive quality, they still lack comprehensibility due to their hierarchical approach to learning representations. Nevertheless, process analysts need to comprehend the cause of a prediction to identify intervention mechanisms that might affect the decision making to secure process performance. In this paper, we propose XNAP, the first explainable, DNN-based PBPM technique for next activity prediction. XNAP integrates a layer-wise relevance propagation method from the field of explainable artificial intelligence to make the predictions of a long short-term memory DNN explainable by providing relevance values for activities. We show the benefit of our approach through two real-life event logs.
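A minimal sketch (NumPy; an illustration of the epsilon-rule of layer-wise relevance propagation for a single dense layer, not XNAP's full LSTM treatment) of how output relevance is redistributed to a layer's inputs:

```python
import numpy as np

def lrp_epsilon(a, W, b, R_out, eps=1e-6):
    """Epsilon-rule LRP: redistribute relevance R_out of z = a @ W + b to the inputs a."""
    z = a @ W + b                                        # forward pre-activations, shape (k,)
    s = R_out / (z + eps * np.where(z >= 0, 1.0, -1.0))  # stabilized relevance per unit of z
    return a * (W @ s)                                   # input relevance, shape (d,)
```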