Federal University of Santa Maria
Kolmogorov-Arnold Networks (KANs) have shown potential as an alternative to Multi-Layer Perceptrons (MLPs) in neural networks, providing universal function approximation with fewer parameters and reduced memory usage. In this paper, we explore the use of KANs as function approximators within the Proximal Policy Optimization (PPO) algorithm. We evaluate this approach by comparing its performance to the original MLP-based PPO using the DeepMind Control Proprio Robotics benchmark. Our results indicate that the KAN-based reinforcement learning algorithm can achieve comparable performance to its MLP-based counterpart, often with fewer parameters. These findings suggest that KANs may offer a more efficient option for reinforcement learning models.
13
The interplay between thermodynamics, general relativity and quantum mechanics has long intrigued researchers. Recently, important advances have been obtained in thermodynamics, mainly regarding its application to the quantum domain through fluctuation theorems. In this letter, we apply Fermi normal coordinates to report a fully general relativistic detailed quantum fluctuation theorem based on the two point measurement scheme. We demonstrate how the spacetime curvature can produce entropy in a localized quantum system moving in a general spacetime. The example of a quantum harmonic oscillator living in an expanding universe is presented. This result implies that entropy production is strongly observer dependent and deeply connects the arrow of time with the causal structure of the spacetime.
As quantum devices scale toward practical machine learning applications, the binary qubit paradigm faces expressivity and resource efficiency limitations. Multi-level quantum systems, or qudits, offer a promising alternative by harnessing a larger Hilbert space, enabling richer data embeddings, more compact variational circuits, and support for multi-valued problem structures. In this work, we review the role of qudits in quantum machine learning techniques, mainly variational quantum algorithms and quantum neural networks. Drawing on recent experimental demonstrations, including high-level superconducting transmons, qutrit-based combinatorial optimization, and single-qudit classifiers, we highlight how qudit architectures can reduce circuit depth and parameter counts while maintaining competitive fidelity. We further assess the evolving software ecosystem, from specialized simulators and differentiable-programming libraries to extensions of mainstream frameworks. We also identify key challenges in control complexity, noise management, and tooling maturity.
The forecast of wave variables are important for several applications that depend on a better description of the ocean state. Due to the chaotic behaviour of the differential equations which model this problem, a well know strategy to overcome the difficulties is basically to run several simulations, by for instance, varying the initial condition, and averaging the result of each of these, creating an ensemble. Moreover, in the last few years, considering the amount of available data and the computational power increase, machine learning algorithms have been applied as surrogate to traditional numerical models, yielding comparative or better results. In this work, we present a methodology to create an ensemble of different artificial neural networks architectures, namely, MLP, RNN, LSTM, CNN and a hybrid CNN-LSTM, which aims to predict significant wave height on six different locations in the Brazilian coast. The networks are trained using NOAA's numerical reforecast data and target the residual between observational data and the numerical model output. A new strategy to create the training and target datasets is demonstrated. Results show that our framework is capable of producing high efficient forecast, with an average accuracy of 80%80\%, that can achieve up to 88%88\% in the best case scenario, which means 5%5\% reduction in error metrics if compared to NOAA's numerical model, and a increasingly reduction of computational cost.
This paper introduces novel deep reinforcement learning (Deep-RL) techniques using parallel distributional actor-critic networks for navigating terrestrial mobile robots. Our approaches use laser range findings, relative distance, and angle to the target to guide the robot. We trained agents in the Gazebo simulator and deployed them in real scenarios. Results show that parallel distributional Deep-RL algorithms enhance decision-making and outperform non-distributional and behavior-based approaches in navigation and spatial generalization.
A novel neural network architecture from Brazilian physics researchers combines mixed-function neurons with second-order neurons to solve differential equations, reducing parameter count by up to 4 orders of magnitude compared to standard Physics-Informed Neural Networks while maintaining accuracy and enabling extraction of interpretable analytical expressions.
We investigate the formulation of work distributions for quantum scalar fields in static curved spacetimes by extending the Ramsey interferometric protocol originally developed in previous works for flat spacetimes. The use of Unruh-DeWitt particle detectors provides a causally consistent framework to define and measure work statistics, avoiding the limitations of the two-time projective measurement scheme in relativistic quantum field theory. We derive a non-perturbative expression for the characteristic function of the quantum field and apply it to thermal Kubo-Martin-Schwinger (KMS) states, showing that the resulting work distributions satisfy both the Crooks fluctuation theorem and the Jarzynski equality. Furthermore, we analyse the case of a pointlike detector, obtaining compact expressions for the first two moments of the work distribution, allowing us to recover the standard fluctuation-dissipation relation in the high-temperature limit. Our results demonstrate that fluctuation theorems hold for quantum fields interacting with Unruh-DeWitt particle detectors in static curved spacetimes.
Digital quantum simulation has emerged as a powerful approach to investigate complex quantum systems using digital quantum computers. Many-particle bosonic systems and intricate optical experimental setups pose significant challenges for classical simulation methods. Utilizing a recently developed formalism that maps bosonic operators to Pauli operators via the Gray code, we digitally simulate interferometric variants of Afshar's experiment on IBM's quantum computers. We investigate the analogous experiments of Unruh and Pessoa Júnior, exploring discussions on the apparent violation of Bohr's complementarity principle when considering the entire experimental setup. Furthermore, we analyze these experiments within the framework of an updated quantum complementarity principle, which applies to specific quantum state preparations and remains consistent with the foundational principles of quantum mechanics. Our quantum computer demonstration results are in good agreement with the theoretical predictions and underscore the potential of quantum computers as effective simulators for bosonic systems.
Quantum computing with qudits, an extension of qubits to multiple levels, is a research field less mature than qubit-based quantum computing. However, qudits can offer some advantages over qubits, by representing information with fewer separated components. In this article, we present QuForge, a Python-based library designed to simulate quantum circuits with qudits. This library provides the necessary quantum gates for implementing quantum algorithms, tailored to any chosen qudit dimension. Built on top of differentiable frameworks, QuForge supports execution on accelerating devices such as GPUs and TPUs, significantly speeding up simulations. It also supports sparse operations, leading to a reduction in memory consumption compared to other libraries. Additionally, by constructing quantum circuits as differentiable graphs, QuForge facilitates the implementation of quantum machine learning algorithms, enhancing the capabilities and flexibility of quantum computing research.
Quantum entanglement is a foundational resource in quantum information science, underpinning applications across physics. However, detecting and quantifying entanglement remains a significant challenge. Here, we introduce a variational quantum algorithm inspired by Uhlmann's theorem to quantify the Bures entanglement of general quantum states, a method that naturally extends to other quantum resources, including genuine multipartite entanglement, quantum discord, quantum coherence, and total correlations, while also enabling reconstruction of the closest free states. The algorithm requires a polynomial number of ancillary qubits and circuit depth relative to the system size, dimensionality, and free state cardinality, making it scalable for practical implementations. Thus, it provides a versatile and efficient framework for quantifying quantum resources, demonstrated through several applications.
In this work, we present Curled-Dreamer, a novel reinforcement learning algorithm that integrates contrastive learning into the DreamerV3 framework to enhance performance in visual reinforcement learning tasks. By incorporating the contrastive loss from the CURL algorithm and a reconstruction loss from autoencoder, Curled-Dreamer achieves significant improvements in various DeepMind Control Suite tasks. Our extensive experiments demonstrate that Curled-Dreamer consistently outperforms state-of-the-art algorithms, achieving higher mean and median scores across a diverse set of tasks. The results indicate that the proposed approach not only accelerates learning but also enhances the robustness of the learned policies. This work highlights the potential of combining different learning paradigms to achieve superior performance in reinforcement learning applications.
We introduce a novel framework for simulating spin models using differentiable programming, an approach that leverages the advancements in machine learning and computational efficiency. We focus on three distinct spin systems: the Ising model, the Potts model, and the Cellular Potts model, demonstrating the practicality and scalability of our framework in modeling these complex systems. Additionally, this framework allows for the optimization of spin models, which can adjust the parameters of a system by a defined objective function. In order to simulate these models, we adapt the Metropolis-Hastings algorithm to a differentiable programming paradigm, employing batched tensors for simulating spin lattices. This adaptation not only facilitates the integration with existing deep learning tools but also significantly enhances computational speed through parallel processing capabilities, as it can be implemented on different hardware architectures, including GPUs and TPUs.
Identifying phase transition points is a fundamental challenge in condensed matter physics, particularly for transitions driven by quantum interference effects, such as Anderson and many-body localization. Recent studies have demonstrated that quantum coherence provides an effective means of detecting localization transitions, offering a practical alternative to full quantum state tomography and related approaches. Building on this idea, we investigate localization transitions through complementarity relations that connect local predictability, local coherence, and entanglement in bipartite pure states. Our results show that predictability serves as a robust and efficient marker for localization transitions. Crucially, its experimental determination requires exponentially fewer measurements than coherence or entanglement, making it a powerful tool for probing quantum phase transitions.
This paper develops a geometrodynamic extension of Bohmian mechanics to describe quantum tunneling through a potential barrier, treating particle trajectories as geodesics in an Alcubierre-type spacetime. The model provides analytical expressions for the quantum potential, particle dynamics, and tunneling time, explicitly linked to the underlying spacetime geometry. For narrow barriers, the tunneling time depends on the barrier width, while for sufficiently wide barriers, it saturates to a constant value-recovering the Hartman effect. This behavior arises from a geometric self-regulation mechanism, where the quantum potential dynamically adjusts the spacetime distortion to maintain a fixed tunneling time, consistent with relativistic causality despite effective superluminal propagation. The results establish a direct connection between quantum tunneling and spacetime geometry, offering a unified framework to interpret the Hartman effect. This approach naturally incorporates relativistic constraints while suggesting that similar geometric mechanisms may underlie other quantum phenomena, such as topological phases in condensed matter systems.
In this paper, we introduce a data-compilation ensemble, primarily intended to serve as a resource for researchers in the field of dereverberation, particularly for data-driven approaches. It comprises speech and song samples, together with acoustic guitar sounds, with original annotations pertinent to emotion recognition and Music Information Retrieval (MIR). Moreover, it includes a selection of impulse response (IR) samples with varying Reverberation Time (RT) values, providing a wide range of conditions for evaluation. This data-compilation can be used together with provided Python scripts, for generating auralized data ensembles in different sizes: tiny, small, medium and large. Additionally, the provided metadata annotations also allow for further analysis and investigation of the performance of dereverberation algorithms under different conditions. All data is licensed under Creative Commons Attribution 4.0 International License.
The rapid development of quantum computers promises transformative impacts across diverse fields of science and technology. Quantum neural networks (QNNs), as a forefront application, hold substantial potential. Despite the multitude of proposed models in the literature, persistent challenges, notably the vanishing gradient (VG) and cost function concentration (CFC) problems, impede their widespread success. In this study, we introduce a novel approach to quantum neural network construction, specifically addressing the issues of VG and CFC. Our methodology employs ensemble learning, advocating for the simultaneous deployment of multiple quantum circuits with a depth equal to 11, a departure from the conventional use of a single quantum circuit with depth LL. We assess the efficacy of our proposed model through a comparative analysis with a conventionally constructed QNN. The evaluation unfolds in the context of a classification problem, yielding valuable insights into the potential advantages of our innovative approach.
3
Accurate calibration of finite element (FE) models is essential across various biomechanical applications, including human intervertebral discs (IVDs), to ensure their reliability and use in diagnosing and planning treatments. However, traditional calibration methods are computationally intensive, requiring iterative, derivative-free optimization algorithms that often take days to converge. This study addresses these challenges by introducing a novel, efficient, and effective calibration method demonstrated on a human L4-L5 IVD FE model as a case study using a neural network (NN) surrogate. The NN surrogate predicts simulation outcomes with high accuracy, outperforming other machine learning models, and significantly reduces the computational cost associated with traditional FE simulations. Next, a Projected Gradient Descent (PGD) approach guided by gradients of the NN surrogate is proposed to efficiently calibrate FE models. Our method explicitly enforces feasibility with a projection step, thus maintaining material bounds throughout the optimization process. The proposed method is evaluated against SOTA Genetic Algorithm and inverse model baselines on synthetic and in vitro experimental datasets. Our approach demonstrates superior performance on synthetic data, achieving an MAE of 0.06 compared to the baselines' MAE of 0.18 and 0.54, respectively. On experimental specimens, our method outperforms the baseline in 5 out of 6 cases. While our approach requires initial dataset generation and surrogate training, these steps are performed only once, and the actual calibration takes under three seconds. In contrast, traditional calibration time scales linearly with the number of specimens, taking up to 8 days in the worst-case. Such efficiency paves the way for applying more complex FE models, potentially extending beyond IVDs, and enabling accurate patient-specific simulations.
Bohr's complementarity principle has long been a fundamental concept in quantum mechanics, positing that, within a given experimental setup, a quantum system (or quanton) can exhibit either its wave-like character, denoted as WW, or its particle-like character, denoted as PP, but not both simultaneously. Modern interpretations of Bohr's complementarity principle acknowledge the coexistence of these aspects in the same experiment while introducing the constraint W+PαW + P \le \alpha. Notably, estimations of WW or PP frequently rely on indirect retrodiction methods, a practice that has led to the claim of the violation of Bohr's complementarity principle. By taking a different route, recent advancements demonstrate that quantum complementarity relations can be rigorously derived from the axioms of quantum mechanics. To reconcile these observations and eliminate potential paradoxes or violations, we propose an updated formulation for the quantum complementarity principle, which is stated as follows: \textit{For a given quantum state preparation ρt\rho_t at a specific instant of time tt, the wave and particle behaviors of a quanton are constrained by a complementarity relation $\mathfrak{W}(\rho_t) + \mathfrak{P}(\rho_t) \le \alpha(d)$, which is derived directly from the axioms of quantum mechanics.}
Wishart random matrices with a sparse or diluted structure are ubiquitous in the processing of large datasets, with applications in physics, biology and economy. In this work we develop a theory for the eigenvalue fluctuations of diluted Wishart random matrices, based on the replica approach of disordered systems. We derive an analytical expression for the cumulant generating function of the number of eigenvalues IN(x)\mathcal{I}_N(x) smaller than xR+x\in\mathbb{R}^{+}, from which all cumulants of IN(x)\mathcal{I}_N(x) and the rate function Ψx(k)\Psi_{x}(k) controlling its large deviation probability Prob[IN(x)=kN]eNΨx(k)\text{Prob}[\mathcal{I}_N(x)=kN] \asymp e^{-N\Psi_{x}(k)} follow. Explicit results for the mean value and the variance of IN(x)\mathcal{I}_N(x), its rate function, and its third cumulant are discussed and thoroughly compared to numerical diagonalization, showing a very good agreement. The present work establishes the theoretical framework put forward in a recent letter [Phys. Rev. Lett. 117, 104101] as an exact and compelling approach to deal with eigenvalue fluctuations of sparse random matrices.
This work presents a study on parallel and distributional deep reinforcement learning applied to the mapless navigation of UAVs. For this, we developed an approach based on the Soft Actor-Critic method, producing a distributed and distributional variant named PDSAC, and compared it with a second one based on the traditional SAC algorithm. In addition, we also embodied a prioritized memory system into them. The UAV used in the study is based on the Hydrone vehicle, a hybrid quadrotor operating solely in the air. The inputs for the system are 23 range findings from a Lidar sensor and the distance and angles towards a desired goal, while the outputs consist of the linear, angular, and, altitude velocities. The methods were trained in environments of varying complexity, from obstacle-free environments to environments with multiple obstacles in three dimensions. The results obtained, demonstrate a concise improvement in the navigation capabilities by the proposed approach when compared to the agent based on the SAC for the same amount of training steps. In summary, this work presented a study on deep reinforcement learning applied to mapless navigation of drones in three dimensions, with promising results and potential applications in various contexts related to robotics and autonomous air navigation with distributed and distributional variants.
There are no more papers matching your filters at the moment.