Researchers from Ghent and Harvard Universities introduce a framework for "alignment discretion," a concept adapted from legal philosophy, to analyze how human and algorithmic annotators interpret and apply principles in AI alignment. Their empirical analysis, using novel metrics on safety datasets, reveals that human annotators exhibit substantial arbitrariness, and current large language models often fail to replicate human principle prioritization.
This research proposes and evaluates a novel model named K-Fold Causal Bayesian Additive Regression Trees (K-Fold Causal BART) for improved estimation of Average Treatment Effects (ATE) and Conditional Average Treatment Effects (CATE). The study employs synthetic and semi-synthetic datasets, including the widely recognized Infant Health and Development Program (IHDP) benchmark, to validate the model's performance. Despite promising results in synthetic scenarios, the IHDP dataset reveals that the proposed model is not state-of-the-art for ATE and CATE estimation. Nonetheless, the research provides several novel insights: (1) the ps-BART model is likely the preferred choice for CATE and ATE estimation due to better generalization than the other benchmark models, including the Bayesian Causal Forest (BCF) model, which many consider the current best model for CATE estimation; (2) the BCF model's performance deteriorates significantly with increasing treatment effect heterogeneity, while the ps-BART model remains robust; (3) models tend to be overconfident in CATE uncertainty quantification when treatment effect heterogeneity is low; (4) a second K-Fold method is unnecessary for avoiding overfitting in CATE estimation, as it adds computational cost without improving performance; (5) detailed analysis reveals the importance of understanding dataset characteristics and using nuanced evaluation methods; (6) the conclusion of Curth et al. (2021) that indirect strategies for CATE estimation are superior on the IHDP dataset is contradicted by the results of this research. These findings challenge existing assumptions and suggest directions for future research to enhance causal inference methodologies.
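Since the abstract does not spell out the estimation procedure, here is a minimal, generic sketch of K-fold cross-fitting for CATE estimation in the T-learner style, with scikit-learn gradient boosting as a readily available stand-in for BART; the function name, estimator choices, and synthetic data are illustrative assumptions, not the paper's implementation.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import KFold

def crossfit_cate(X, t, y, n_splits=5):
    """Cross-fitted T-learner: out-of-fold CATE estimates tau(x).

    BART would be the paper's outcome model; gradient boosting is
    used here only as a stand-in with a similar interface.
    """
    tau = np.zeros(len(y))
    for train, test in KFold(n_splits, shuffle=True, random_state=0).split(X):
        # Fit separate outcome models on treated and control units of the fold.
        mu1 = GradientBoostingRegressor().fit(X[train][t[train] == 1], y[train][t[train] == 1])
        mu0 = GradientBoostingRegressor().fit(X[train][t[train] == 0], y[train][t[train] == 0])
        # Predict the effect only on held-out units to limit overfitting.
        tau[test] = mu1.predict(X[test]) - mu0.predict(X[test])
    return tau

# Synthetic demo where the true effect is 1 + X[:, 0] (true ATE = 1).
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
t = rng.integers(0, 2, 500)
y = X[:, 0] + t * (1 + X[:, 0]) + rng.normal(scale=0.1, size=500)
print("ATE estimate:", crossfit_cate(X, t, y).mean().round(3))
```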
Searching for possible Lorentz Invariance Violation (LIV) from astrophysical sources such as gamma-ray bursts (GRBs) is essential for finding evidence of new theories of quantum gravity. However, the effect of the underlying cosmological model has been understudied in previous analyses. We take a novel approach using artificial neural networks to reconstruct the expansion history of the universe, thereby eliminating the influence of the assumed cosmological model on the LIV constraints. We consider 74 time delays from GRBs to obtain stringent results on LIV, including 37 time-delay measurements from GRB 160625B across various energy bands at redshift $z = 1.41$ and 37 additional GRBs with time delays spanning redshifts $0.117 \leq z \leq 1.99$. Our analysis yields stringent constraints on both linear and quadratic LIV, with $E_{QG,1} \geq 2.63 \times 10^{15}$ GeV and $E_{QG,2} \geq 1.19 \times 10^{10}$ GeV, four and nine orders of magnitude below the Planck energy scale respectively, and indicates a positive intrinsic time delay in GRBs. Our results demonstrate that such a combination significantly improves the precision and robustness of the final constraints, which may be an important contribution toward a possible LIV detection in the future.
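For context, the standard model-dependent expression for the LIV-induced arrival-time delay between photons of energies $E_h > E_l$ emitted at redshift $z$ is the Jacob-Piran form (stated here as background; the notation is an assumption, since the abstract does not reproduce it):

\[
\Delta t_{\mathrm{LIV}} = \frac{1+n}{2H_0}\, \frac{E_h^{\,n} - E_l^{\,n}}{E_{QG,n}^{\,n}} \int_0^z \frac{(1+z')^{\,n}}{h(z')}\, dz', \qquad n = 1, 2,
\]

where $h(z) = H(z)/H_0$ is the dimensionless expansion history and $n = 1, 2$ correspond to linear and quadratic LIV. The approach above replaces the $\Lambda$CDM form of $h(z)$ in the integrand with a neural-network reconstruction from data, removing the dependence on an assumed cosmological model.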
Thermodynamics is a fundamental branch of physics, and over the years, it has evolved to include mesoscopic and out-of-equilibrium systems driven by theoretical and experimental advances at the micro- and nanoscale. This development has led to \textit{stochastic thermodynamics}, a framework that connects microscopic fluctuations with macroscopic laws. Despite their significance, fundamental ideas, such as fluctuation theorems, are frequently not covered in current curricula, leaving them largely unknown across many disciplines. Here, we present the core results of stochastic thermodynamics, particularly the Jarzynski equality and the Crooks theorem, using an integrated approach that combines theoretical foundations with experimental verification based on optical tweezers. This approach helps to clarify the fundamentals, linking theoretical ideas to real elements in the lab, showing the simplicity of the apparatus, and presenting detailed procedures for calibration and calculation of relevant quantities to enable the implementation of these experiments in new research and teaching laboratories. The goal is to enrich thermodynamics education and to stimulate exploration in this evolving field.
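For reference, the two central results discussed above take their standard forms (with $\beta = 1/k_B T$, work $W$ performed on the system, and equilibrium free-energy difference $\Delta F$):

\[
\left\langle e^{-\beta W} \right\rangle = e^{-\beta \Delta F} \quad \text{(Jarzynski equality)}, \qquad
\frac{P_F(W)}{P_R(-W)} = e^{\beta (W - \Delta F)} \quad \text{(Crooks fluctuation theorem)},
\]

where $P_F$ and $P_R$ are the work distributions of the forward and time-reversed protocols. In optical-tweezers experiments, $W$ is typically obtained from the measured forces and displacements along individual stochastic trajectories.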
Quantum thermodynamics studies how quantum systems and operations may be exploited as sources of work to perform useful thermodynamic tasks. In real-world conditions, the evolution of open quantum systems typically displays memory effects, resulting in a non-Markovian dynamics. The associated information backflow has been observed to provide advantage in certain thermodynamic tasks. However, a general operational connection between non-Markovianity and thermodynamics in the quantum regime has remained elusive. Here, we analyze the role of non-Markovianity in the central task of extracting work via thermal operations from general multitime quantum processes, as described by process tensors. By defining a hierarchy of four classes of extraction protocols, expressed as quantum combs, we reveal three different physical mechanisms (work investment, multitime correlations, and system-environment correlations) through which non-Markovianity increases the work distillable from the process. The advantages arising from these mechanisms are linked precisely to a quantifier of the non-Markovianity of the process. These results show in very general terms how non-Markovianity of any given quantum process is a fundamental resource that unlocks an enhanced performance in thermodynamics.
We report a dataset containing full-scale, 3D images of rock plugs augmented by petrophysical lab characterization data for application in digital rock and capillary network analysis. Specifically, we have acquired microscopically resolved tomography datasets of 18 cylindrical sandstone and carbonate rock samples, each having a length of 25.4 mm and a diameter of 9.5 mm. Based on the micro-tomography data, we have computed porosity values for each imaged rock sample. To validate the computed porosity values with a complementary lab method, we have measured the porosity of each rock sample using standard petrophysical characterization techniques. Overall, the tomography-based porosity values agree with the lab measurements, with values ranging from 8% to 30%. In addition, we provide experimental permeabilities for each rock sample, with values ranging from 0.4 mD to above 5 D. This dataset will be essential for establishing, benchmarking, and referencing the relation between porosity and permeability of reservoir rock at the pore scale.
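As a minimal illustration of how image-based porosity can be computed from a segmented micro-CT volume (the dataset's actual processing pipeline is not described in the abstract, and the synthetic volume below is purely illustrative):

```python
import numpy as np

def porosity_from_segmentation(volume: np.ndarray, pore_label: int = 0) -> float:
    """Fraction of voxels labeled as pore space in a segmented 3D volume.

    `volume` is an integer-labeled micro-CT segmentation where `pore_label`
    marks void space; all other labels are treated as solid matrix.
    """
    pore_voxels = np.count_nonzero(volume == pore_label)
    return pore_voxels / volume.size

# Illustrative use on a synthetic 100^3 volume with ~20% pore space.
rng = np.random.default_rng(42)
synthetic = (rng.random((100, 100, 100)) > 0.20).astype(np.uint8)
print(f"porosity = {porosity_from_segmentation(synthetic):.3f}")
```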
Anomalous thermal relaxation is ubiquitous in nonequilibrium statistical mechanics. An emblematic example is the Mpemba effect, where an initially ``hot'' system cools faster than an initially ``cooler'' one. This effect has recently been studied in a variety of classical and quantum settings. In this Letter, we find a novel signature of the Mpemba effect in the context of quantum batteries. We identify situations where batteries in higher charge states can discharge faster than batteries in less charged states. Specifically, we consider a quantum battery encoded in a single bosonic mode that is charged using unitary Gaussian operations. We show that the ergotropy, used here as a dynamical indicator of the energy stored in the battery, can be recast as a phase-space relative entropy between the system's state and the unitarily connected passive state at each time. Our formalism allows us to compute the ergotropy analytically under dissipative dynamics and to understand the conditions that give rise to a Mpemba effect. We also find situations where two batteries charged to the same value using different operations can discharge at different rates.
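For context, the ergotropy of a state $\rho$ with Hamiltonian $H$ is the maximum work extractable by unitary operations,

\[
\mathcal{W}(\rho) = \operatorname{Tr}(\rho H) - \min_{U} \operatorname{Tr}\!\big(U \rho\, U^\dagger H\big) = \operatorname{Tr}(\rho H) - \operatorname{Tr}(\sigma_\rho H),
\]

where the minimum is attained by the passive state $\sigma_\rho$, which shares the spectrum of $\rho$ but admits no further unitary work extraction. The result above recasts this quantity as a phase-space relative entropy for Gaussian bosonic states.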
Current ophthalmology clinical workflows are plagued by over-referrals, long waits, and complex and heterogeneous medical records. Large language models (LLMs) present a promising solution to automate various procedures such as triaging, preliminary tests like visual acuity assessment, and report summaries. However, LLMs have demonstrated significantly varied performance across different languages in natural language question-answering tasks, potentially exacerbating healthcare disparities in Low and Middle-Income Countries (LMICs). This study introduces the first multilingual ophthalmological question-answering benchmark with manually curated questions parallel across languages, allowing for direct cross-lingual comparisons. Our evaluation of 6 popular LLMs across 7 different languages reveals substantial bias across languages, highlighting risks for clinical deployment of LLMs in LMICs. Existing debiasing methods such as Translation Chain-of-Thought or Retrieval-Augmented Generation (RAG) by themselves fall short of closing this performance gap, often failing to improve performance across all languages and lacking specificity for the medical domain. To address this issue, we propose CLARA (Cross-Lingual Reflective Agentic system), a novel inference-time debiasing method leveraging retrieval-augmented generation and self-verification. Our approach not only improves performance across all languages but also significantly reduces the multilingual bias gap, facilitating equitable LLM application across the globe.
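The abstract describes CLARA only at a high level; the following is a speculative sketch of a retrieve-then-self-verify loop consistent with that description. Every name here (`translate`, `retrieve`, `generate`, `verify_grounding`) is a hypothetical placeholder with a stubbed body, not the paper's implementation.

```python
def translate(text: str, target: str = "en") -> str:
    """Hypothetical translation step (identity stub for this demo)."""
    return text

def retrieve(question: str, k: int = 5) -> list[str]:
    """Hypothetical retriever over a medical corpus (stubbed)."""
    return [f"evidence passage {i} for: {question}" for i in range(k)]

def generate(question: str, evidence: list[str]) -> str:
    """Hypothetical LLM call producing a draft answer (stubbed)."""
    return f"answer to '{question}' grounded in {len(evidence)} passages"

def verify_grounding(answer: str, evidence: list[str]) -> bool:
    """Hypothetical self-verification: is the draft supported by evidence?"""
    return len(evidence) >= 5

def reflective_rag_answer(question: str) -> str:
    # 1) Normalize to a pivot language to reduce cross-lingual variance.
    question_en = translate(question, target="en")
    # 2) Retrieve domain evidence and draft an answer.
    evidence = retrieve(question_en)
    draft = generate(question_en, evidence)
    # 3) Reflect: widen retrieval and regenerate if the draft is unsupported.
    if not verify_grounding(draft, evidence):
        evidence = retrieve(question_en, k=10)
        draft = generate(question_en, evidence)
    return draft

print(reflective_rag_answer("What is glaucoma?"))
```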
Structured evolutionary algorithms have been investigated for some time. However, they remain under-explored, especially in the field of multi-objective optimization: despite their good results, their complex dynamics and structures make them hard to understand and keep their adoption rate low. Here, we propose the general subpopulation framework, which can integrate optimization algorithms without restrictions and aid the design of structured algorithms. The proposed framework generalizes most structured evolutionary algorithms, such as cellular algorithms, island models, spatial predator-prey, and restricted-mating-based algorithms, under its formalization. Moreover, we propose two algorithms based on the general subpopulation framework, demonstrating that simply adding a number of single-objective differential evolution algorithms, one for each objective, greatly improves the results, even when the combined algorithms perform poorly when evaluated alone on the tests. Most importantly, the comparison between the subpopulation algorithms and their related panmictic algorithms suggests that competition between different strategies inside one population can have deleterious consequences for an algorithm, revealing a strong benefit of using the subpopulation framework. The code for SAN, the proposed multi-objective algorithm with the current best results on the hardest benchmark, is available at this https URL
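As a minimal sketch of the subpopulation idea (not the paper's SAN algorithm): each objective gets its own single-objective differential evolution subpopulation, evolved independently, with occasional migration of elites between subpopulations. The toy objectives and all parameter values below are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two objectives of a toy bi-objective problem (illustrative only).
def f1(x): return np.sum(x**2)
def f2(x): return np.sum((x - 2.0)**2)

def de_step(pop, fitness, objective, F=0.8, CR=0.9):
    """One generation of classic DE/rand/1/bin on a single objective."""
    n, d = pop.shape
    for i in range(n):
        a, b, c = pop[rng.choice(n, 3, replace=False)]
        mutant = a + F * (b - c)
        cross = rng.random(d) < CR
        trial = np.where(cross, mutant, pop[i])
        if objective(trial) < fitness[i]:
            pop[i], fitness[i] = trial, objective(trial)
    return pop, fitness

# One subpopulation per objective, evolved independently...
objectives = [f1, f2]
subpops = [rng.normal(size=(20, 5)) for _ in objectives]
fits = [np.array([obj(x) for x in p]) for p, obj in zip(subpops, objectives)]

for gen in range(50):
    for k, obj in enumerate(objectives):
        subpops[k], fits[k] = de_step(subpops[k], fits[k], obj)
    # ...with occasional migration of elites between subpopulations.
    if gen % 10 == 9:
        for k in range(2):
            donor = subpops[(k + 1) % 2][np.argmin(fits[(k + 1) % 2])]
            worst = np.argmax(fits[k])
            subpops[k][worst] = donor
            fits[k][worst] = objectives[k](donor)

print([f.min().round(4) for f in fits])
```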
Temporal graphs are commonly used to represent complex systems and track the evolution of their constituents over time. Visualizing these graphs is crucial, as it allows one to quickly identify anomalies, trends, patterns, and other properties that facilitate better decision-making. In this context, selecting an appropriate temporal resolution is essential for constructing and visually analyzing the layout, particularly when dealing with temporally sparse graphs. In such cases, changing the temporal resolution by grouping events (i.e., edges) from consecutive timestamps -- a technique known as timeslicing -- can aid the analysis and reveal patterns that might not be discernible otherwise. However, selecting an appropriate temporal resolution is a challenging task. In this paper, we propose ZigzagNetVis, a methodology that suggests temporal resolutions potentially relevant for analyzing a given graph, i.e., resolutions that lead to substantial topological changes in the graph structure. ZigzagNetVis achieves this by leveraging zigzag persistent homology, a well-established technique from Topological Data Analysis (TDA). To improve visual graph analysis, ZigzagNetVis incorporates the colored barcode, a novel timeline-based visualization inspired by the persistence barcodes commonly used in TDA. We also contribute a web-based system prototype that implements the suggestion methodology and visualization tools. Finally, we demonstrate the usefulness and effectiveness of ZigzagNetVis through a usage scenario, a user study with 27 participants, and a detailed quantitative evaluation.
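Timeslicing itself is simple to state precisely; here is a minimal sketch (illustrative function and variable names, not ZigzagNetVis code) that groups timestamped edges into consecutive slices of a chosen width:

```python
from collections import defaultdict

def timeslice(edges, width):
    """Group timestamped edges (u, v, t) into consecutive slices of `width`.

    Returns {slice_index: [(u, v), ...]}, where slice k covers the
    interval [t0 + k*width, t0 + (k+1)*width) and t0 is the earliest timestamp.
    """
    t0 = min(t for _, _, t in edges)
    slices = defaultdict(list)
    for u, v, t in edges:
        slices[int((t - t0) // width)].append((u, v))
    return dict(slices)

edges = [("a", "b", 0.0), ("b", "c", 1.2), ("a", "c", 5.1), ("c", "d", 5.9)]
print(timeslice(edges, width=2.0))  # {0: [('a','b'), ('b','c')], 2: [('a','c'), ('c','d')]}
```

Choosing `width` is exactly the resolution-selection problem the paper addresses with zigzag persistent homology.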
We study the lepton-induced resonant production of color-adjoint leptons (leptogluons) at the LHC employing the lepton parton density function of the proton. We demonstrate that this production mechanism can be useful to extend the LHC ability to search for leptogluons beyond purely quark/gluon initiated production processes up to ~ 3.5 TeV leptogluon masses and O(1) TeV compositeness scales. Discerning leptogluons from scalar and vector leptoquarks is also possible in this channel, given a data sample containing the order of 100 signal events. We argue that the resonant channel can be combined with leptogluon pair and associated leptogluon-lepton productions to boost exclusion limits and discovery prospects at the LHC.
We study monoidal profunctors as a tool for reasoning about and structuring pure functional programs, both from a categorical perspective and as a Haskell implementation. From the categorical point of view, we approach them as monoids in a certain monoidal category of profunctors. We study properties of this monoidal category and construct and implement the free monoidal profunctor. We study the relationship of the monoidal construction to optics, and introduce a promising generalization of the implementation which we illustrate by introducing effectful monoidal profunctors.
This paper introduces the Difference-in-Differences Bayesian Causal Forest (DiD-BCF), a novel non-parametric model addressing key challenges in DiD estimation, such as staggered adoption and heterogeneous treatment effects. DiD-BCF provides a unified framework for estimating Average (ATE), Group-Average (GATE), and Conditional Average Treatment Effects (CATE). A core innovation, its Parallel Trends Assumption (PTA)-based reparameterization, enhances estimation accuracy and stability in complex panel data settings. Extensive simulations demonstrate DiD-BCF's superior performance over established benchmarks, particularly under non-linearity, selection biases, and effect heterogeneity. Applied to U.S. minimum wage policy, the model uncovers significant conditional treatment effect heterogeneity related to county population, insights obscured by traditional methods. DiD-BCF offers a robust and versatile tool for more nuanced causal inference in modern DiD applications.
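For orientation, the canonical two-way fixed-effects DiD specification that models of this kind generalize reads (standard notation, assumed here since the abstract does not state it):

\[
Y_{it} = \alpha_i + \gamma_t + \tau\, D_{it} + \varepsilon_{it},
\]

where $\alpha_i$ are unit fixed effects, $\gamma_t$ time fixed effects, and $D_{it}$ the treatment indicator; under the parallel trends assumption, $\tau$ identifies the average treatment effect on the treated. DiD-BCF instead estimates a heterogeneous effect $\tau(x_{it})$ non-parametrically with a Bayesian causal forest, which is what enables the GATE and CATE analyses mentioned above.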
Automatic License Plate Recognition (ALPR) involves extracting vehicle license plate information from an image or video capture. These systems have gained popularity due to the wide availability of low-cost surveillance cameras and advances in Deep Learning. Typically, video-based ALPR systems rely on multiple frames to detect the vehicle and recognize the license plates. In contrast, we propose a system capable of extracting exactly one frame per vehicle and recognizing its license plate characters from this single image using an Optical Character Recognition (OCR) model. Early experiments show that this methodology is viable.
We present dPASP, a novel declarative probabilistic logic programming framework for differentiable neuro-symbolic reasoning. The framework allows for the specification of discrete probabilistic models with neural predicates, logic constraints, and interval-valued probabilistic choices, thus supporting models that combine low-level perception (images, texts, etc.), common-sense reasoning, and (vague) statistical knowledge. To support all such features, we discuss several semantics for probabilistic logic programs that can express nondeterministic, contradictory, incomplete, and/or statistical knowledge. We also discuss how gradient-based learning can be performed with neural predicates and probabilistic choices under selected semantics. We then describe an implemented package that supports inference and learning in the language, along with several example programs. The package requires minimal user knowledge of the inner workings of deep learning systems, while allowing end-to-end training of rather sophisticated models and loss functions.
Deep learning-based models have recently outperformed state-of-the-art seasonal forecasting models, e.g., for predicting the El Niño-Southern Oscillation (ENSO). However, current deep learning models are based on convolutional neural networks, which are difficult to interpret and can fail to model large-scale atmospheric patterns. In comparison, graph neural networks (GNNs) are capable of modeling large-scale spatial dependencies and are more interpretable due to the explicit modeling of information flow through edge connections. We propose the first application of graph neural networks to seasonal forecasting. We design a novel graph connectivity learning module that enables our GNN model to learn large-scale spatial interactions jointly with the actual ENSO forecasting task. Our model, Graphino, outperforms state-of-the-art deep learning-based models for forecasts up to six months ahead. Additionally, we show that our model is more interpretable, as it learns sensible connectivity structures that correlate with the ENSO anomaly pattern.
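Since the abstract does not detail the connectivity learning module, here is a minimal, generic sketch of the underlying idea, learning a soft adjacency matrix jointly with the forecasting loss, in PyTorch; the class name, layer sizes, and normalization choice are all assumptions, not the paper's architecture.

```python
import torch
import torch.nn as nn

class LearnedGraphLayer(nn.Module):
    """One message-passing layer with a learnable, soft adjacency matrix."""
    def __init__(self, num_nodes: int, in_dim: int, out_dim: int):
        super().__init__()
        # Unconstrained logits; sigmoid yields edge weights in (0, 1).
        self.adj_logits = nn.Parameter(torch.zeros(num_nodes, num_nodes))
        self.lin = nn.Linear(in_dim, out_dim)

    def forward(self, x):            # x: (batch, num_nodes, in_dim)
        adj = torch.sigmoid(self.adj_logits)
        # Row-normalize so each node aggregates a weighted mean of neighbors.
        adj = adj / adj.sum(dim=-1, keepdim=True)
        return torch.relu(self.lin(adj @ x))

# Toy usage: 10 grid nodes, 4 input features, scalar forecast per sample.
layer = LearnedGraphLayer(num_nodes=10, in_dim=4, out_dim=8)
head = nn.Linear(8 * 10, 1)
x = torch.randn(2, 10, 4)
y_hat = head(layer(x).flatten(1))   # adjacency logits train with the task loss
print(y_hat.shape)                  # torch.Size([2, 1])
```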
In the dynamics of open quantum systems, information may propagate in time through either the system or the environment, giving rise to Markovian and non-Markovian temporal correlations, respectively. However, despite their notable coexistence in most physical situations, it is not yet clear how these two quantities may limit the existence of one another. Here, we address this issue by deriving several inequalities relating the temporal correlations of general multi-time quantum processes. The dynamics are described by process tensors, and the correlations are quantified by the mutual information between subsystems of their Choi states. First, we prove a set of upper bounds on the non-Markovianity of a process given the degree of Markovianity in each of its steps. This immediately implies a non-trivial maximum value for the non-Markovianity of any process, independently of its Markovianity. Finally, we derive how the non-Markovianity limits the amount of total temporal correlations that can be present in a given process. These results show that, although any multi-time process must pay a price in total correlations to have a given amount of non-Markovianity, this price vanishes exponentially with the number of steps of the process, while the maximum non-Markovianity grows only linearly. This implies that even a highly non-Markovian process might be arbitrarily close to having maximum total correlations if it has a sufficiently large number of steps.
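The correlation quantifier referred to above is the quantum mutual information; for subsystems $A$ and $B$ of a bipartite state $\rho_{AB}$ (here, marginals of the process tensor's Choi state) it reads

\[
I(A\!:\!B) = S(\rho_A) + S(\rho_B) - S(\rho_{AB}), \qquad S(\rho) = -\operatorname{Tr}(\rho \log \rho),
\]

with $S$ the von Neumann entropy.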
In astronomy, a huge amount of image data is generated daily by photometric surveys, which scan the sky to collect data from stars, galaxies, and other celestial objects. In this paper, we propose a technique to leverage unlabeled astronomical images to pre-train deep convolutional neural networks, in order to learn a domain-specific feature extractor that improves the results of machine learning techniques in setups with small amounts of labeled data available. We show that our technique produces results that are in many cases better than using ImageNet pre-training.
An environment interacting with a quantum system can enhance transport through the suppression of quantum effects responsible for localization. In this paper, we study the interplay between bulk dephasing and a linear potential in a boundary-driven tight-binding chain. A linear potential induces Wannier-Stark localization in the absence of noise, while dephasing induces diffusive transport in the absence of a tilt. We derive an approximate expression for the steady-state current as a function of both dephasing and tilt, which closely matches the exact solution for a wide range of parameters. From it, we find that the maximum current occurs for a dephasing rate equal to the frequency of Bloch oscillations in the Wannier-Stark localized system. We also find that the current displays a maximum as a function of the system size, provided that the total potential tilt across the chain remains constant. Our results can be verified in current experimental platforms and represent a step forward in analytical studies of environment-assisted transport.
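In standard notation (assumed here; the abstract does not fix conventions), this setting corresponds to a tilted tight-binding Hamiltonian with a local-dephasing Lindblad dissipator of rate $\gamma$:

\[
H = -J \sum_{i} \left( c_i^\dagger c_{i+1} + \mathrm{h.c.} \right) + F \sum_{i} i\, n_i, \qquad
\mathcal{D}[\rho] = \gamma \sum_{i} \left( n_i \rho\, n_i - \tfrac{1}{2} \{ n_i, \rho \} \right),
\]

where $n_i = c_i^\dagger c_i$, $J$ is the hopping amplitude, and $F$ the tilt per site; in the closed, tilted chain, the Bloch oscillation frequency is set by $F$.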
As more data are produced each day, and at ever faster rates, data stream mining grows in importance, making clear the need for algorithms able to process these data quickly. Data stream mining algorithms are meant to extract knowledge online, and are specially tailored to continuous data problems. Many of the current algorithms for data stream mining have high processing and memory costs; often, the higher the predictive performance, the higher these costs. To increase predictive performance without largely increasing memory and time costs, this paper introduces a novel algorithm, named Online Local Boosting (OLBoost), which can be combined with online decision tree algorithms to improve their predictive performance without modifying the structure of the induced decision trees. To do so, OLBoost applies boosting to small, separate regions of the instance space. Experimental results presented in this paper show that by using OLBoost, online decision tree algorithms can significantly improve their predictive performance. Additionally, it can make smaller trees perform as well as or better than larger trees.
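The abstract does not give OLBoost's update rule, so the following is a speculative sketch of the general "local boosting" idea it describes: attach a lightweight online residual correction to each region of the instance space (e.g., each tree leaf), leaving the tree itself untouched. All names and the running-mean rule are assumptions, not the paper's method.

```python
from collections import defaultdict

class LocalBooster:
    """Per-region online residual correction (illustrative, not OLBoost itself).

    `region_of(x)` maps an instance to a region id (e.g., the id of the
    decision-tree leaf it falls into); each region keeps a running mean
    of the base model's residuals and adds it back at prediction time.
    """
    def __init__(self, base_predict, region_of):
        self.base_predict = base_predict
        self.region_of = region_of
        self.sums = defaultdict(float)
        self.counts = defaultdict(int)

    def predict(self, x):
        r = self.region_of(x)
        correction = self.sums[r] / self.counts[r] if self.counts[r] else 0.0
        return self.base_predict(x) + correction

    def learn_one(self, x, y):
        r = self.region_of(x)
        self.sums[r] += y - self.base_predict(x)
        self.counts[r] += 1

# Toy usage: a constant base model, with regions split at x = 0.
booster = LocalBooster(base_predict=lambda x: 0.0, region_of=lambda x: x > 0)
for x, y in [(-1.0, -2.0), (1.0, 2.0), (-0.5, -2.0), (0.5, 2.0)]:
    booster.learn_one(x, y)
print(booster.predict(0.7), booster.predict(-0.7))  # 2.0 -2.0
```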