Berlin School of Mind and Brain, Humboldt-Universität zu Berlin
This paper is concerned with the evolution dynamics of local times of a spectrally positive stable process in the spatial direction. The main results state that, conditioned on the finiteness of the first time at which the local time at zero exceeds a given value, the local times on the positive half-line are equal in distribution to the unique solution of a stochastic Volterra equation driven by a Poisson random measure whose intensity coincides with the Lévy measure. This representation provides not only a simple proof of the Hölder regularity, but also a uniform upper bound for all moments of the Hölder coefficient, as well as a maximal inequality for the local times. Moreover, based on this stochastic Volterra equation, we extend the method of duality to establish an exponential-affine representation of the Laplace functional in terms of the unique solution of a nonlinear Volterra integral equation associated with the Laplace exponent of the stable process.
The detection of sequential patterns in data is a basic functionality of modern data processing systems for complex event processing (CEP), OLAP, and retrieval-augmented generation (RAG). In practice, pattern matching is challenging, since common applications rely on a large set of patterns that shall be evaluated with tight latency bounds. At the same time, matching needs to maintain state, i.e., intermediate results, that grows exponentially in the input size. Hence, systems turn to best-effort processing, striving for maximal recall under a latency bound. Existing techniques, however, consider each pattern in isolation, neglecting the optimization potential induced by state sharing in pattern matching. In this paper, we present SHARP, a library that employs state reduction to achieve efficient best-effort pattern matching. To this end, SHARP incorporates state sharing between patterns through a new abstraction, coined pattern-sharing degree (PSD). At runtime, this abstraction facilitates the categorization and indexing of partial pattern matches. Based thereon, once a latency bound is exceeded, SHARP realizes best-effort processing by selecting a subset of partial matches for further processing in constant time. In experiments with real-world data, SHARP achieves a recall of 97%, 96% and 73% for pattern matching in CEP, OLAP, and RAG applications, under a bound of 50% of the average processing latency.
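To make the state-sharing idea concrete, here is a minimal Python sketch of bucketing partial matches by their pattern-sharing degree (PSD) and selecting a best-effort subset once the latency bound is hit. Class and method names are hypothetical illustrations, not SHARP's actual API, and the real system performs the selection in constant time via indexing rather than this simplified loop.

```python
from collections import defaultdict

class PartialMatchIndex:
    """Hypothetical sketch: index partial matches by their pattern-sharing
    degree (PSD), i.e., how many registered patterns they can contribute to."""

    def __init__(self):
        self.buckets = defaultdict(list)  # PSD -> partial matches

    def add(self, match, sharing_patterns):
        # A partial match shared by more patterns has a higher PSD.
        self.buckets[len(sharing_patterns)].append(match)

    def select_best_effort(self, budget):
        """Once the latency bound is exceeded, keep only `budget` partial
        matches, preferring those that serve the most patterns."""
        kept = []
        for psd in sorted(self.buckets, reverse=True):
            for match in self.buckets[psd]:
                if len(kept) == budget:
                    return kept
                kept.append(match)
        return kept
```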
We directly compare the persuasion capabilities of a frontier large language model (LLM; Claude 3.5 Sonnet) against incentivized human persuaders in an interactive, real-time conversational quiz setting. In this preregistered, large-scale incentivized experiment, participants (quiz takers) completed an online quiz where persuaders (either humans or LLMs) attempted to persuade quiz takers toward correct or incorrect answers. We find that LLM persuaders achieved significantly higher compliance with their directional persuasion attempts than incentivized human persuaders, demonstrating superior persuasive capabilities in both truthful (toward correct answers) and deceptive (toward incorrect answers) contexts. We also find that LLM persuaders significantly increased quiz takers' accuracy, leading to higher earnings, when steering quiz takers toward correct answers, and significantly decreased their accuracy, leading to lower earnings, when steering them toward incorrect answers. Overall, our findings suggest that AI's persuasion capabilities already exceed those of humans who have real-money bonuses tied to performance. Our findings of increasingly capable AI persuaders thus underscore the urgency of emerging alignment and governance frameworks.
Weeds are one of the major reasons for crop yield loss but current weeding practices fail to manage weeds in an efficient and targeted manner. Effective weed management is especially important for crops with high worldwide production such as maize, to maximize crop yield for meeting increasing global demands. Advances in near-sensing and computer vision enable the development of new tools for weed management. Specifically, state-of-the-art segmentation models, coupled with novel sensing technologies, can facilitate timely and accurate weeding and monitoring systems. However, learning-based approaches require annotated data and show a lack of generalization to aerial imaging for different crops. We present a novel dataset for semantic and instance segmentation of crops and weeds in agricultural maize fields. The multispectral UAV-based dataset contains images with RGB, red-edge, and near-infrared bands, a large number of plant instances, dense annotations for maize and four weed classes, and is multitemporal. We provide extensive baseline results for both tasks, including probabilistic methods to quantify prediction uncertainty, improve model calibration, and demonstrate the approach's applicability to out-of-distribution data. The results show the effectiveness of the two additional bands compared to RGB only, and better performance in our target domain than models trained on existing datasets. We hope our dataset advances research on methods and operational systems for fine-grained weed identification, enhancing the robustness and applicability of UAV-based weed management. The dataset and code are available at this https URL
A controversial test for Large Language Models concerns the ability to discern possible from impossible language. While some evidence attests to the models' sensitivity to what crosses the limits of grammatically impossible language, this evidence has been contested on the grounds of the soundness of the testing material. We use model-internal representations to tap directly into the way Large Language Models represent the 'grammatical-ungrammatical' distinction. In a novel benchmark, we elicit probabilities from 4 models and compute minimal-pair surprisal differences, juxtaposing probabilities assigned to grammatical sentences to probabilities assigned to (i) lower frequency grammatical sentences, (ii) ungrammatical sentences, (iii) semantically odd sentences, and (iv) pragmatically odd sentences. The prediction is that if string-probabilities can function as proxies for the limits of grammar, the ungrammatical condition will stand out among the conditions that involve linguistic violations, showing a spike in the surprisal rates. Our results do not reveal a unique surprisal signature for ungrammatical prompts, as the semantically and pragmatically odd conditions consistently show higher surprisal. We thus demonstrate that probabilities do not constitute reliable proxies for model-internal representations of syntactic knowledge. Consequently, claims about models being able to distinguish possible from impossible language need verification through a different methodology.
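A minimal sketch of the minimal-pair surprisal comparison described above, using Hugging Face transformers; the model and sentence pair are stand-ins, not the paper's benchmark items.

```python
import math
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")  # stand-in for the tested models
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

def surprisal(sentence: str) -> float:
    """Total surprisal -log2 p(sentence) under the model."""
    ids = tok(sentence, return_tensors="pt").input_ids
    with torch.no_grad():
        loss = model(ids, labels=ids).loss  # mean NLL (nats) over predicted tokens
    return loss.item() * (ids.shape[1] - 1) / math.log(2)

# Minimal-pair difference: a positive value means the model assigns the
# violating sentence lower probability than its well-formed counterpart.
delta = surprisal("The keys to the cabinet is on the table.") \
      - surprisal("The keys to the cabinet are on the table.")
print(delta)
```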
Researchers at Humboldt-Universität zu Berlin introduced an enhancement for Retrieval-Augmented Generation (RAG) that decomposes complex queries into sub-queries using a zero-shot LLM and then refines retrieved passages with a cross-encoder reranker. This combined strategy improved retrieval recall and answer accuracy, achieving a 16.5% higher Hits@10 on MultiHop-RAG and an F1 score of 35.0 on HotpotQA for answer generation.
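A rough sketch of such a pipeline: zero-shot sub-query decomposition followed by cross-encoder reranking. The reranker checkpoint is a common public example, and `llm`/`retriever` are assumed callables, not the authors' actual components.

```python
from sentence_transformers import CrossEncoder

reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")  # example checkpoint

def answer(question, retriever, llm, k=10):
    # 1) Zero-shot decomposition of the complex query into sub-queries.
    subs = llm(
        "Decompose the question into simple sub-questions, one per line:\n"
        + question
    ).splitlines()
    # 2) Retrieve per sub-query and pool the passages.
    pool = list({p for q in subs + [question] for p in retriever(q)})
    # 3) Refine: rerank the pooled passages against the original question.
    scores = reranker.predict([(question, p) for p in pool])
    top = [p for _, p in sorted(zip(scores, pool), reverse=True)[:k]]
    # 4) Generate the answer from the reranked context.
    return llm("Answer using only this context:\n" + "\n".join(top)
               + "\n\nQuestion: " + question)
```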
This thesis examines the correspondence between models of statistical physics and Feynman graphs of quantum field theories (QFTs) through a common property: integrability. We review integrable structures for periodic boundary conditions on both sides, focusing on the eight- and six-vertex models and the bi-scalar fishnet theory. The latter is a double-scaled $\gamma$-deformation of $\mathcal{N}=4$ super Yang-Mills theory. Among the applications of integrability in the literature that we reconsider are the computation of the free energy in the thermodynamic limit and its QFT counterpart, the critical coupling. In addition, we provide a detailed overview of the calculation of exact anomalous dimensions and operator product expansion (OPE) coefficients in the conformal bi-scalar fishnet theory. The original contributions of this work comprise the results of the critical coupling for models with fermions, the brick wall theory, and the fermionic fishnet theory. Additionally, we extend the study of integrable Feynman graphs to supersymmetric diagrams in superspace. By establishing an efficient graphical formalism, we obtain the critical coupling of double-scaled $\beta$-deformations of $\mathcal{N}=4$ super Yang-Mills theory and of Aharony-Bergman-Jafferis-Maldacena theory, the super brick wall and superfishnet theories, respectively. Moreover, we apply superspace methods to the superfishnet theory and find results for anomalous dimensions and an OPE coefficient, all of which are all-loop exact in the coupling. In addition, we study boundary integrability in the six-vertex model and for Feynman diagrams. We present new box-shaped boundary conditions for the six-vertex model and conjecture a closed form for its partition function at any lattice size. On the QFT side, we find integrable boundary scattering matrices in the form of generalized Feynman diagrams by graphical methods.
This paper from Humboldt-Universität zu Berlin introduces curriculum learning strategies to enable Multi-Token Prediction (MTP) for smaller language models, demonstrating that a Forward curriculum improves both performance and inference speed (1.2-1.7x) for models with 1.3B and 3B parameters. The work also found that byte-level tokenization consistently yields better results for MTP in these smaller architectures.
We review lattice results related to pion, kaon, $D$-meson, $B$-meson, and nucleon physics with the aim of making them easily accessible to the nuclear and particle physics communities. More specifically, we report on the determination of the light-quark masses, the form factor $f_+(0)$ arising in the semileptonic $K \to \pi$ transition at zero momentum transfer, as well as the decay-constant ratio $f_K/f_\pi$ and its consequences for the CKM matrix elements $V_{us}$ and $V_{ud}$. We review the determination of the $B_K$ parameter of neutral kaon mixing as well as the additional four $B$ parameters that arise in theories of physics beyond the Standard Model. For the heavy-quark sector, we provide results for $m_c$ and $m_b$ as well as those for the decay constants, form factors, and mixing parameters of charmed and bottom mesons and baryons. These are the heavy-quark quantities most relevant for the determination of CKM matrix elements and the global CKM unitarity-triangle fit. We review the status of lattice determinations of the strong coupling constant $\alpha_s$. We review the determinations of nucleon charges from the matrix elements of both isovector and flavour-diagonal axial, scalar and tensor local quark bilinears, and of the momentum fraction, helicity moment and transversity moment from one-link quark bilinears. We also review determinations of scale-setting quantities. Finally, in this review we have added a new section on the general definition of the low-energy limit of the Standard Model.
Computing partial differential equation (PDE) operators via nested backpropagation is expensive, yet popular, and severely restricts their utility for scientific machine learning. Recent advances, like the forward Laplacian and randomizing Taylor mode automatic differentiation (AD), propose forward schemes to address this. We introduce an optimization technique for Taylor mode that 'collapses' derivatives by rewriting the computational graph, and demonstrate how to apply it to general linear PDE operators, and randomized Taylor mode. The modifications simply require propagating a sum up the computational graph, which could -- or should -- be done by a machine learning compiler, without exposing complexity to users. We implement our collapsing procedure and evaluate it on popular PDE operators, confirming it accelerates Taylor mode and outperforms nested backpropagation.
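For context, a naive (uncollapsed) Taylor-mode Laplacian in JAX looks as follows: one second-order jet per coordinate direction, summed afterwards. The paper's collapsing instead propagates that sum up the computational graph; this sketch only shows the baseline being optimized, assuming jet's truncated-Taylor convention where the second output coefficient equals (1/2) v^T H v.

```python
import jax.numpy as jnp
from jax.experimental import jet

def naive_taylor_laplacian(f, x):
    """Sum of second directional derivatives along the coordinate axes."""
    def second_deriv(v):
        # f(x + t v) = y0 + y1 t + y2 t^2 + ..., so y2 = (1/2) v^T H v.
        _, (_, y2) = jet.jet(f, (x,), ((v, jnp.zeros_like(x)),))
        return 2.0 * y2
    basis = jnp.eye(x.shape[0])
    return sum(second_deriv(basis[i]) for i in range(x.shape[0]))

f = lambda x: jnp.sum(jnp.sin(x)) * jnp.cos(x[0])
print(naive_taylor_laplacian(f, jnp.ones(3)))
```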
We develop the on-shell action formalism within Worldline Quantum Field Theory (WQFT) to describe scattering of spinning compact bodies in General Relativity in the post-Minkowskian (PM) expansion. The real on-shell action is constructed from vacuum diagrams with causal (retarded) propagators from which scattering observables such as momentum impulse and spin kick follow via Poisson brackets of the initial scattering data. Furthermore, we explore the implications of unitarity at the level of the worldline and show how generalised unitarity techniques can be adapted to WQFT to efficiently compute multi-loop contributions. Our work establishes a concrete link between WQFT and amplitude-based methods, elucidating how unitarity cuts ensure equivalence between the on-shell action derived from either approach. Extending the state-of-the-art, we complete the full on-shell action -- including dissipative terms -- at (formal) 3PM order and up to quartic spin interactions on both massive bodies.
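Schematically, and in our notation rather than necessarily the paper's conventions, the impulse of body $i$ is generated by iterated Poisson brackets of the initial momentum with the on-shell action $S$,

$$\Delta p_i^\mu = \{ p_i^\mu, S \} + \tfrac{1}{2} \{ \{ p_i^\mu, S \}, S \} + \dots$$

with the spin kick obtained analogously from brackets of the initial spin data.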
Digital pathology has seen the advent of a wealth of foundation models (FMs), yet to date their performance on cell phenotyping has not been benchmarked in a unified manner. We therefore propose PhenoBench, a comprehensive benchmark for cell phenotyping on Hematoxylin and Eosin (H&E) stained histopathology images. We provide both PhenoCell, a new H&E dataset featuring 14 granular cell types identified using multiplexed imaging, and ready-to-use fine-tuning and benchmarking code that allows the systematic evaluation of multiple prominent pathology FMs in terms of dense cell phenotype predictions in different generalization scenarios. We perform extensive benchmarking of existing FMs, providing insights into their generalization behavior under technical vs. medical domain shifts. Furthermore, while FMs achieve macro F1 scores > 0.70 on previously established benchmarks such as Lizard and PanNuke, on PhenoCell we observe scores as low as 0.20. This indicates a much more challenging task not captured by previous benchmarks, establishing PhenoCell as a prime asset for future benchmarking of FMs and supervised models alike. Code and data are available on GitHub.
Improvements to the Adaptive Density Control (ADC) mechanism for 3D Gaussian Splatting (3DGS) are presented, featuring a corrected scene extent calculation, an exponentially ascending gradient threshold, and significance-aware pruning. This refined ADC results in enhanced rendering quality and faster training convergence for 3D scene reconstruction and novel view synthesis.
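A toy sketch of one of these ingredients, the exponentially ascending gradient threshold; the constants and function name are illustrative placeholders, not the paper's values.

```python
def densify_threshold(step, total_steps, tau0=2e-4, tau1=2e-3):
    """Exponentially ascending densification threshold (hypothetical schedule):
    interpolate from tau0 to tau1 on a log scale, so densification becomes
    stricter as training progresses."""
    frac = min(step / total_steps, 1.0)
    return tau0 * (tau1 / tau0) ** frac

# A Gaussian is split/cloned only if its accumulated view-space positional
# gradient exceeds the current threshold:
#   if grad_accum[i] > densify_threshold(step, total_steps): densify(i)
```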
Measuring the Higgs trilinear self-coupling $\lambda_{hhh}$ is experimentally demanding but fundamental for understanding the shape of the Higgs potential. We present a comprehensive analysis strategy for the HL-LHC using di-Higgs events in the four $b$-quark channel ($hh \to 4b$), extending current methods in several directions. We perform deep learning to suppress the formidable multijet background, with dedicated optimisation for BSM $\lambda_{hhh}$ scenarios. We compare the $\lambda_{hhh}$ constraining power of events using different multiplicities of large-radius jets with a two-prong structure that reconstruct boosted $h \to bb$ decays. We show that current uncertainties in the SM top Yukawa coupling $y_t$ can modify $\lambda_{hhh}$ constraints by $\sim 20\%$. For SM $y_t$, we find prospects of $-0.8 < \lambda_{hhh} / \lambda_{hhh}^\text{SM} < 6.6$ at 68% CL under simplified assumptions for 3000 fb$^{-1}$ of HL-LHC data. Our results provide a careful assessment of di-Higgs identification and machine learning techniques for all-hadronic measurements of the Higgs self-coupling and sharpen the requirements for future improvement.
Recommender systems have become an integral part of online services, helping users locate specific information in a sea of data. However, existing studies show that some recommender systems are vulnerable to poisoning attacks, particularly those that involve learning schemes. A poisoning attack is one in which an adversary injects carefully crafted data into the process of training a model, with the goal of manipulating the system's final recommendations. With recent advances in artificial intelligence, such attacks have gained renewed importance. While numerous countermeasures to poisoning attacks have been developed, they have not yet been systematically linked to the properties of the attacks. Consequently, assessing the respective risks and potential success of mitigation strategies is difficult, if not impossible. This survey aims to fill this gap by primarily focusing on poisoning attacks and their countermeasures, in contrast to prior surveys that mainly focus on attacks and their detection methods. Through an exhaustive literature review, we provide a novel taxonomy for poisoning attacks, formalise its dimensions, and accordingly organise 30+ attacks described in the literature. Further, we review 40+ countermeasures to detect and/or prevent poisoning attacks, evaluating their effectiveness against specific types of attacks. This comprehensive survey should serve as a point of reference for protecting recommender systems against poisoning attacks. The article concludes with a discussion on open issues in the field and impactful directions for future research. A rich repository of resources associated with poisoning attacks is available at this https URL.
This thesis introduces a novel methodology for the automated generation of knowledge graphs from user stories by leveraging the advanced capabilities of Large Language Models. Building on the LangChain framework, the User Story Graph Transformer module was developed to extract nodes and relationships from user stories with an LLM and construct accurate knowledge graphs. This technique was implemented in a script to fully automate the knowledge graph extraction process. Additionally, the evaluation was automated through a dedicated evaluation script, utilizing an annotated dataset for assessment. By enhancing the visualization and understanding of user requirements and domain concepts, this method fosters better alignment between software functionalities and user expectations, ultimately contributing to more effective and user-centric software development processes.
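A minimal sketch in the spirit of this approach, using LangChain's public experimental graph-transformer API; the thesis' User Story Graph Transformer is a custom module, so the names below follow the public LangChain interface, not the thesis code, and the model choice is an assumption.

```python
from langchain_core.documents import Document
from langchain_experimental.graph_transformers import LLMGraphTransformer
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o", temperature=0)  # example model choice
transformer = LLMGraphTransformer(llm=llm)

# A user story as input text; the LLM extracts graph nodes and relationships.
story = Document(page_content=(
    "As a customer, I want to reset my password so that I can regain "
    "access to my account."
))
graph_docs = transformer.convert_to_graph_documents([story])
for g in graph_docs:
    print(g.nodes)          # e.g. Customer, Password, Account
    print(g.relationships)  # e.g. Customer -WANTS_TO_RESET-> Password
```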
Training and evaluating language models increasingly requires the construction of meta-datasets: diverse collections of curated data with clear provenance. Natural language prompting has recently led to improved zero-shot generalization by transforming existing, supervised datasets into a diversity of novel pretraining tasks, highlighting the benefits of meta-dataset curation. While successful in general-domain text, translating these data-centric approaches to biomedical language modeling remains challenging, as labeled biomedical datasets are significantly underrepresented in popular data hubs. To address this challenge, we introduce BigBIO, a community library of 126+ biomedical NLP datasets, currently covering 12 task categories and 10+ languages. BigBIO facilitates reproducible meta-dataset curation via programmatic access to datasets and their metadata, and is compatible with current platforms for prompt engineering and end-to-end few- and zero-shot language model evaluation. We discuss our process for task schema harmonization, data auditing, and contribution guidelines, and outline two illustrative use cases: zero-shot evaluation of biomedical prompts and large-scale, multi-task learning. BigBIO is an ongoing community effort and is available at this https URL
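Programmatic access goes through the Hugging Face `datasets` library; a brief sketch is below, where the dataset and schema names are illustrative examples rather than a canonical entry point.

```python
from datasets import load_dataset

# BigBIO exposes each dataset in harmonized task schemas (e.g., *_bigbio_te
# for textual entailment, *_bigbio_kb for knowledge-base style tasks).
ds = load_dataset("bigbio/scitail", name="scitail_bigbio_te")
print(ds["train"][0])
```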
The IceCube Neutrino Observatory is a cubic-kilometer-scale high-energy neutrino detector built into the ice at the South Pole. Construction of IceCube, the largest neutrino detector built to date, was completed in 2011 and enabled the discovery of high-energy astrophysical neutrinos. We describe here the design, production, and calibration of the IceCube digital optical module (DOM), the cable systems, computing hardware, and our methodology for drilling and deployment. We also describe the online triggering and data filtering systems that select candidate neutrino and cosmic ray events for analysis. Due to a rigorous pre-deployment protocol, 98.4% of the DOMs in the deep ice are operating and collecting data. IceCube routinely achieves a detector uptime of 99% by emphasizing software stability and monitoring. Detector operations have been stable since construction was completed, and the detector is expected to operate at least until the end of the next decade.
Maximum mean discrepancy (MMD) flows suffer from high computational costs in large-scale computations. In this paper, we show that MMD flows with Riesz kernels $K(x,y) = -\|x-y\|^r$, $r \in (0,2)$, have exceptional properties which allow their efficient computation. We prove that the MMD of Riesz kernels, which is also known as the energy distance, coincides with the MMD of their sliced version. As a consequence, the computation of gradients of MMDs can be performed in the one-dimensional setting. Here, for $r=1$, a simple sorting algorithm can be applied to reduce the complexity from $O(MN+N^2)$ to $O((M+N)\log(M+N))$ for two measures with $M$ and $N$ support points. As another interesting follow-up result, the MMD of compactly supported measures can be estimated from above and below by the Wasserstein-1 distance. For the implementations we approximate the gradient of the sliced MMD by using only a finite number $P$ of slices. We show that the resulting error has complexity $O(\sqrt{d/P})$, where $d$ is the data dimension. These results enable us to train generative models by approximating MMD gradient flows by neural networks even for image applications. We demonstrate the efficiency of our model by image generation on MNIST, FashionMNIST and CIFAR10.
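The $r=1$ sorting trick is easy to state in code. A small NumPy sketch of the $O((M+N)\log(M+N))$ evaluation of the squared energy distance for one-dimensional samples, the per-slice workhorse of the sliced computation (a sketch, not the authors' exact implementation):

```python
import numpy as np

def sum_abs_diffs(z):
    """Sum of |z_i - z_j| over unordered pairs in O(n log n) via sorting."""
    z = np.sort(z)
    n = len(z)
    # After sorting, z_k is the larger element in k pairs and the smaller
    # in (n - 1 - k) pairs, giving coefficient 2k - n + 1.
    return float(np.dot(z, 2.0 * np.arange(n) - n + 1))

def energy_distance_1d(x, y):
    """Squared energy distance 2 E|X-Y| - E|X-X'| - E|Y-Y'| (V-statistic)."""
    m, n = len(x), len(y)
    sx, sy = sum_abs_diffs(x), sum_abs_diffs(y)
    cross = sum_abs_diffs(np.concatenate([x, y])) - sx - sy
    return 2.0 * cross / (m * n) - 2.0 * sx / m**2 - 2.0 * sy / n**2

# For d-dimensional data, the sliced version averages this quantity over P
# random projections x @ theta, with theta uniform on the unit sphere.
```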
The vast majority of materials science knowledge exists in unstructured natural language, yet structured data is crucial for innovative and systematic materials design. Traditionally, the field has relied on manual curation and partial automation for data extraction for specific use cases. The advent of large language models (LLMs) represents a significant shift, potentially enabling efficient extraction of structured, actionable data from unstructured text by non-experts. While applying LLMs to materials science data extraction presents unique challenges, domain knowledge offers opportunities to guide and validate LLM outputs. This review provides a comprehensive overview of LLM-based structured data extraction in materials science, synthesizing current knowledge and outlining future directions. We address the lack of standardized guidelines and present frameworks for leveraging the synergy between LLMs and materials science expertise. This work serves as a foundational resource for researchers aiming to harness LLMs for data-driven materials research. The insights presented here could significantly enhance how researchers across disciplines access and utilize scientific information, potentially accelerating the development of novel materials for critical societal needs.