symbolic-computation
A novel symbolic regression framework combines multimodal large language models with Kolmogorov-Arnold Networks to infer mathematical expressions from data plots, achieving higher success rates than Mathematica's FindFormula on randomly generated functions while demonstrating effectiveness with both commercial and open-source LLMs.
Peking University researchers develop AI-Newton, a concept-driven framework that autonomously discovers physical laws from experimental data without prior knowledge, successfully rediscovering fundamental laws of Newtonian mechanics while identifying approximately 90 physical concepts and 50 general laws through an iterative knowledge building process.
Knowledge graphs (KGs), which store an extensive number of relational facts, serve various applications. Recently, personalized knowledge graphs (PKGs) have emerged as a way to optimize storage costs by customizing their content to a user's specific interests within particular domains. In the real world, on one hand, user queries and the interests underlying them are inherently evolving, requiring PKGs to adapt continuously; on the other hand, the summary is expected to remain as small as possible in terms of storage cost. However, existing PKG summarization methods implicitly assume that the user's interests are constant and do not shift. Furthermore, when the size constraint on the PKG is extremely small, existing methods cannot distinguish which facts are of more immediate interest and therefore cannot guarantee the utility of the summarized PKG. To address these limitations, we propose APEX$^2$, a highly scalable PKG summarization framework designed with robust theoretical guarantees to excel at adaptive summarization under extremely small size constraints. Specifically, after constructing an initial PKG, APEX$^2$ continuously tracks the shift in interests and adjusts the previous summary. We evaluate APEX$^2$ in an evolving-query setting on benchmark KGs containing up to 12 million triples, summarizing with compression ratios $\leq 0.1\%$. The experiments show that APEX$^2$ outperforms state-of-the-art baselines in both query-answering accuracy and efficiency. Code is available at this https URL.
We present a novel method for symbolic regression (SR), the task of searching for compact programmatic hypotheses that best explain a dataset. The problem is commonly solved using genetic algorithms; we show that we can enhance such methods by inducing a library of abstract textual concepts. Our algorithm, called LaSR, uses zero-shot queries to a large language model (LLM) to discover and evolve concepts occurring in known high-performing hypotheses. We discover new hypotheses using a mix of standard evolutionary steps and LLM-guided steps (obtained through zero-shot LLM queries) conditioned on discovered concepts. Once discovered, hypotheses are used in a new round of concept abstraction and evolution. We validate LaSR on the Feynman equations, a popular SR benchmark, as well as a set of synthetic tasks. On these benchmarks, LaSR substantially outperforms a variety of state-of-the-art SR approaches based on deep learning and evolutionary algorithms. Moreover, we show that LaSR can be used to discover a novel and powerful scaling law for LLMs.
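To make the loop concrete, here is a minimal, hypothetical sketch of concept-guided evolution in the spirit of LaSR; the callbacks (fitness, mutate, llm_abstract_concepts, llm_guided_mutation) are placeholders injected by the caller, not LaSR's actual interface.

import random

def evolve_with_concepts(init_population, dataset, fitness, mutate,
                         llm_abstract_concepts, llm_guided_mutation,
                         n_generations=10):
    """Schematic concept-guided evolutionary search. `fitness` maps a
    hypothesis and dataset to a loss (lower is better); the two llm_*
    callbacks wrap zero-shot LLM queries."""
    population = list(init_population)
    concepts = []                                   # textual concepts discovered so far
    for _ in range(n_generations):
        scored = sorted(population, key=lambda h: fitness(h, dataset))
        elite = scored[: max(1, len(scored) // 5)]  # best-performing hypotheses
        # Abstract what the elite hypotheses have in common into natural-language concepts.
        concepts = llm_abstract_concepts(elite, concepts)
        offspring = []
        for parent in elite:
            if random.random() < 0.5:
                offspring.append(mutate(parent))                         # standard evolutionary step
            else:
                offspring.append(llm_guided_mutation(parent, concepts))  # LLM-guided, concept-conditioned step
        population = elite + offspring
    best = min(population, key=lambda h: fitness(h, dataset))
    return best, concepts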
Researchers from Princeton University introduce CoALA, a conceptual framework that leverages historical cognitive architectures and production systems to organize and guide the design of language agents. It demonstrates CoALA's utility by systematically categorizing existing agents and identifying directions for developing more capable AI systems.
For various $2 \leq n, m \leq 6$, we propose some new algorithms for multiplying an $n \times m$ matrix with an $m \times 6$ matrix over a possibly noncommutative coefficient ring.
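For orientation (added here, not part of the abstract): the classical point of comparison is Strassen's scheme, which multiplies two $2 \times 2$ matrices with 7 multiplications instead of 8 and, because every product keeps factors of $A$ on the left and factors of $B$ on the right, remains valid over noncommutative coefficient rings:

\[
\begin{aligned}
M_1 &= (a_{11}+a_{22})(b_{11}+b_{22}), & M_2 &= (a_{21}+a_{22})\,b_{11}, \\
M_3 &= a_{11}(b_{12}-b_{22}), & M_4 &= a_{22}(b_{21}-b_{11}), \\
M_5 &= (a_{11}+a_{12})\,b_{22}, & M_6 &= (a_{21}-a_{11})(b_{11}+b_{12}), \\
M_7 &= (a_{12}-a_{22})(b_{21}+b_{22}), & &
\end{aligned}
\]
\[
\begin{aligned}
c_{11} &= M_1 + M_4 - M_5 + M_7, & c_{12} &= M_3 + M_5, \\
c_{21} &= M_2 + M_4, & c_{22} &= M_1 - M_2 + M_3 + M_6.
\end{aligned}
\]

The new algorithms in the paper play the analogous game for rectangular shapes with dimensions up to 6.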
PySR is an open-source library for practical symbolic regression, a type of machine learning which aims to discover human-interpretable symbolic models. PySR was developed to democratize and popularize symbolic regression for the sciences, and is built on a high-performance distributed back-end, a flexible search algorithm, and interfaces with several deep learning packages. PySR's internal search algorithm is a multi-population evolutionary algorithm, which consists of a unique evolve-simplify-optimize loop, designed for optimization of unknown scalar constants in newly-discovered empirical expressions. PySR's backend is the extremely optimized Julia library SymbolicRegression.jl, which can be used directly from Julia. It is capable of fusing user-defined operators into SIMD kernels at runtime, performing automatic differentiation, and distributing populations of expressions to thousands of cores across a cluster. In describing this software, we also introduce a new benchmark, "EmpiricalBench," to quantify the applicability of symbolic regression algorithms in science. This benchmark measures recovery of historical empirical equations from original and synthetic datasets.
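As a quick orientation (not from the abstract), a minimal PySR usage sketch looks roughly like the following; exact defaults and argument sets may differ between versions.

import numpy as np
from pysr import PySRRegressor

X = np.random.randn(200, 2)
y = 2.0 * np.cos(X[:, 0]) + X[:, 1] ** 2 - 0.5   # hidden "empirical law" to recover

model = PySRRegressor(
    niterations=40,                          # evolutionary search budget
    binary_operators=["+", "-", "*", "/"],
    unary_operators=["cos", "exp"],
)
model.fit(X, y)    # runs the Julia backend (SymbolicRegression.jl)
print(model)       # discovered expressions, ranked by loss and complexity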
We investigate the local integrability and linearizability of a family of three-dimensional polynomial systems whose linear approximation has eigenvalues $1, \zeta, \zeta^2$, where $\zeta$ is a primitive cubic root of unity. We establish a criterion for the convergence of the Poincaré–Dulac normal form of the systems and examine the relationship between the normal form and integrability. Additionally, we introduce an efficient algorithm to determine the necessary conditions for the integrability of the systems. This algorithm is then applied to a quadratic subfamily of the systems to analyze its integrability and linearizability. Our findings offer insights into the integrability properties of three-dimensional polynomial systems.
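For concreteness (an illustrative standard form, not a formula quoted from the abstract), such a system can be written, after a linear change of coordinates, as

\[
\dot{x} = x + P(x,y,z), \qquad
\dot{y} = \zeta\, y + Q(x,y,z), \qquad
\dot{z} = \zeta^{2} z + R(x,y,z), \qquad \zeta = e^{2\pi i/3},
\]

where $P$, $Q$, $R$ are polynomials without constant or linear terms; local integrability is then typically taken to mean the existence of two functionally independent analytic (or formal) first integrals near the origin, and linearizability means an analytic change of coordinates removing $P$, $Q$, $R$ entirely.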
Real-world phenomena can often be conveniently described by dynamical systems (that is, ODE systems in state-space form). However, if one observes the state of the system only partially, the observed quantities (outputs) and the inputs of the system can typically be related by more complicated differential-algebraic equations (DAEs). Therefore, a natural question (referred to as the realizability problem) is: given a differential-algebraic equation (say, fitted from data), does it come from a partially observed dynamical system? A special case in which the functions involved in the dynamical system are rational is of particular interest. For a single differential-algebraic equation in a single output variable, Forsman has shown that it is realizable by a rational dynamical system if and only if the corresponding hypersurface is unirational, and he turned this into an algorithm in the first-order case. In this paper, we study the more general case of single-input-single-output equations. We show that if a realization by a rational dynamical system exists, the system can be taken to have dimension equal to the order of the DAE. We provide a complete algorithm for first-order DAEs and show, through several examples from the literature, that the same approach can be used for higher-order DAEs.
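To fix notation (a standard formulation, not text from the abstract): the question is whether a given input-output DAE

\[
F\bigl(u, u', \ldots, y, y', \ldots, y^{(h)}\bigr) = 0
\]

can be produced by a partially observed rational dynamical system

\[
\dot{\mathbf{x}} = \mathbf{f}(\mathbf{x}, u), \qquad y = g(\mathbf{x}, u),
\]

with $\mathbf{f}$ and $g$ rational; the dimension result above says that when such a realization exists, one can take $\dim \mathbf{x} = h$, the order of the DAE.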
A framework uses Large Language Models to automatically discover explicit mathematical equations for nonlinear dynamical systems directly from observational data. It accurately identifies governing equations for various PDEs and ODEs, achieving high reconstruction and generalization accuracy while significantly reducing the need for handcrafted algorithms.
We show how polynomial path orders can be employed efficiently in conjunction with weak innermost dependency pairs to automatically certify polynomial runtime complexity of term rewrite systems and the polytime computability of the functions computed. The established techniques have been implemented and we provide ample experimental data to assess the new method.
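As a toy illustration of the kind of statement being certified (my example, not one from the paper): the rewrite system for addition on unary numerals,

\[
\mathsf{add}(\mathsf{0}, y) \to y, \qquad
\mathsf{add}(\mathsf{s}(x), y) \to \mathsf{s}(\mathsf{add}(x, y)),
\]

rewrites $\mathsf{add}(\mathsf{s}^{n}(\mathsf{0}), \mathsf{s}^{m}(\mathsf{0}))$ to normal form in exactly $n+1$ innermost steps, so its runtime complexity is linear, hence polynomial, and the computed function is polytime; the techniques in the paper certify such bounds automatically for far less obvious systems.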
This expository and review paper deals with the Diamond Lemma for ring theory, which is proved in the first section of G. M. Bergman, The Diamond Lemma for Ring Theory, Advances in Mathematics, 29 (1978), pp. 178-218. No originality of the present note is claimed on the part of the author, except for some suggestions and figures. Throughout this paper, I shall mostly use Bergman's expressions in his paper. In Remarks and Notes, the reader will find some useful information on this topic.
We consider the problem of determining multiple steady states for positive real values in models of biological networks. Investigating the potential for multiple steady states in models of the mitogen-activated protein kinase (MAPK) network has consumed considerable effort, typically relying on special insights into the structure of the corresponding models. Here we apply combinations of symbolic computation methods for mixed equality/inequality systems, specifically virtual substitution, lazy real triangularization, and cylindrical algebraic decomposition. We determine multistationarity of an 11-dimensional MAPK network when numeric values are known for all but potentially one parameter. More precisely, the model we consider has 11 equations in 11 variables and 19 parameters, 3 of which are of interest for symbolic treatment, together with positivity conditions on all variables and parameters.
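As a much smaller illustration of the kind of question asked here (a one-dimensional toy using SymPy; it is not the MAPK model and uses none of the paper's methods), one can ask for which values of a parameter a steady-state polynomial has more than one positive real root:

import sympy as sp

x = sp.symbols("x", real=True)

def steady_state_poly(k):
    # Hypothetical one-variable "steady-state" polynomial with parameter k.
    return x**3 - 3*x**2 + 2*x + k

for k in (sp.Rational(1, 10), sp.Rational(1, 2)):
    roots = sp.Poly(steady_state_poly(k), x).real_roots()
    n_positive = sum(1 for r in roots if float(r) > 0)
    print(f"k = {k}: {n_positive} positive steady state(s)")

# Expected output: k = 1/10 admits two positive steady states
# (multistationarity in this toy sense), while k = 1/2 admits none.
# The paper's 11-variable model instead requires virtual substitution,
# real triangularization, and cylindrical algebraic decomposition.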
Explainable Artificial Intelligence (or xAI) has become an important research topic in the fields of Machine Learning and Deep Learning. In this paper, we propose a Genetic Programming (GP) based approach, named Genetic Programming Explainer (GPX), to the problem of explaining decisions computed by AI systems. The method generates a noise set located in the neighborhood of the point of interest, whose prediction should be explained, and fits a local explanation model for the analyzed sample. The tree structure generated by GPX provides a comprehensible analytical, possibly non-linear, symbolic expression which reflects the local behavior of the complex model. We considered three machine learning techniques that can be recognized as complex black-box models (Random Forest, Deep Neural Network, and Support Vector Machine) on twenty data sets covering regression and classification problems. Our results indicate that GPX is able to produce a more accurate understanding of complex models than the state of the art. The results validate the proposed approach as a novel way to deploy GP to improve interpretability.
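The local-surrogate idea itself can be sketched in a few lines (a schematic with scikit-learn stand-ins; GPX fits a symbolic expression with genetic programming, whereas the surrogate below is merely a placeholder interpretable model):

import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
y = np.sin(X[:, 0]) + X[:, 1] * X[:, 2]

black_box = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, y)

x0 = X[0]                                         # point whose prediction we explain
noise_set = x0 + 0.1 * rng.normal(size=(200, 3))  # neighborhood samples around x0
local_labels = black_box.predict(noise_set)       # black-box predictions on the neighborhood

# GPX would fit a GP-evolved symbolic expression here; a shallow tree is a stand-in.
surrogate = DecisionTreeRegressor(max_depth=3).fit(noise_set, local_labels)
print("local fidelity (R^2):", surrogate.score(noise_set, local_labels))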
How are emotions formed? Through extensive debate and the promulgation of diverse theories, the theory of constructed emotion has become prevalent in recent research on emotions. According to this theory, an emotion concept refers to a category formed by interoceptive and exteroceptive information associated with a specific emotion. An emotion concept stores past experiences as knowledge and can predict unobserved information from acquired information. Therefore, in this study, we attempted to model the formation of emotion concepts using a constructionist approach from the perspective of the constructed emotion theory. In particular, we constructed a model using multilayered multimodal latent Dirichlet allocation, which is a probabilistic generative model. We then trained the model for each subject using vision, physiology, and word information obtained from multiple people who experienced different visual emotion-evoking stimuli. To evaluate the model, we verified whether the formed categories matched human subjectivity and determined whether unobserved information could be predicted via the categories. The verification results exceeded chance level, suggesting that emotion concept formation can be explained by the proposed model.
Computational context understanding refers to an agent's ability to fuse disparate sources of information for decision-making and is, therefore, generally regarded as a prerequisite for sophisticated machine reasoning capabilities, such as in artificial intelligence (AI). Data-driven and knowledge-driven methods are two classical techniques in the pursuit of such machine sense-making capability. However, while data-driven methods seek to model the statistical regularities of events by making observations in the real world, they remain difficult to interpret and lack mechanisms for naturally incorporating external knowledge. Conversely, knowledge-driven methods combine structured knowledge bases, perform symbolic reasoning based on axiomatic principles, and are more interpretable in their inferential processing; however, they often lack the ability to estimate the statistical salience of an inference. To combat these issues, we propose the use of hybrid AI methodology as a general framework for combining the strengths of both approaches. Specifically, we inherit the concept of neuro-symbolism as a way of using knowledge bases to guide the learning progress of deep neural networks. We further ground our discussion in two applications of neuro-symbolism and, in both cases, show that our systems maintain interpretability while achieving comparable performance relative to the state of the art.
We propose an effective method for primary decomposition of symmetric ideals. Let $K[X]=K[x_1,\ldots,x_n]$ be the polynomial ring in $n$ variables over a field $K$ and $\mathfrak{S}_n$ the symmetric group on $n$ letters. We consider the canonical action of $\mathfrak{S}_n$ on $K[X]$, i.e. $\sigma(f(x_1,\ldots,x_n))=f(x_{\sigma(1)},\ldots,x_{\sigma(n)})$ for $\sigma\in \mathfrak{S}_n$. An ideal $I$ of $K[X]$ is called \emph{symmetric} if $\sigma(I)=I$ for every $\sigma\in \mathfrak{S}_n$. For a minimal primary decomposition $I=Q_1\cap \cdots \cap Q_r$ of a symmetric ideal $I$, $\sigma(I)=\sigma(Q_1)\cap \cdots \cap \sigma(Q_r)$ is again a minimal primary decomposition of $I$ for every $\sigma\in \mathfrak{S}_n$. We utilize this property to compute a full primary decomposition of $I$ efficiently from partial primary components. We investigate the effectiveness of our algorithm by implementing it in the computer algebra system Risa/Asir.
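A tiny example of the property being exploited (mine, not the paper's): in $K[x_1,x_2]$ the ideal $I=(x_1x_2)$ is symmetric, its minimal primary decomposition is

\[
I = (x_1 x_2) = (x_1) \cap (x_2),
\]

and the transposition $\sigma=(1\,2)$ carries the component $(x_1)$ onto $(x_2)$; thus it suffices to compute one component per orbit and let the group action supply the rest, which is what makes the full decomposition cheaper to obtain from partial components.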
In this report, we summarize the takeaways from the first NeurIPS 2021 NetHack Challenge. Participants were tasked with developing a program or agent that can win (i.e., 'ascend' in) the popular dungeon-crawler game of NetHack by interacting with the NetHack Learning Environment (NLE), a scalable, procedurally generated, and challenging Gym environment for reinforcement learning (RL). The challenge showcased community-driven progress in AI with many diverse approaches significantly beating the previously best results on NetHack. Furthermore, it served as a direct comparison between neural (e.g., deep RL) and symbolic AI, as well as hybrid systems, demonstrating that on NetHack symbolic bots currently outperform deep RL by a large margin. Lastly, no agent got close to winning the game, illustrating NetHack's suitability as a long-term benchmark for AI research.
Traditional logic programming relies on symbolic computation on the CPU, which can limit performance for large-scale inference tasks. Recent advances in GPU hardware enable high-throughput matrix operations, motivating a shift toward parallel logic inference. Boolean Matrix Logic Programming (BMLP) introduces a novel approach to datalog query evaluation using Boolean matrix algebra, well suited to GPU acceleration. Building on this paradigm, we present two GPU-accelerated BMLP algorithms for bottom-up inference over linear dyadic recursive datalog programs. We further extend the BMLP theoretical framework to support general linear recursion with binary predicates. Empirical evaluations on reachability queries in large directed graphs and on the Freebase 15K dataset show that our methods achieve speed-ups of one to four orders of magnitude over state-of-the-art systems. These results demonstrate that Boolean matrix-based reasoning can significantly advance the scalability and efficiency of logic programming on modern hardware. Source code is available at this https URL.
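The core idea, reachability as iterated Boolean matrix products, can be illustrated independently of the paper's systems (a NumPy sketch of the principle, not the authors' GPU implementation):

import numpy as np

def bool_matmul(P, Q):
    # Boolean matrix product: result[i, j] = OR_k (P[i, k] AND Q[k, j]).
    return (P.astype(np.uint8) @ Q.astype(np.uint8)) > 0

# Adjacency matrix of edge/2 over 4 nodes: A[i, j] is True iff edge(i, j).
A = np.array([
    [0, 1, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
], dtype=bool)

# Bottom-up evaluation of:  path(X, Y) :- edge(X, Y).
#                           path(X, Y) :- edge(X, Z), path(Z, Y).
closure = A.copy()
while True:
    updated = closure | bool_matmul(A, closure)   # apply the recursive rule once
    if np.array_equal(updated, closure):
        break                                     # least fixpoint reached
    closure = updated

print(closure.astype(int))   # path/2 relation as a 0/1 matrix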
We present an explicit algorithmic method for computing square roots in quaternion algebras over global fields of characteristic different from 2.
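An elementary illustration of why this is subtler than in fields (my example, stated over the classical Hamilton quaternions rather than a general quaternion algebra over a global field):

\[
\left(\tfrac{1}{\sqrt{2}}(i + j)\right)^{2}
= \tfrac{1}{2}\left(i^{2} + ij + ji + j^{2}\right)
= \tfrac{1}{2}(-1 + k - k - 1) = -1,
\]

so $-1$ has infinitely many square roots in $\mathbb{H}$ (every unit pure quaternion is one); an algorithm must therefore both decide whether a square root exists in the given algebra and produce one explicitly.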