Researchers at SylphAI and the University of Texas at Austin developed LLM-AutoDiff, a framework that applies automatic differentiation principles to optimize prompts in complex, multi-component Large Language Model (LLM) workflows. This approach systematically leverages textual gradients to improve performance and efficiency across diverse LLM applications, from single-step classification to multi-hop RAG and agentic systems.
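For readers unfamiliar with the pattern, here is a minimal sketch of the textual-gradient idea behind frameworks like LLM-AutoDiff: prompts are treated as trainable parameters, a critic LLM produces natural-language feedback for each component (the "textual gradient"), and that feedback is used to propose revised prompts. The `call_llm` callable and the prompt templates are illustrative assumptions, not LLM-AutoDiff's actual API.

```python
from typing import Callable, List

def textual_gradient_step(
    prompts: List[str],                 # trainable prompts, one per pipeline component
    inputs: List[str],                  # minibatch of task inputs
    targets: List[str],                 # expected outputs for the minibatch
    call_llm: Callable[[str], str],     # assumed interface to a chat/completion model
) -> List[str]:
    """One optimization step: forward pass, textual 'gradients', prompt update."""
    # Forward pass: chain the components, feeding each output into the next prompt.
    traces = []
    for x, y in zip(inputs, targets):
        state = x
        for p in prompts:
            state = call_llm(f"{p}\n\nInput: {state}")
        traces.append((x, state, y))

    # Backward pass: ask a critic LLM for component-specific feedback.
    errors = "\n".join(f"input={x} | got={o} | expected={y}" for x, o, y in traces)
    new_prompts = []
    for i, p in enumerate(prompts):
        feedback = call_llm(
            f"Component {i} prompt:\n{p}\n\nObserved errors:\n{errors}\n"
            "Explain how this prompt contributed to the errors."
        )
        # 'Apply' the gradient: propose a revised prompt conditioned on the feedback.
        new_prompts.append(call_llm(
            f"Rewrite the prompt to address this feedback.\nPrompt:\n{p}\n\nFeedback:\n{feedback}"
        ))
    return new_prompts
```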
LLM-based prompt optimization, which uses LLM-provided "textual gradients" (feedback) to refine prompts, has emerged as an effective method for automatic prompt engineering. However, its scalability and stability when training on more data remain unclear. We systematically investigate the potential and challenges of scaling training data in textual gradient descent. We show that naively scaling the number of training examples is infeasible due to both explicit context-length limits and an implicit context wall, where long-context degradation yields diminishing returns. Inspired by prior wisdom in stochastic gradient descent, we propose Textual Stochastic Gradient Descent with Momentum (TSGD-M), which reweights updates through momentum sampling, using bootstrapped minibatch validation accuracy as importance weights over historical prompts. We introduce Gumbel-Top-k sampling for prompt generation, balancing exploration and exploitation and improving sampling efficiency while maintaining a low-variance running mean estimator. TSGD-M integrates seamlessly into existing prompt optimization frameworks, including TextGrad, DSPy-COPRO, and AdalFlow, and achieves consistent gains across five benchmarks.
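A minimal sketch of the sampling step described above may help. It assumes each historical prompt carries a momentum-smoothed (running-mean) bootstrapped minibatch validation accuracy, and uses the Gumbel-Top-k trick to draw k parent prompts without replacement, with probability roughly proportional to those importance weights. Names such as `momentum_update` and `gumbel_top_k` are illustrative, not the paper's code.

```python
import math
import random
from typing import Dict, List

def momentum_update(running: Dict[str, float], prompt: str,
                    minibatch_acc: float, beta: float = 0.9) -> None:
    """Keep a low-variance running mean of bootstrapped minibatch validation accuracy."""
    prev = running.get(prompt, minibatch_acc)
    running[prompt] = beta * prev + (1.0 - beta) * minibatch_acc

def gumbel_top_k(running: Dict[str, float], k: int, temperature: float = 1.0) -> List[str]:
    """Sample k historical prompts without replacement, weighted by their running scores.

    Gumbel-Top-k trick: perturb each log-weight with Gumbel(0, 1) noise and keep the top k.
    """
    scored = []
    for prompt, acc in running.items():
        log_w = math.log(max(acc, 1e-8)) / temperature
        u = max(random.random(), 1e-12)
        gumbel = -math.log(-math.log(u))
        scored.append((log_w + gumbel, prompt))
    scored.sort(reverse=True)
    return [prompt for _, prompt in scored[:k]]

# Usage: after evaluating a candidate prompt on a bootstrapped minibatch,
# update its running score, then sample parents for the next generation.
history: Dict[str, float] = {}
momentum_update(history, "Answer step by step.", minibatch_acc=0.62)
momentum_update(history, "Answer concisely.", minibatch_acc=0.55)
parents = gumbel_top_k(history, k=1)
```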
Large language models (LLMs) can translate natural language instructions into executable action plans for robotics, autonomous driving, and other domains. Yet, deploying LLM-driven planning in the physical world demands strict adherence to safety and regulatory constraints, which current models often violate due to hallucination or weak alignment. Traditional data-driven alignment methods, such as Direct Preference Optimization (DPO), require costly human labeling, while recent formal-feedback approaches still depend on resource-intensive fine-tuning. In this paper, we propose LAD-VF, a fine-tuning-free framework that leverages formal verification feedback for automated prompt engineering. By introducing a formal-verification-informed text loss integrated with LLM-AutoDiff, LAD-VF iteratively refines prompts rather than model parameters. This yields three key benefits: (i) scalable adaptation without fine-tuning; (ii) compatibility with modular LLM architectures; and (iii) interpretable refinement via auditable prompts. Experiments in robot navigation and manipulation tasks demonstrate that LAD-VF substantially enhances specification compliance, improving success rates from 60% to over 90%. Our method thus presents a scalable and interpretable pathway toward trustworthy, formally verified LLM-driven control systems.
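As a rough illustration of the loop described above (not LAD-VF's actual implementation), the sketch below treats verifier output as a textual loss: a plan is generated, checked against a formal specification, and any violation trace is fed back to an optimizer LLM that rewrites the prompt rather than the model weights. The `generate_plan`, `verify`, and `refine_prompt` callables are assumed interfaces.

```python
from typing import Callable, Tuple

def verification_guided_refinement(
    prompt: str,
    task: str,
    generate_plan: Callable[[str, str], str],    # (prompt, task) -> action plan
    verify: Callable[[str], Tuple[bool, str]],   # plan -> (satisfies spec?, violation trace)
    refine_prompt: Callable[[str, str], str],    # (prompt, textual loss) -> revised prompt
    max_iters: int = 10,
) -> str:
    """Fine-tuning-free refinement: the prompt, not the weights, absorbs verifier feedback."""
    for _ in range(max_iters):
        plan = generate_plan(prompt, task)
        ok, violation = verify(plan)
        if ok:
            return prompt                        # plan satisfies the formal specification
        # The violation trace acts as a formal-verification-informed text loss.
        textual_loss = f"Plan violated the specification:\n{violation}"
        prompt = refine_prompt(prompt, textual_loss)
    return prompt
```

Because every refinement step is recorded as a prompt edit, the optimization trail stays human-auditable, which is the interpretability benefit the abstract highlights.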