alphaXiv

Bar Ilan University

1,508

12 Jun 2023

computer-science artificial-intelligence computation-and-language

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

ETH Zurich

KAIST

University of Washington Rensselaer Polytechnic Institute

Google DeepMind

University of Amsterdam

University of Illinois at Urbana-Champaign

University of Cambridge Heidelberg University

University of Waterloo Facebook

Carnegie Mellon University

University of Southern California

Google

New York University University of Stuttgart

UC Berkeley

National University of Singapore

University College London

University of Oxford LMU Munich

Shanghai Jiao Tong University

University of California, Irvine

Tsinghua University

Stanford University

University of Michigan

University of Copenhagen

The Chinese University of Hong Kong University of Melbourne

Meta University of Edinburgh

OpenAI

The University of Texas at Austin

Cornell University

University of California, San Diego Yonsei University

McGill University

Boston University University of Bamberg

Nanyang Technological University

Microsoft

KU Leuven

Columbia University UC Santa Barbara

Allen Institute for AI German Research Center for Artificial Intelligence (DFKI)

University of Pennsylvania

Johns Hopkins University

Arizona State University

University of Maryland

University of Tokyo University of North Carolina at Chapel Hill Hebrew University of Jerusalem Amazon Tilburg University University of Massachusetts Amherst University of Rochester University of Duisburg-Essen Sapienza University of Rome University of Sheffield

Princeton University

HKUST University of Tübingen TU Berlin Saarland University Technical University of Darmstadt University of Haifa University of Trento University of Montreal Bilkent University University of Cape Town Bar Ilan University IBM University of Mannheim

ServiceNow Potsdam University Polish-Japanese Academy of Information Technology Salesforce ASAPP AI21 Labs Valencia Polytechnic University University of Trento, Italy

Allen Nie

Jos Rozen

+13

A large-scale and diverse benchmark, BIG-bench, was introduced to rigorously evaluate the capabilities and limitations of large language models across 204 tasks. The evaluation revealed that even state-of-the-art models currently achieve aggregate scores below 20 (on a 0-100 normalized scale), indicating significantly lower performance compared to human experts.
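As a rough illustration of what a 0-100 normalized scale can mean, the sketch below assumes min-max rescaling between a task's lowest expected score and its highest attainable score. The function name, the example task, and its numbers are hypothetical and are not taken from the paper.

```python
def normalize_score(raw, low, high):
    """Rescale a raw task score so that `low` maps to 0 and `high` maps to 100."""
    if high == low:
        raise ValueError("high and low scores must differ")
    return 100.0 * (raw - low) / (high - low)


# Hypothetical 4-way multiple-choice task: chance accuracy 0.25, perfect 1.0.
score = normalize_score(raw=0.40, low=0.25, high=1.0)
print(round(score, 1))
```

Under this assumption, a model at chance level scores 0 however many answer options a task has, which makes scores comparable across tasks before they are averaged.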

310

03 Sep 2025

agents chain-of-thought computer-science

MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents

Tel Aviv University

Allen Institute for AI

University of Pennsylvania Bar Ilan University Oracle AI

MoNaCo introduces a benchmark of 1,315 natural, complex, and time-consuming information-seeking questions that require reasoning across dozens of documents, demonstrating that frontier Large Language Models achieve an F1 score of only 61.2% and struggle significantly with extensive information aggregation and retrieval robustness.

834

22 Sep 2024

computer-science artificial-intelligence robotics

MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting

NVIDIA Simon Fraser University Bar Ilan University

MaskedMimic introduces a unified framework for physics-based character control by reformulating it as a masked motion inpainting problem. This approach allows a single system to generate physically plausible full-body motions from various partial constraints, including full-body tracking, VR inputs, object interactions, path following, and text-to-motion synthesis.

450

03 Apr 2025

computer-science computation-and-language computers-and-society

LEACE: Perfect linear concept erasure in closed form

ETH Zürich Booz Allen Hamilton Bar Ilan University EleutherAI

Nora Belrose

LEACE presents a closed-form, provably optimal solution for linear concept erasure in machine learning models that minimizes disruption to the original embedding. This method enables 'concept scrubbing' for multi-layer interventions in deep neural networks, demonstrating high effectiveness in debiasing large language models and precise causal probing with minimal impact on main-task performance.

354

05 Sep 2022

computer-science computation-and-language machine-learning

BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models

Allen Institute for Artificial Intelligence Bar Ilan University

BitFit, developed by researchers at Bar Ilan University and AI2, proposes a method for fine-tuning transformer-based masked language models by training only bias terms and the classification layer. This approach achieves performance comparable to full fine-tuning on GLUE benchmarks while modifying as little as 0.08% of total model parameters and often outperforming full fine-tuning in data-scarce scenarios.
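The selection rule described above (train only bias terms and the classification layer) can be sketched as a filter over parameter names. The parameter table below is hypothetical: the names and sizes are invented for illustration and do not correspond to any specific model or to the authors' code.

```python
# Hypothetical parameter table: name -> number of scalar weights.
# Names and sizes are made up for illustration only.
PARAMS = {
    "encoder.layer0.attention.query.weight": 768 * 768,
    "encoder.layer0.attention.query.bias": 768,
    "encoder.layer0.ffn.dense.weight": 768 * 3072,
    "encoder.layer0.ffn.dense.bias": 3072,
    "classifier.weight": 768 * 2,
    "classifier.bias": 2,
}


def bitfit_trainable(name):
    """Train bias terms and the classification layer; freeze everything else."""
    return name.endswith(".bias") or name.startswith("classifier.")


def trainable_fraction(params):
    """Share of scalar weights that the filter leaves trainable."""
    total = sum(params.values())
    trainable = sum(n for name, n in params.items() if bitfit_trainable(name))
    return trainable / total


trainable_names = sorted(name for name in PARAMS if bitfit_trainable(name))
fraction = trainable_fraction(PARAMS)
print(trainable_names)
print(f"{fraction:.4%} of parameters are trainable")
```

In a real training setup the same predicate would typically decide which parameters receive gradient updates. The fraction printed here reflects only the toy table, not the 0.08% figure reported in the summary.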

142

252

16 Oct 2025

computer-science computer-vision-and-pattern-recognition generative-models

OmnimatteZero: Fast Training-free Omnimatte with Pre-trained Video Diffusion Models

NVIDIA The Hebrew University of Jerusalem Bar Ilan University OriginAI

In Omnimatte, one aims to decompose a given video into semantically meaningful layers, including the background and individual objects along with their associated effects, such as shadows and reflections. Existing methods often require extensive training or costly self-supervised optimization. In this paper, we present OmnimatteZero, a training-free approach that leverages off-the-shelf pre-trained video diffusion models for omnimatte. It can remove objects from videos, extract individual object layers along with their effects, and composite those objects onto new videos. These are accomplished by adapting zero-shot image inpainting techniques for video object removal, a task they fail to handle effectively out-of-the-box. To overcome this, we introduce temporal and spatial attention guidance modules that steer the diffusion process for accurate object removal and temporally consistent background reconstruction. We further show that self-attention maps capture information about the object and its footprints and use them to inpaint the object's effects, leaving a clean background. Additionally, through simple latent arithmetic, object layers can be isolated and recombined seamlessly with new video layers to produce new videos. Evaluations show that OmnimatteZero not only achieves superior performance in terms of background reconstruction but also sets a new record for the fastest Omnimatte approach, achieving real-time performance with minimal frame runtime.

106

25 Oct 2025

computer-science computation-and-language machine-learning

SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training

Tel Aviv University Bar Ilan University Ben Gurion University

Ofir Lindenbaum

Low-rank gradient-based optimization methods have significantly improved memory efficiency during the training of large language models (LLMs), enabling operations within constrained hardware without sacrificing performance. However, these methods primarily emphasize memory savings, often overlooking potential acceleration in convergence due to their reliance on standard isotropic steepest descent techniques, which can perform suboptimally in the highly anisotropic landscapes typical of deep networks, particularly LLMs. In this paper, we propose SUMO (Subspace-Aware Moment-Orthogonalization), an optimizer that employs exact singular value decomposition (SVD) for moment orthogonalization within a dynamically adapted low-dimensional subspace, enabling norm-inducing steepest descent optimization steps. By explicitly aligning optimization steps with the spectral characteristics of the loss landscape, SUMO effectively mitigates approximation errors associated with commonly used methods like Newton-Schulz orthogonalization approximation. We theoretically establish an upper bound on these approximation errors, proving their dependence on the condition numbers of moments, conditions we analytically demonstrate are encountered during LLM training. Furthermore, we both theoretically and empirically illustrate that exact orthogonalization via SVD substantially improves convergence rates while reducing overall complexity. Empirical evaluations confirm that SUMO accelerates convergence, enhances stability, improves performance, and reduces memory requirements by up to 20% compared to state-of-the-art methods.

566

29 Apr 2020

computer-science computation-and-language machine-learning

Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection

Allen Institute for Artificial Intelligence Bar Ilan University

Researchers from Bar Ilan University and AI2 developed Iterative Nullspace Projection (INLP), a data-driven method to deterministically remove specific linear information from neural representations. This approach effectively mitigated gender bias in word embeddings and achieved fairer classification performance, often with minimal impact on main task accuracy.
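A minimal sketch of the projection idea, under a stated simplification: instead of training a linear classifier each round, as the method's description implies, the direction to remove is the difference between the two class means. All function names and the toy data are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def class_mean(rows, labels, target):
    picked = [r for r, y in zip(rows, labels) if y == target]
    return [sum(col) / len(picked) for col in zip(*picked)]


def project_out(rows, direction):
    """Remove the component along a unit-length direction from every row."""
    return [[x - dot(r, direction) * d for x, d in zip(r, direction)] for r in rows]


def iterative_projection(rows, labels, n_iters=2, tol=1e-9):
    """Repeatedly find a class-separating direction and project it out.

    Simplification: the direction is the difference of class means,
    standing in for the weights of a trained linear classifier.
    """
    for _ in range(n_iters):
        diff = [a - b for a, b in zip(class_mean(rows, labels, 1),
                                      class_mean(rows, labels, 0))]
        norm = math.sqrt(dot(diff, diff))
        if norm < tol:  # no mean gap left to remove
            break
        direction = [d / norm for d in diff]
        rows = project_out(rows, direction)
    return rows


X = [[2.0, 0.1, 1.0], [1.8, -0.2, 0.5], [-2.1, 0.0, 0.9], [-1.9, 0.3, 0.4]]
y = [1, 1, 0, 0]
X_clean = iterative_projection(X, y)
gap = [a - b for a, b in zip(class_mean(X_clean, y, 1), class_mean(X_clean, y, 0))]
print(max(abs(g) for g in gap))
```

With a mean-difference direction, a single pass already closes the gap between class means, so the loop exits early. With a trained classifier, several rounds are generally needed because each new classifier can pick up a different residual direction.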

417

12 Oct 2022

computer-science computation-and-language efficient-transformers

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Allen Institute for AI Bar Ilan University

Feed-Forward Network (FFN) layers in Transformer-based Language Models build predictions by dynamically promoting human-interpretable concepts within the vocabulary space, rather than primarily eliminating tokens. Decomposing FFN outputs into individual 'sub-updates' provides a mechanistic understanding that enables practical applications in controlling generation and improving computational efficiency.

101

04 Sep 2025

computer-science contrastive-learning artificial-intelligence

NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings

Bar Ilan University

We present NER Retriever, a zero-shot retrieval framework for ad-hoc Named Entity Retrieval, a variant of Named Entity Recognition (NER), where the types of interest are not provided in advance, and a user-defined type description is used to retrieve documents mentioning entities of that type. Instead of relying on fixed schemas or fine-tuned models, our method builds on internal representations of large language models (LLMs) to embed both entity mentions and user-provided open-ended type descriptions into a shared semantic space. We show that internal representations, specifically the value vectors from mid-layer transformer blocks, encode fine-grained type information more effectively than commonly used top-layer embeddings. To refine these representations, we train a lightweight contrastive projection network that aligns type-compatible entities while separating unrelated types. The resulting entity embeddings are compact, type-aware, and well-suited for nearest-neighbor search. Evaluated on three benchmarks, NER Retriever significantly outperforms both lexical and dense sentence-level retrieval baselines. Our findings provide empirical support for representation selection within LLMs and demonstrate a practical solution for scalable, schema-free entity retrieval. The NER Retriever codebase is publicly available.

05 Apr 2022

physics quantum-physics

Dynamical nonlocality in quantum time via modular operators

Bar Ilan University Gdansk University of Technology National Quantum Information Center of Gdansk

We formalize the concept of the modular energy operator within the Page and Wootters timeless framework. As a result, this operator is elevated to the same status as the more studied modular operators of position and momentum. In analogy with dynamical nonlocality in space associated with the modular momentum, we introduce and analyze the nonlocality in time associated with the modular energy operator. Some applications of our formalization are provided through illustrative examples.

239

13 Aug 2025

attention-mechanisms computer-science computer-vision-and-pattern-recognition

Story2Board: A Training-Free Approach for Expressive Storyboard Generation

Hebrew University of Jerusalem Bar Ilan University OriginAI

The Story2Board framework enables training-free generation of expressive, multi-panel storyboards from natural language, maintaining strong character consistency while allowing for diverse scene compositions and narrative progression. It achieves this by employing an LLM-based prompt decomposition alongside two novel in-context consistency mechanisms during the diffusion model's denoising process.

169

14 Sep 2025

computer-science continual-learning artificial-intelligence

Gradient Free Deep Reinforcement Learning With TabPFN

Meta Bar Ilan University

Gradient-based optimization is fundamental to most modern deep reinforcement learning algorithms; however, it introduces significant sensitivity to hyperparameters, unstable training dynamics, and high computational costs. We propose TabPFN RL, a novel gradient-free deep RL framework that repurposes the meta-trained transformer TabPFN as a Q-function approximator. Originally developed for tabular classification, TabPFN is a transformer pre-trained on millions of synthetic datasets to perform inference on new unseen datasets via in-context learning. Given an in-context dataset of sample-label pairs and new unlabeled data, it predicts the most likely labels in a single forward pass, without gradient updates or task-specific fine-tuning. We use TabPFN to predict Q-values using inference only, thereby eliminating the need for back-propagation at both training and inference. To cope with the model's fixed context budget, we design a high-reward episode gate that retains only the top 5% of trajectories. Empirical evaluations on the Gymnasium classic control suite demonstrate that TabPFN RL matches or surpasses Deep Q-Network on CartPole-v1, MountainCar-v0, and Acrobot-v1, without applying gradient descent or any extensive hyperparameter tuning. We discuss the theoretical aspects of how bootstrapped targets and non-stationary visitation distributions violate the independence assumptions encoded in TabPFN's prior, yet the model retains a surprising generalization capacity. We further formalize the intrinsic context size limit of in-context RL algorithms and propose principled truncation strategies that enable continual learning when the context is full. Our results establish prior-fitted networks such as TabPFN as a viable foundation for fast and computationally efficient RL, opening new directions for gradient-free RL with large pre-trained transformers.
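The episode gate mentioned in this abstract can be sketched as a simple top-k filter over episodes ranked by total reward. This is an illustrative assumption about how such a gate might work, not the authors' implementation; the function name, the `(episode_id, total_reward)` layout, and the rounding rule are all hypothetical.

```python
def high_reward_gate(episodes, keep_percent=5):
    """Keep only the top `keep_percent` percent of episodes by total reward.

    `episodes` is a list of (episode_id, total_reward) pairs; this layout
    is an assumption made for the sketch. At least one episode is kept.
    """
    if not episodes:
        return []
    # Integer ceiling of len(episodes) * keep_percent / 100.
    n_keep = max(1, (len(episodes) * keep_percent + 99) // 100)
    ranked = sorted(episodes, key=lambda ep: ep[1], reverse=True)
    return ranked[:n_keep]


# Toy data: episode i earned a total reward of i.
episodes = [(i, float(i)) for i in range(100)]
kept = high_reward_gate(episodes)
print([ep_id for ep_id, _ in kept])
```

Filtering by return keeps the in-context dataset within a fixed budget while biasing it toward successful behavior. How ties and very small buffers are handled is a design choice left open here.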

157

15 Jun 2025

agentic-frameworks agents chain-of-thought

DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs

Google Research Bar Ilan University

This Google Research paper proposes a comprehensive taxonomy of knowledge conflict types in search-augmented LLMs, accompanied by a new expert-annotated benchmark, CONFLICTS. The work evaluates how effectively LLMs identify and address these conflicts, demonstrating that explicitly providing conflict type information or using taxonomy-aware prompting strategies significantly improves the appropriateness and style of LLM responses.

2,020

12 Nov 2024

attention-mechanisms computer-science artificial-intelligence

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Tel Aviv University

NVIDIA Bar Ilan University

Add-it, a new training-free method, enables seamless and contextually plausible object insertion into images based on textual instructions. It achieves state-of-the-art performance, significantly improving object placement plausibility on a new benchmark by raising the affordance score from 47.4% to 82.8%.

382

10 Jul 2024

computer-science artificial-intelligence computation-and-language

Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Allen Institute for AI Bar Ilan University

Mosh

Alon Jacoby

This research from Bar-Ilan University and the Allen Institute for AI systematically investigates how increasing input length impacts the reasoning performance of Large Language Models using the controlled FLenQA framework. The study reveals significant performance degradation across models as input length grows, even when the core reasoning task remains constant, and identifies specific length-induced failure modes such as increased refusal rates and reduced CoT coverage.

02 Sep 2025

attention-mechanisms computer-science computer-vision-and-pattern-recognition

Data-Driven Loss Functions for Inference-Time Optimization in Text-to-Image Generation

NVIDIA Bar Ilan University

Text-to-image diffusion models can generate stunning visuals, yet they often fail at tasks children find trivial--like placing a dog to the right of a teddy bear rather than to the left. When combinations get more unusual--a giraffe above an airplane--these failures become even more pronounced. Existing methods attempt to fix these spatial reasoning failures through model fine-tuning or test-time optimization with handcrafted losses that are suboptimal. Rather than imposing our assumptions about spatial encoding, we propose learning these objectives directly from the model's internal representations. We introduce Learn-to-Steer, a novel framework that learns data-driven objectives for test-time optimization rather than handcrafting them. Our key insight is to train a lightweight classifier that decodes spatial relationships from the diffusion model's cross-attention maps, then deploy this classifier as a learned loss function during inference. Training such classifiers poses a surprising challenge: they can take shortcuts by detecting linguistic traces rather than learning true spatial patterns. We solve this with a dual-inversion strategy that enforces geometric understanding. Our method dramatically improves spatial accuracy: from 0.20 to 0.61 on FLUX.1-dev and from 0.07 to 0.54 on SD2.1 across standard benchmarks. Moreover, our approach generalizes to multiple relations and significantly improves accuracy.

348

01 Sep 2025

adversarial-robustness computer-science artificial-intelligence

Intrinsic Test of Unlearning Using Parametric Knowledge Traces

South China University of Technology

University of Toronto

Tel Aviv University Bar Ilan University

International Digital Economy Academy (IDEA)

Yihuai Hong

The task of "unlearning" certain concepts in large language models (LLMs) has attracted immense attention recently, due to its importance in mitigating undesirable model behaviours, such as the generation of harmful, private, or incorrect information. Current protocols to evaluate unlearning methods largely rely on behavioral tests, without monitoring the presence of unlearned knowledge within the model's parameters. This residual knowledge can be adversarially exploited to recover the erased information post-unlearning. We argue that unlearning should also be evaluated internally, by considering changes in the parametric knowledge traces of the unlearned concepts. To this end, we propose a general evaluation methodology that leverages vocabulary projections to inspect concepts encoded in model parameters. We use this approach to localize "concept vectors" - parameter vectors that encode concrete concepts - and construct ConceptVectors, a benchmark dataset containing hundreds of common concepts and their parametric knowledge traces within two open-source LLMs. Evaluation on ConceptVectors shows that existing unlearning methods minimally impact concept vectors and mostly suppress them during inference, while directly ablating these vectors demonstrably removes the associated knowledge and significantly reduces the model's susceptibility to adversarial manipulation. Our results highlight limitations in behavioral-based unlearning evaluations and call for future work to include parameter-based evaluations. To support this, we release our code and benchmark.

171

04 Jun 2025

computer-science computation-and-language computers-and-society

Representation Surgery: Theory and Practice of Affine Steering

ETH Zurich

Google Research IIIT Hyderabad Bar Ilan University

Roee Aharoni

Language models often exhibit undesirable behavior, e.g., generating toxic or gender-biased text. In the case of neural language models, an encoding of the undesirable behavior is often present in the model's representations. Thus, one natural (and common) approach to prevent the model from exhibiting undesirable behavior is to steer the model's representations in a manner that reduces the probability of it generating undesirable text. This paper investigates the formal and empirical properties of steering functions, i.e., transformation of the neural language model's representations that alter its behavior. First, we derive two optimal, in the least-squares sense, affine steering functions under different constraints. Our theory provides justification for existing approaches and offers a novel, improved steering approach. Second, we offer a series of experiments that demonstrate the empirical effectiveness of the methods in mitigating bias and reducing toxic generation.

537

21 May 2024

computer-science computation-and-language explainable-ai

A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains

Google DeepMind

Tel Aviv University

Google Research Bar Ilan University

Roee Aharoni

REVEAL, a new benchmark for evaluating Chain-of-Thought (CoT) verifiers, provides fine-grained, step-level annotations for LLM reasoning, demonstrating that current models frequently introduce factual errors in their reasoning steps. The dataset enables a more transparent, process-oriented assessment of LLM reasoning beyond final answer correctness.
