Griffith University
TIME-LLM introduces a reprogramming framework that adapts large language models for general time series forecasting by keeping the LLM backbone frozen. The approach achieves state-of-the-art performance across various benchmarks, excelling particularly in data-scarce few-shot and zero-shot settings.
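To make the reprogramming idea above concrete, here is a minimal PyTorch sketch of the general pattern: a trainable patch embedding and cross-attention layer map time-series patches onto learned text-like prototypes, a frozen backbone (a small stand-in transformer here, not an actual LLM) processes them, and a trainable head produces the forecast. Module names and sizes are illustrative, not TIME-LLM's actual configuration.

```python
# Minimal sketch of the frozen-backbone "reprogramming" pattern behind TIME-LLM,
# assuming a stand-in transformer for the LLM; layer names and sizes are illustrative.
import torch
import torch.nn as nn

class ReprogrammingForecaster(nn.Module):
    def __init__(self, patch_len=16, n_patches=32, d_model=256, n_prototypes=100, horizon=96):
        super().__init__()
        # Trainable pieces: patch embedding, prototype cross-attention, output head.
        self.patch_embed = nn.Linear(patch_len, d_model)
        self.prototypes = nn.Parameter(torch.randn(n_prototypes, d_model))  # stands in for word-embedding prototypes
        self.reprogram = nn.MultiheadAttention(d_model, num_heads=4, batch_first=True)
        self.head = nn.Linear(n_patches * d_model, horizon)

        # Frozen "LLM" backbone (stand-in for a real pre-trained model).
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, num_layers=2)
        for p in self.backbone.parameters():
            p.requires_grad = False

    def forward(self, patches):                      # patches: (B, n_patches, patch_len)
        x = self.patch_embed(patches)                # (B, n_patches, d_model)
        proto = self.prototypes.expand(x.size(0), -1, -1)
        x, _ = self.reprogram(x, proto, proto)       # align patches with text-like prototypes
        x = self.backbone(x)                         # frozen backbone does the heavy lifting
        return self.head(x.flatten(1))               # (B, horizon)

model = ReprogrammingForecaster()
y = model(torch.randn(8, 32, 16))
print(y.shape)  # torch.Size([8, 96])
```

Only the embedding, prototypes, attention, and head are updated during training, which is what keeps adaptation cheap relative to fine-tuning the backbone.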
This survey paper defines and applies a 'full-stack' safety concept for Large Language Models (LLMs), systematically analyzing safety concerns across their entire lifecycle from data to deployment and commercialization. The authors synthesize findings from over 900 papers, providing a unified taxonomy of attacks and defenses while identifying key insights and future research directions for LLM and LLM-agent safety.
GFM-RAG introduces the first graph foundation model specifically designed for Retrieval Augmented Generation (RAG), leveraging a query-dependent Graph Neural Network to capture complex, multi-hop knowledge relationships. This model achieves state-of-the-art retrieval and question answering performance on diverse datasets and generalizes to unseen domains without fine-tuning, significantly enhancing LLM reasoning capabilities.
TIME-MOE introduces a billion-scale time series foundation model leveraging a sparse Mixture-of-Experts architecture to achieve state-of-the-art zero-shot and fine-tuned forecasting performance. The model validates scaling laws for time series, achieving over 20% average MSE reduction and significantly improving computational efficiency.
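For readers unfamiliar with the sparse Mixture-of-Experts pattern that TIME-MOE builds on, the sketch below shows the core routing step: a gate scores all experts per token, only the top-k experts run, and their outputs are combined with renormalized weights. Expert count, hidden sizes, and top-k here are illustrative rather than the paper's settings.

```python
# Minimal sketch of a sparse Mixture-of-Experts feed-forward block; sizes are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, d_model=128, d_hidden=256, n_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.gate = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(), nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                        # x: (tokens, d_model)
        scores = self.gate(x)                    # (tokens, n_experts)
        weights, idx = scores.topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)     # renormalize over the chosen experts
        out = torch.zeros_like(x)
        # Only the top-k experts run per token, which is what keeps compute sparse.
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out

moe = SparseMoE()
print(moe(torch.randn(10, 128)).shape)  # torch.Size([10, 128])
```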
A survey charts the recent trajectory of Compositional Visual Reasoning (CVR) from 2023 to 2025, introducing a five-stage taxonomy to explain its evolution and distinct advantages over monolithic approaches. The work systematically reviews over 260 papers, identifying key benefits such as enhanced interpretability and robustness, while also outlining persistent open challenges and future research directions for the field.
The Graph-constrained Reasoning (GCR) framework integrates Knowledge Graph (KG) structure directly into Large Language Model (LLM) decoding, achieving 100% faithful reasoning without hallucinations on KGQA tasks. This approach consistently outperforms state-of-the-art methods on benchmarks like WebQuestionsSP and Complex WebQuestions by up to 9.1% while being significantly more efficient than agent-based approaches.
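The faithfulness guarantee comes from constraining what the decoder is allowed to emit. The toy sketch below illustrates the general idea under simplifying assumptions: a set of KG paths acts as a prefix index, and at every decoding step any token that would leave the graph is masked out, so every completed sequence is necessarily a valid KG path. The tiny "KG", the stand-in logits function, and the greedy loop are illustrative; GCR applies this kind of constraint to a real LLM over a KG-derived index.

```python
# Toy sketch of graph-constrained decoding: only tokens that extend a real KG path survive.
import torch

# Toy KG paths as token sequences (entity/relation ids already tokenized).
kg_paths = [["Alice", "born_in", "Paris"], ["Alice", "works_at", "Acme"], ["Paris", "capital_of", "France"]]
vocab = sorted({tok for p in kg_paths for tok in p})
tok2id = {t: i for i, t in enumerate(vocab)}

def allowed_next(prefix):
    """Tokens that keep the partial sequence on some KG path."""
    nxt = set()
    for path in kg_paths:
        if path[:len(prefix)] == prefix and len(path) > len(prefix):
            nxt.add(path[len(prefix)])
    return nxt

def constrained_greedy_decode(logits_fn, max_len=3):
    seq = []
    for _ in range(max_len):
        logits = logits_fn(seq)                           # stand-in for LLM next-token logits
        legal = allowed_next(seq)
        if not legal:
            break
        mask = torch.full_like(logits, float("-inf"))
        for t in legal:
            mask[tok2id[t]] = 0.0                         # only KG-consistent tokens survive
        seq.append(vocab[int((logits + mask).argmax())])
    return seq

fake_llm = lambda seq: torch.randn(len(vocab))            # random scores instead of a real model
print(constrained_greedy_decode(fake_llm))                # always a faithful KG path
```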
TIMEMIXER++, developed by researchers from Griffith University, Zhejiang University, and MIT, presents a general-purpose time series pattern machine capable of dynamically capturing patterns across multiple temporal scales and frequency resolutions. The model consistently achieves state-of-the-art performance across 8 diverse time series tasks, including long-term forecasting (reducing MSE on Electricity by 7.3%), imputation (outperforming TimesNet by 25.7% in MSE), and zero-shot forecasting (reducing MSE by 13.1%).
The Reasoning on Graphs (RoG) framework enhances Large Language Model (LLM) reasoning by integrating Knowledge Graph (KG) structural information as explicit reasoning plans. It achieves state-of-the-art performance on KGQA benchmarks, improving Hits@1 by 22.3% and F1 by 14.4% on CWQ, while providing faithful and interpretable explanations grounded in KG paths.
Time-VLM, developed by researchers at The Hong Kong University of Science and Technology (Guangzhou) and collaborators, proposes a unified framework that integrates temporal, visual, and textual modalities using pre-trained Vision-Language Models for time series forecasting. The model demonstrates enhanced generalization in data-scarce settings, outperforming baselines in few-shot and zero-shot scenarios, while maintaining significantly higher computational efficiency compared to existing large language model-based approaches.
Researchers introduce Diversity-Preserving Hybrid Reinforcement Learning (DPH-RL), a framework that leverages mass-covering f-divergences to counter diversity collapse and catastrophic forgetting in large language models fine-tuned with verifiable rewards. DPH-RL improves multi-attempt performance (Pass@k) and out-of-domain generalization, surpassing baselines by up to 8.35% in average out-of-domain performance on mathematical tasks.
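A rough sketch of the "mass-covering" ingredient: instead of the reverse-KL penalty common in RLHF-style training, a forward KL term KL(pi_ref || pi_theta) is estimated on samples from the frozen reference policy, penalizing the fine-tuned model for dropping modes the reference still covers. The toy objective below combines this with a simple verifiable-reward policy-gradient term; the function names and the exact loss are illustrative, not the paper's formulation.

```python
# Illustrative mass-covering regularizer: forward KL(pi_ref || pi_theta) estimated on
# reference-policy samples, added to a toy verifiable-reward policy-gradient term.
import torch

def forward_kl_penalty(ref_logprobs, policy_logprobs):
    """E_{y ~ pi_ref}[log pi_ref(y) - log pi_theta(y)]; large when the policy
    drops probability mass the reference still covers (diversity collapse)."""
    return (ref_logprobs - policy_logprobs).mean()

def dph_style_loss(policy_logprobs_on_rollouts, rewards,
                   ref_logprobs_on_ref_samples, policy_logprobs_on_ref_samples,
                   beta=0.1):
    # Verifiable-reward term (REINFORCE-style surrogate on the policy's own rollouts) ...
    pg = -(rewards.detach() * policy_logprobs_on_rollouts).mean()
    # ... plus a mass-covering penalty computed on reference-policy samples.
    kl = forward_kl_penalty(ref_logprobs_on_ref_samples, policy_logprobs_on_ref_samples)
    return pg + beta * kl

loss = dph_style_loss(
    policy_logprobs_on_rollouts=torch.randn(4, requires_grad=True),
    rewards=torch.tensor([1.0, 0.0, 1.0, 0.0]),
    ref_logprobs_on_ref_samples=torch.randn(8),
    policy_logprobs_on_ref_samples=torch.randn(8, requires_grad=True),
)
loss.backward()
print(float(loss))
```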
This survey paper, authored by researchers from Griffith University and collaborators, provides a comprehensive overview and taxonomy of Graph-Augmented Large Language Model Agents (GLA), synthesizing current advancements and outlining future research directions. It systematically categorizes how graph structures enhance LLM agents in planning, memory, tool use, and multi-agent system coordination and trustworthiness.
Researchers from Monash University and Griffith University present the first comprehensive survey of continual learning techniques tailored for large language models, proposing a novel multi-staged categorization scheme that aligns with the distinct phases of LLM training. The survey identifies specific challenges like "cross-stage forgetting" and outlines key areas for future research to enable LLMs to adapt continuously and sustainably to evolving information, tasks, and human values.
Researchers from Griffith University and collaborators introduce ARG-DESIGNER, an autoregressive graph generation model that designs customized multi-agent system communication topologies from scratch. The model achieves state-of-the-art performance across six benchmarks, including MMLU and HumanEval, while simultaneously reducing token consumption by approximately 50% compared to previous learning-based methods.
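As a loose illustration of autoregressive topology generation, the sketch below adds agent nodes one at a time and thresholds edge probabilities from each new node's candidates, conditioned on the agents already placed. The edge scorer, features, and threshold are toy stand-ins, not ARG-DESIGNER's actual model.

```python
# Toy autoregressive communication-topology generator: nodes are added sequentially
# and edges to existing nodes are scored by a small MLP. All components are illustrative.
import torch
import torch.nn as nn

class EdgeScorer(nn.Module):
    def __init__(self, d=32):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(2 * d, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, new_node, existing):               # (d,), (n, d)
        pairs = torch.cat([existing, new_node.expand_as(existing)], dim=-1)
        return torch.sigmoid(self.mlp(pairs)).squeeze(-1)  # P(edge) per existing node

def generate_topology(node_feats, scorer, threshold=0.5):
    """Autoregressively build a directed communication graph over agent nodes."""
    edges = []
    for i in range(1, len(node_feats)):
        probs = scorer(node_feats[i], node_feats[:i])
        for j, p in enumerate(probs):
            if p.item() > threshold:
                edges.append((j, i))                     # existing agent j messages new agent i
    return edges

scorer = EdgeScorer()
agents = torch.randn(5, 32)                              # e.g. embeddings of agent roles for a task
print(generate_topology(agents, scorer))
```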
This tutorial and survey categorizes Foundation Models for Time Series (TSFMs) by their underlying mechanisms across diverse time series data types, offering a methodology-centric taxonomy of architectures, pre-training strategies, and adaptation methods. It synthesizes current advancements and identifies avenues for future research.
This survey paper from a collaborative team including researchers from Zhejiang University, University of Illinois Chicago, and MBZUAI, offers a systematic review and taxonomy for understanding how graph techniques enhance various functionalities of AI agents. It demonstrates that graphs effectively structure complex information, leading to more capable agents in planning, execution, memory management, and multi-agent coordination, and also highlights how AI agents can advance graph learning tasks.
Diffusion models have been widely applied to time series and spatio-temporal data, enhancing generative, inferential, and downstream capabilities. These models are used across diverse fields such as healthcare, recommendation, climate, energy, audio, and traffic. By treating applications to time series and spatio-temporal data separately, the survey offers a structured perspective on model category, task type, data modality, and practical application domain. It aims to provide a solid foundation for researchers and practitioners and to inspire future innovations that tackle traditional challenges and foster novel solutions in diffusion model-based data mining tasks and applications. An open-sourced repository is available at this https URL.
RemoteCLIP introduces the first vision-language foundation model specifically designed for remote sensing, adapting the CLIP paradigm through an innovative data scaling strategy that unifies heterogeneous annotations. The model achieves State-of-the-Art performance across various remote sensing tasks, including cross-modal retrieval, zero-shot and few-shot classification, and object counting, demonstrating enhanced semantic understanding and generalization capabilities.
Researchers from Rensselaer Polytechnic Institute and collaborators audit existing Knowledge Graph Question Answering (KGQA) datasets, revealing an average factual correctness of only 57%. They introduce KGQAGen, an LLM-guided framework for creating high-quality, verifiable benchmarks, and use it to construct KGQAGen-10k, which achieves 96.3% factual accuracy and highlights retrieval as a primary bottleneck for state-of-the-art KG-RAG models.
TableDART presents a framework for multimodal table understanding that dynamically routes each query-table pair to optimal processing paths (text-only, image-only, or fusion), achieving state-of-the-art performance among open-source models. It outperforms existing multimodal baselines by an average of +4.02% accuracy and reduces inference latency by 24.5% while using nearly 10 times fewer trainable parameters.
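The routing idea can be illustrated in a few lines: a small trainable gate looks at cheap features of the query-table pair and dispatches it to exactly one of three processing paths (text-only, image-only, or multimodal fusion), so the expensive experts only run when selected. The gate, features, and path stubs below are hypothetical stand-ins for TableDART's actual components.

```python
# Toy per-query router choosing one of three table-understanding paths; all pieces are stand-ins.
import torch
import torch.nn as nn

PATHS = ["text_only", "image_only", "fusion"]

class PathRouter(nn.Module):
    def __init__(self, d_feat=64):
        super().__init__()
        self.gate = nn.Sequential(nn.Linear(d_feat, 64), nn.ReLU(), nn.Linear(64, len(PATHS)))

    def forward(self, pair_features):          # features of the (query, table) pair
        return self.gate(pair_features)        # unnormalized path scores

def answer(query, table, path):
    # Stand-ins for the three expert pipelines (an LLM on linearized table text, a VLM on a
    # table image, or a fusion of both); here they just report which path was chosen.
    return f"[{path}] answer for {query!r}"

router = PathRouter()
features = torch.randn(1, 64)                  # e.g. cheap embeddings of query + table statistics
path = PATHS[int(router(features).argmax(dim=-1))]
print(answer("Which region had the highest sales?", table=None, path=path))
```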