Ask or search anything...

History

Events

Watch Recordings

AI for Law01/09 · Joel Niklaus · Hugging Face

Papers Benchmarks

Hot

International Business Machines Corporation (IBM)

EfficientLLM: Efficiency in Large Language Models

20 May 2025

Imperial College London University of Notre Dame logo

University of Notre Dame

A comprehensive empirical evaluation framework assesses efficiency techniques for Large Language Models across architecture pretraining, fine-tuning, and quantization dimensions, revealing key trade-offs between memory usage, compute utilization, latency, throughput and energy consumption while demonstrating effective transfer of findings to vision and multimodal models.

View blog

#computer-science #artificial-intelligence #computation-and-language

Resources 21

853

There are no more papers matching your filters at the moment.

alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Browser Extension

Dark mode

Ask or search anything...

Events