International Business Machines Corporation (IBM)
EfficientLLM: Efficiency in Large Language Models

A comprehensive empirical evaluation framework assesses efficiency techniques for Large Language Models across architecture pretraining, fine-tuning, and quantization dimensions, revealing key trade-offs between memory usage, compute utilization, latency, throughput and energy consumption while demonstrating effective transfer of findings to vision and multimodal models.

View blog
Resources21
There are no more papers matching your filters at the moment.