This research clarifies the computational efficiency and implicit bias of Gradient Regularization (GR) in deep learning. The study introduces efficient finite-difference methods for computing GR, demonstrates that GR improves generalization, and establishes theoretical connections to flat minima and to related optimization techniques such as flooding and SAM (Sharpness-Aware Minimization).
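To make the finite-difference idea concrete, here is a minimal sketch, not the paper's implementation. GR adds a penalty (γ/2)·‖∇L(θ)‖² to the loss; its gradient involves a Hessian-vector product H∇L, which a finite difference of two gradient evaluations can approximate without second-order autodiff. The toy quadratic loss, step sizes, and function names below are all illustrative assumptions.

```python
import numpy as np

# Toy quadratic loss L(theta) = 0.5 * theta^T A theta, so grad L(theta) = A @ theta.
# (Stand-in for a network loss; A is an assumed example, not from the paper.)
A = np.array([[3.0, 0.0],
              [0.0, 1.0]])

def grad_loss(theta):
    return A @ theta

def gr_step(theta, lr=0.1, gamma=0.01, eps=1e-3):
    """One gradient step on L + (gamma/2) * ||grad L||^2.

    The penalty's gradient is gamma * H @ g (H = Hessian, g = grad L),
    approximated here by a forward finite difference of the gradient."""
    g = grad_loss(theta)
    hvp = (grad_loss(theta + eps * g) - g) / eps  # ≈ H @ g, two gradient calls
    return theta - lr * (g + gamma * hvp)

theta = np.array([1.0, 1.0])
for _ in range(100):
    theta = gr_step(theta)
```

The key point is cost: the regularized gradient needs only two ordinary gradient evaluations per step, avoiding explicit second-order differentiation.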