alphaXiv

Univ. of Modena and

29 Jul 2022

computer-science artificial-intelligence computation-and-language

ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval

ISTI-CNR Univ. of Modena and Reggio Emilia

ALADIN introduces a two-stage architecture that distills fine-grained alignment scores into an efficient common embedding space, enabling high-performance image-text matching and retrieval. The model achieves competitive recall while demonstrating up to a 90-fold increase in inference speed compared to entangled Vision-Language Transformers.

There are no more papers matching your filters at the moment.

Events

AI for Law
Joel Niklaus· Hugging Face
01/09
Register
Watch recordings

Personalize Your Feed

Install Browser Extension

We're hiring

alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Dark mode

ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval

Events

AI for Law

Personalize Your Feed