Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models
BibTex
Copy
@misc{globerson2024visualriddlescommonsense,
title={Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models},
author={Amir Globerson and Yuval Elovici and Idan Szpektor and Aviv Slobodkin and Aviya Maimon and Yonatan Bitton and Royi Rassin and Nitzan Bitton-Guetta and Eliya Habba},
year={2024},
eprint={2407.19474},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2407.19474},
}
Transform this paper into an audio lecture
Get an engaging lecture and Q&A format to quickly understand the paper in minutes, perfect for learning on the go.