From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding

BibTeX
@Article{Zou2024FromST,
 author = {Heqing Zou and Tianze Luo and Guiyang Xie and Victor Zhang and Fengmao Lv and Guangcong Wang and Juanyang Chen and Zhuochen Wang and Hansheng Zhang and Huaijian Zhang},
 booktitle = {arXiv.org},
 journal = {ArXiv},
 title = {From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding},
 volume = {abs/2409.18938},
 year = {2024}
}
GitHub: LV-LLMs
HTTPS: https://github.com/Vincent-ZHQ/LV-LLMs
SSH: git@github.com:Vincent-ZHQ/LV-LLMs.git
CLI: gh repo clone Vincent-ZHQ/LV-LLMs