Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought
BibTex
Copy
@misc{chengSun Oct 26 2025 06:24:15 GMT+0000 (Coordinated Universal Time)visualthoughtsunified,
title={Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought},
author={Zihui Cheng and Qiguang Chen and Xiao Xu and Jiaqi Wang and Weiyun Wang and Hao Fei and Yidong Wang and Alex Jinpeng Wang and Zhi Chen and Wanxiang Che and Libo Qin},
year={Sun Oct 26 2025 06:24:15 GMT+0000 (Coordinated Universal Time)},
eprint={2505.15510},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.15510},
}
Transform this paper into an audio lecture
Get an engaging lecture and Q&A format to quickly understand the paper in minutes, perfect for learning on the go.