SpecMamba: Accelerating Mamba Inference on FPGA with Speculative Decoding
BibTex
Copy
@misc{zhong2025specmambaacceleratingmamba,
title={SpecMamba: Accelerating Mamba Inference on FPGA with Speculative Decoding},
author={Linfeng Zhong and Songqiang Xu and Huifeng Wen and Tong Xie and Qingyu Guo and Yuan Wang and Meng Li},
year={2025},
eprint={2509.19873/metadata},
archivePrefix={arXiv},
primaryClass={cs.AR},
url={https://arxiv.org/abs/2509.19873/metadata},
}
Transform this paper into an audio lecture
Get an engaging lecture and Q&A format to quickly understand the paper in minutes, perfect for learning on the go.