Senmao Li1,3, Kai Wang2, Salman Khan3, Fahad Shahbaz Khan3,4, Jian Yang1, Yaxing Wang1,
1 Nankai University, 2 City University of Hong Kong (Dongguan), China, 3 MBZUAI, 4 Linkoping University
Figure 1. Overview of the proposed FasterVAR framework. We retain the original VAR inference process for the semantic and structure establishment stages, while exploiting semantic irrelevance and low-rank properties in the fidelity refinement stage to accelerate inference.
Figure 2. Qualitative comparison with the vanilla Infinity-2B, Infinity-8B, and STAR models (1st, 3rd, and 5th rows). Our StageVAR (2nd, 4th, and 6th rows) achieves a 3.4x, 2.7x, and 1.74x speedup while maintaining performance.
Please cite our paper if you find this work useful for your research:
@inproceedings{li2026icml,
title = {FasterVAR: Plug-and-Play Acceleration for Visual Autoregressive Models},
author = {Li, Senmao and Wang, Kai and Khan, Salman and Khan, Fahad Shahbaz and Yang, Jian and Wang, Yaxing},
booktitle = {ICML},
year = {2026},
}β If FasterVAR is helpful to your projects, please help star this repo. Thanks! π€

