EVAR: Edge Visual Autoregressive Models via Principled Pruning
Zefang Wang, Yanyu Li, Mingluo Su, Simin Xu, Guanzhong Tian, and Huan Wang
In arXiv, 2026
We propose a principled structured pruning method for visual autoregressive models, based on Optimal Brain Surgeon (OBS). Our approach introduces progressive scale-aware distillation to address gradient imbalance during next-scale autoregressive fine-tuning. On edge devices, the method achieves a 1.8× speedup with only 10% quality loss in single-image generation.