ARFlow: Autoregressive Flow with Hybrid Linear Attention
Mude Hui, Rui-Jie Zhu, Songlin Yang, Yu Zhang, Zirui Wang, Yuyin Zhou, Jason Eshraghian, Cihang Xie

TL;DR
ARFlow integrates autoregressive modeling with flow models and a hybrid linear attention mechanism to improve image generation quality and capture long-range dependencies more effectively.
Contribution
This paper introduces ARFlow, a novel flow model that combines autoregressive training and a custom hybrid linear attention for enhanced image generation.
Findings
Achieves 6.63 FID on ImageNet 256x256 without guidance.
Outperforms previous flow-based models like SiT.
Demonstrates effectiveness of chunk-wise attention and autoregressive training.
Abstract
Flow models are effective at progressively generating realistic images, but they generally struggle to capture long-range dependencies during the generation process as they compress all the information from previous time steps into a single corrupted image. To address this limitation, we propose integrating autoregressive modeling -- known for its excellence in modeling complex, high-dimensional joint probability distributions -- into flow models. During training, at each step, we construct causally-ordered sequences by sampling multiple images from the same semantic category and applying different levels of noise, where images with higher noise levels serve as causal predecessors to those with lower noise levels. This design enables the model to learn broader category-level variations while maintaining proper causal relationships in the flow process. During generation, the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI) · Time Series Analysis and Forecasting
MethodsSoftmax · Attention Is All You Need
