I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow   Transformers with Projected Flow

Ruoyi Du; Dongyang Liu; Le Zhuo; Qin Qi; Hongsheng Li; Zhanyu Ma; Peng; Gao

arXiv:2410.07536·cs.CV·October 16, 2024

I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow

Ruoyi Du, Dongyang Liu, Le Zhuo, Qin Qi, Hongsheng Li, Zhanyu Ma, Peng, Gao

PDF

Open Access

TL;DR

This paper introduces I-Max, a framework that enhances the resolution capabilities of Rectified Flow Transformers in text-to-image generation, improving stability and detail without additional training.

Contribution

I-Max combines a novel Projected Flow strategy and an advanced inference toolkit to enable stable, tuning-free resolution extrapolation in RFTs.

Findings

01

Improved stability in resolution extrapolation.

02

Enhanced image detail and artifact correction.

03

Validated on Lumina-Next-2K and Flux.1-dev datasets.

Abstract

Rectified Flow Transformers (RFTs) offer superior training and inference efficiency, making them likely the most viable direction for scaling up diffusion models. However, progress in generation resolution has been relatively slow due to data quality and training costs. Tuning-free resolution extrapolation presents an alternative, but current methods often reduce generative stability, limiting practical application. In this paper, we review existing resolution extrapolation methods and introduce the I-Max framework to maximize the resolution potential of Text-to-Image RFTs. I-Max features: (i) a novel Projected Flow strategy for stable extrapolation and (ii) an advanced inference toolkit for generalizing model knowledge to higher resolutions. Experiments with Lumina-Next-2K and Flux.1-dev demonstrate I-Max's ability to enhance stability in resolution extrapolation and show that it can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFluid Dynamics and Turbulent Flows · Lattice Boltzmann Simulation Studies

MethodsDiffusion