G-Refine: A General Quality Refiner for Text-to-Image Generation

Chunyi Li; Haoning Wu; Hongkun Hao; Zicheng Zhang; Tengchaun Kou,; Chaofeng Chen; Lei Bai; Xiaohong Liu; Weisi Lin; Guangtao Zhai

arXiv:2404.18343·cs.MM·April 30, 2024

G-Refine: A General Quality Refiner for Text-to-Image Generation

Chunyi Li, Haoning Wu, Hongkun Hao, Zicheng Zhang, Tengchaun Kou,, Chaofeng Chen, Lei Bai, Xiaohong Liu, Weisi Lin, Guangtao Zhai

PDF

Open Access 1 Repo

TL;DR

G-Refine is a versatile image quality refiner that enhances AI-generated images by identifying perception and alignment issues, significantly improving quality metrics and facilitating broader adoption of text-to-image models.

Contribution

We introduce G-Refine, a general quality refiner with modules for perception, alignment, and targeted enhancement, improving AI-generated image quality without degrading high-quality images.

Findings

01

Outperforms alternative methods on 10+ quality metrics

02

Effective across 4 different databases

03

Enhances practical application of T2I models

Abstract

With the evolution of Text-to-Image (T2I) models, the quality defects of AI-Generated Images (AIGIs) pose a significant barrier to their widespread adoption. In terms of both perception and alignment, existing models cannot always guarantee high-quality results. To mitigate this limitation, we introduce G-Refine, a general image quality refiner designed to enhance low-quality images without compromising the integrity of high-quality ones. The model is composed of three interconnected modules: a perception quality indicator, an alignment quality indicator, and a general quality enhancement module. Based on the mechanisms of the Human Visual System (HVS) and syntax trees, the first two indicators can respectively identify the perception and alignment deficiencies, and the last module can apply targeted quality enhancement accordingly. Extensive experimentation reveals that when compared…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

q-future/q-refine
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Video Analysis and Summarization · Handwritten Text Recognition Techniques