FGM-HD: Boosting Generation Diversity of Fractal Generative Models through Hausdorff Dimension Induction
Haowei Zhang, Yuanpei Zhao, Ji-Zhe Zhou, Mao Li

TL;DR
This paper introduces FGM-HD, a novel fractal generative model that leverages Hausdorff Dimension to significantly enhance output diversity while maintaining image quality, through a learnable HD estimation, a progressive training strategy, and HD-guided rejection sampling.
Contribution
It is the first work to incorporate Hausdorff Dimension into Fractal Generative Models, improving diversity with a learnable HD estimation and a novel training and sampling strategy.
Findings
39% increase in output diversity on ImageNet
Maintains comparable image quality to baseline FGMs
First integration of Hausdorff Dimension into FGM
Abstract
Improving the diversity of generated results while maintaining high visual quality remains a significant challenge in image generation tasks. Fractal Generative Models (FGMs) are efficient in generating high-quality images, but their inherent self-similarity limits the diversity of output images. To address this issue, we propose a novel approach based on the Hausdorff Dimension (HD), a widely recognized concept in fractal geometry used to quantify structural complexity, which aids in enhancing the diversity of generated outputs. To incorporate HD into FGM, we propose a learnable HD estimation method that predicts HD directly from image embeddings, addressing computational cost concerns. However, simply introducing HD into a hybrid loss is insufficient to enhance diversity in FGMs due to: 1) degradation of image quality, and 2) limited improvement in generation diversity. To this end,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Image Enhancement Techniques
