Enhancing Diffusion Models for High-Quality Image Generation

Jaineet Shah; Michael Gromis; and Rickston Pinto

arXiv:2412.14422·cs.CV·December 20, 2024

Enhancing Diffusion Models for High-Quality Image Generation

Jaineet Shah, Michael Gromis, and Rickston Pinto

PDF

Open Access

TL;DR

This paper enhances diffusion models for image generation by integrating advanced techniques like classifier-free guidance and latent diffusion, improving speed and quality on datasets like CIFAR-10 and ImageNet-100.

Contribution

It introduces optimized methods for diffusion models, including CFG and VAE integration, to boost efficiency and image quality in generative AI.

Findings

01

DDIM + CFG achieves faster inference and higher image quality

02

VAE and noise scheduling present challenges needing further research

03

Models perform well on CIFAR-10 and ImageNet-100 datasets

Abstract

This report presents the comprehensive implementation, evaluation, and optimization of Denoising Diffusion Probabilistic Models (DDPMs) and Denoising Diffusion Implicit Models (DDIMs), which are state-of-the-art generative models. During inference, these models take random noise as input and iteratively generate high-quality images as output. The study focuses on enhancing their generative capabilities by incorporating advanced techniques such as Classifier-Free Guidance (CFG), Latent Diffusion Models with Variational Autoencoders (VAE), and alternative noise scheduling strategies. The motivation behind this work is the growing demand for efficient and scalable generative AI models that can produce realistic images across diverse datasets, addressing challenges in applications such as art creation, image synthesis, and data augmentation. Evaluations were conducted on datasets including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Image Segmentation Techniques · Image Retrieval and Classification Techniques · AI in cancer detection

MethodsDiffusion · Focus