Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis
Bingxin Ke, Kevin Qu, Tianfu Wang, Nando Metzger, Shengyu Huang, Bo Li, Anton Obukhov, Konrad Schindler

TL;DR
Marigold is a cost-effective method that adapts large pretrained diffusion models for dense image analysis tasks, achieving state-of-the-art zero-shot performance with minimal training.
Contribution
It introduces a novel fine-tuning protocol that leverages pretrained latent diffusion models for dense image analysis, requiring minimal modifications and synthetic data.
Findings
Achieves state-of-the-art zero-shot results on multiple dense prediction tasks.
Requires only small synthetic datasets and minimal training time.
Demonstrates effective adaptation of diffusion models for image analysis.
Abstract
The success of deep learning in computer vision over the past decade has hinged on large labeled datasets and strong pretrained models. In data-scarce settings, the quality of these pretrained models becomes crucial for effective transfer learning. Image classification and self-supervised learning have traditionally been the primary methods for pretraining CNNs and transformer-based architectures. Recently, the rise of text-to-image generative models, particularly those using denoising diffusion in a latent space, has introduced a new class of foundational models trained on massive, captioned image datasets. These models' ability to generate realistic images of unseen content suggests they possess a deep understanding of the visual world. In this work, we present Marigold, a family of conditional generative models and a fine-tuning protocol that extracts the knowledge from pretrained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗prs-eth/marigold-depth-v1-1model· 11k dl· ♡ 2811k dl♡ 28
- 🤗prs-eth/marigold-depth-v1-0model· 109k dl· ♡ 141109k dl♡ 141
- 🤗prs-eth/marigold-depth-lcm-v1-0model· 3.5k dl· ♡ 583.5k dl♡ 58
- 🤗prs-eth/marigold-normals-v0-1model· 5.2k dl· ♡ 45.2k dl♡ 4
- 🤗prs-eth/marigold-normals-lcm-v0-1model· 1.1k dl· ♡ 81.1k dl♡ 8
- 🤗prs-eth/marigold-disparity-affine-v0-1model· 3 dl3 dl
- 🤗prs-eth/marigold-normals-v1-1model· 6.2k dl· ♡ 66.2k dl♡ 6
- 🤗prs-eth/marigold-iid-appearance-v1-1model· 2.0k dl· ♡ 22.0k dl♡ 2
- 🤗prs-eth/marigold-iid-lighting-v1-1model· 2.3k dl· ♡ 62.3k dl♡ 6
- 🤗prs-eth/marigold-depth-hr-v1-1model· 142 dl· ♡ 10142 dl♡ 10
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Image and Signal Denoising Methods · Cell Image Analysis Techniques
MethodsDiffusion
