ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Shaozhe Hao; Kai Han; Zhengyao Lv; Shihao Zhao; Kwan-Yee K. Wong

arXiv:2407.07077 · cs.CV · July 10, 2024

TL;DR

ConceptExpress introduces an unsupervised method to extract and recreate multiple concepts from a single image using pretrained diffusion models, eliminating the need for human annotations.

Contribution

The paper proposes an unsupervised concept extraction framework that draws on capabilities inherent to pretrained diffusion models, including spatial localization and token association, for multi-concept understanding.

Findings

01. Effective localization of salient concepts via diffusion self-attention
02. Discriminative token learning for individual concepts
03. Promising results on the UCE evaluation protocol
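
To make the first finding more concrete, the following is a minimal sketch of how spatial locations could be grouped into concept regions by comparing their self-attention rows. It is an illustration under stated assumptions, not the paper's procedure: the attention matrix is synthetic rather than taken from a diffusion model, plain k-means stands in for whatever clustering the authors use, and the names `synthetic_self_attention` and `cluster_attention` are hypothetical.

```python
import numpy as np

def synthetic_self_attention(h=4, w=4):
    """Toy stand-in for a diffusion self-attention map (assumption, not real model output).

    Each flattened location attends uniformly to locations in its own half
    of the grid (left or right), so the map contains two clear regions.
    """
    cols = np.tile(np.arange(w), h)           # column index of each flattened location
    region = (cols >= w // 2).astype(int)     # 0 = left half, 1 = right half
    attn = (region[:, None] == region[None, :]).astype(float)
    attn /= attn.sum(axis=1, keepdims=True)   # rows sum to 1, like a softmax output
    return attn, region

def cluster_attention(attn, k, iters=10):
    """Group locations whose attention rows are similar (plain k-means stand-in)."""
    # Farthest-first initialisation keeps the toy example deterministic.
    centers = [attn[0]]
    for _ in range(1, k):
        d = np.min([np.linalg.norm(attn - c, axis=1) for c in centers], axis=0)
        centers.append(attn[np.argmax(d)])
    centers = np.stack(centers)
    for _ in range(iters):
        # Distance from every attention row to every cluster center.
        dists = np.linalg.norm(attn[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = attn[labels == j].mean(axis=0)
    return labels

attn, region = synthetic_self_attention()
labels = cluster_attention(attn, k=2)
# One boolean mask per cluster, reshaped back to the 4x4 spatial grid.
masks = [(labels == j).reshape(4, 4) for j in range(2)]
```

In this toy setting each mask covers one half of the grid. With attention maps from a real model the rows are noisier, and the number of concepts is not known in advance, so a fixed `k` is a simplification made only for this sketch.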

Abstract

While personalized text-to-image generation has enabled the learning of a single concept from multiple images, a more practical yet challenging scenario involves learning multiple concepts within a single image. However, existing works tackling this scenario heavily rely on extensive human annotations. In this paper, we introduce a novel task named Unsupervised Concept Extraction (UCE) that considers an unsupervised setting without any human knowledge of the concepts. Given an image that contains multiple concepts, the task aims to extract and recreate individual concepts solely relying on the existing knowledge from pretrained diffusion models. To achieve this, we present ConceptExpress that tackles UCE by unleashing the inherent capabilities of pretrained diffusion models in two aspects. Specifically, a concept localization approach automatically locates and disentangles salient…

Code & Models

Repositories

haoosz/conceptexpress (PyTorch, official)

Taxonomy

Topics: Image Retrieval and Classification Techniques

Methods: Diffusion