ORGAN: Object-Centric Representation Learning using Cycle Consistent Generative Adversarial Networks

Jo\"el K\"uchler; Ellen van Maren; Vaiva Vasiliauskait\.e; Katarina Vuli\'c; Reza Abbasi-Asl; Stephan J. Ihle

arXiv:2603.02063·cs.CV·March 3, 2026

ORGAN: Object-Centric Representation Learning using Cycle Consistent Generative Adversarial Networks

Jo\"el K\"uchler, Ellen van Maren, Vaiva Vasiliauskait\.e, Katarina Vuli\'c, Reza Abbasi-Asl, Stephan J. Ihle

PDF

Open Access

TL;DR

ORGAN introduces a novel object-centric representation learning method using cycle-consistent GANs, effectively handling complex real-world images with multiple objects and low contrast, and enabling expressive object manipulation.

Contribution

It is the first to apply cycle-consistent GANs to object-centric learning, outperforming autoencoder-based methods on challenging datasets and scaling efficiently.

Findings

01

Performs well on synthetic and real-world datasets

02

Handles many objects and low contrast images

03

Enables expressive object manipulation

Abstract

Although data generation is often straightforward, extracting information from data is more difficult. Object-centric representation learning can extract information from images in an unsupervised manner. It does so by segmenting an image into its subcomponents: the objects. Each object is then represented in a low-dimensional latent space that can be used for downstream processing. Object-centric representation learning is dominated by autoencoder architectures (AEs). Here, we present ORGAN, a novel approach for object-centric representation learning, which is based on cycle-consistent Generative Adversarial Networks instead. We show that it performs similarly to other state-of-the-art approaches on synthetic datasets, while at the same time being the only approach tested here capable of handling more challenging real-world datasets with many objects and low visual contrast.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Face recognition and analysis