Unsupervised Image Decomposition with Phase-Correlation Networks
Angel Villar-Corrales, Sven Behnke

TL;DR
This paper introduces PCDNet, a novel unsupervised model that decomposes scenes into object components using frequency-domain phase correlation, achieving better performance with fewer parameters and enhanced interpretability.
Contribution
The paper presents PCDNet, a new unsupervised scene decomposition model utilizing phase correlation for object representation, improving efficiency and interpretability over existing methods.
Findings
Outperforms state-of-the-art in unsupervised object discovery and segmentation.
Uses fewer parameters than comparable models.
Demonstrates effectiveness on both simple and challenging datasets.
Abstract
The ability to decompose scenes into their object components is a desired property for autonomous agents, allowing them to reason and act in their surroundings. Recently, different methods have been proposed to learn object-centric representations from data in an unsupervised manner. These methods often rely on latent representations learned by deep neural networks, hence requiring high computational costs and large amounts of curated data. Such models are also difficult to interpret. To address these challenges, we propose the Phase-Correlation Decomposition Network (PCDNet), a novel model that decomposes a scene into its object components, which are represented as transformed versions of a set of learned object prototypes. The core building block in PCDNet is the Phase-Correlation Cell (PC Cell), which exploits the frequency-domain representation of the images in order to estimate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
