Controlling Structured Output Representations from Attributes using Conditional Generative Models

Mohamed Debbagh

arXiv:2305.00980·cs.CV·February 25, 2025·5 cites

TL;DR

This paper adapts the Conditional Variational Auto-encoder (CVAE) framework for controlled image generation based on attributes, demonstrating improved attribute-specific sample generation on face and bird datasets.

Contribution

It extends the CVAE framework to enable attribute-controlled structured output generation, enhancing robustness and disentanglement in multimodal distributions.

Findings

01 Successful generation of attribute-specific faces and bird images

02 Improved sample diversity with weighted variational lower bound

03 Recreated and trained CVAE architecture on CelebA and CUB datasets
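
For orientation, the variational lower bound that a CVAE maximizes (in the form given by Sohn et al.) can be written as below. The symbol assignment is an assumption made here for illustration: x stands for the conditioning attributes, y for the image, and z for the latent variable. This listing does not say how the bound is weighted, so the scalar weight \beta on the KL term in the second line is a hypothetical choice and not necessarily the paper's scheme.

```latex
% Standard CVAE lower bound on the conditional log-likelihood
\log p_\theta(y \mid x) \;\ge\;
  \mathbb{E}_{q_\phi(z \mid x, y)}\!\left[\log p_\theta(y \mid x, z)\right]
  - \mathrm{KL}\!\left(q_\phi(z \mid x, y) \,\|\, p_\theta(z \mid x)\right)

% One possible weighted variant (assumption: scalar weight \beta on the KL term)
\mathcal{L}_\beta(x, y) =
  \mathbb{E}_{q_\phi(z \mid x, y)}\!\left[\log p_\theta(y \mid x, z)\right]
  - \beta \, \mathrm{KL}\!\left(q_\phi(z \mid x, y) \,\|\, p_\theta(z \mid x)\right)
```

With \beta = 1 the second expression reduces to the standard bound.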

Abstract

Structured output representation is a generative task in computer vision that often requires mapping low-dimensional features to high-dimensional structured outputs. Deterministic approaches such as Convolutional Neural Networks (CNNs) lose complex spatial information, which leads to uncertainty and ambiguous structures within a single output representation. Sohn et al. present a probabilistic approach through deep Conditional Generative Models (CGMs), in which a particular model known as the Conditional Variational Auto-encoder (CVAE) is introduced and explored. While the original paper focuses on image segmentation, this paper adapts the CVAE framework to the task of controlled output representation through attributes. This approach allows us to learn a disentangled multimodal prior distribution, resulting in more controlled and robust…
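
As a rough illustration of the kind of objective a CVAE trains on, the sketch below computes a negative lower bound from a reconstruction term and a KL term between a recognition distribution and an attribute-conditioned prior, both taken to be diagonal Gaussians. It is not the paper's implementation: the function names, the squared-error reconstruction term, and the `kl_weight` parameter are all assumptions introduced here, and the encoder, prior, and decoder networks are left out (their outputs are passed in as arrays).

```python
import numpy as np


def kl_diag_gaussians(mu_q, logvar_q, mu_p, logvar_p):
    """KL(q || p) between diagonal Gaussians, summed over the latent axis."""
    var_q = np.exp(logvar_q)
    var_p = np.exp(logvar_p)
    per_dim = logvar_p - logvar_q + (var_q + (mu_q - mu_p) ** 2) / var_p - 1.0
    return 0.5 * np.sum(per_dim, axis=-1)


def reparameterize(mu, logvar, rng):
    """Draw z = mu + sigma * eps with eps ~ N(0, I) (reparameterization trick)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * logvar) * eps


def weighted_negative_elbo(y, y_recon, mu_q, logvar_q, mu_p, logvar_p,
                           kl_weight=1.0):
    """Batch-averaged negative lower bound.

    Assumptions (not from the source): squared error stands in for
    -log p(y | x, z) up to constants, and kl_weight is a hypothetical
    scalar weight on the KL term.
    """
    recon = np.sum((y - y_recon) ** 2, axis=-1)
    kl = kl_diag_gaussians(mu_q, logvar_q, mu_p, logvar_p)
    return float(np.mean(recon + kl_weight * kl))


# Toy usage with made-up numbers: batch of 2, latent size 3, output size 4.
rng = np.random.default_rng(0)
mu_q = np.zeros((2, 3))
logvar_q = np.zeros((2, 3))
mu_p = np.zeros((2, 3))       # would come from a prior network given attributes
logvar_p = np.zeros((2, 3))
z = reparameterize(mu_q, logvar_q, rng)   # latent sample fed to a decoder
y = np.ones((2, 4))
y_recon = np.ones((2, 4))                 # stand-in for decoder output
loss = weighted_negative_elbo(y, y_recon, mu_q, logvar_q, mu_p, logvar_p)
```

At generation time, a model of this kind would sample z from the attribute-conditioned prior rather than from the recognition distribution, so changing the attribute vector changes which region of the latent space is sampled.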

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning

Methods: Conditional Variational Auto-encoder