Generated Contents Enrichment

Mahdi Naseri; Jiayan Qiu; Zhou Wang

arXiv:2405.03650·cs.CV·October 10, 2024

Generated Contents Enrichment

Mahdi Naseri, Jiayan Qiu, Zhou Wang

PDF

Open Access

TL;DR

This paper introduces Generated Contents Enrichment (GCE), a new AI task that explicitly enriches visual and textual content using scene graphs and adversarial learning, resulting in more detailed and coherent generated content.

Contribution

The paper proposes a novel GCE framework that models descriptions as scene graphs and uses GCNs for explicit semantic enrichment before image synthesis, advancing content generation methods.

Findings

01

Effective scene graph modeling of descriptions

02

Improved content richness and coherence in generated images

03

Promising results on Visual Genome dataset

Abstract

In this paper, we investigate a novel artificial intelligence generation task termed Generated Contents Enrichment (GCE). Conventional AI content generation produces visually realistic content by implicitly enriching the given textual description based on limited semantic descriptions. Unlike this traditional task, our proposed GCE strives to perform content enrichment explicitly in both the visual and textual domains. The goal is to generate content that is visually realistic, structurally coherent, and semantically abundant. To tackle GCE, we propose a deep end-to-end adversarial method that explicitly explores semantics and inter-semantic relationships during the enrichment process. Our approach first models the input description as a scene graph, where nodes represent objects and edges capture inter-object relationships. We then adopt Graph Convolutional Networks on top of the input…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies