Generated Contents Enrichment
Mahdi Naseri, Jiayan Qiu, Zhou Wang

TL;DR
This paper introduces Generated Contents Enrichment (GCE), a new AI task that explicitly enriches visual and textual content using scene graphs and adversarial learning, resulting in more detailed and coherent generated content.
Contribution
The paper proposes a novel GCE framework that models descriptions as scene graphs and uses GCNs for explicit semantic enrichment before image synthesis, advancing content generation methods.
Findings
Effective scene graph modeling of descriptions
Improved content richness and coherence in generated images
Promising results on Visual Genome dataset
Abstract
In this paper, we investigate a novel artificial intelligence generation task termed Generated Contents Enrichment (GCE). Conventional AI content generation produces visually realistic content by implicitly enriching the given textual description based on limited semantic descriptions. Unlike this traditional task, our proposed GCE strives to perform content enrichment explicitly in both the visual and textual domains. The goal is to generate content that is visually realistic, structurally coherent, and semantically abundant. To tackle GCE, we propose a deep end-to-end adversarial method that explicitly explores semantics and inter-semantic relationships during the enrichment process. Our approach first models the input description as a scene graph, where nodes represent objects and edges capture inter-object relationships. We then adopt Graph Convolutional Networks on top of the input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies
