Image Content Generation with Causal Reasoning
Xiaochuan Li, Baoyu Fan, Runze Zhang, Liang Jin, Di Wang, Zhenhua Guo,, Yaqian Zhao, Rengang Li

TL;DR
This paper introduces a new visual question answering with image (VQAI) task, creates a dataset based on Tom and Jerry, and proposes a novel image generation paradigm to incorporate causal reasoning in visual content creation.
Contribution
It pioneers the integration of causal reasoning into visual content generation and establishes a new dataset and paradigm for VQAI tasks.
Findings
Developed a new VQAI dataset based on Tom and Jerry
Proposed a novel image generation paradigm for causal reasoning
Conducted extensive experiments demonstrating the approach's effectiveness
Abstract
The emergence of ChatGPT has once again sparked research in generative artificial intelligence (GAI). While people have been amazed by the generated results, they have also noticed the reasoning potential reflected in the generated textual content. However, this current ability for causal reasoning is primarily limited to the domain of language generation, such as in models like GPT-3. In visual modality, there is currently no equivalent research. Considering causal reasoning in visual content generation is significant. This is because visual information contains infinite granularity. Particularly, images can provide more intuitive and specific demonstrations for certain reasoning tasks, especially when compared to coarse-grained text. Hence, we propose a new image generation task called visual question answering with image (VQAI) and establish a dataset of the same name based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Adam · {Dispute@FaQ-s}How to file a dispute with Expedia? · Weight Decay · Softmax · Residual Connection
