RefAdGen: High-Fidelity Advertising Image Generation
Yiyun Chen, Weikai Yang

TL;DR
RefAdGen is a novel AI framework that generates high-fidelity advertising images from product references and text descriptions, reducing costs and improving realism for marketing applications.
Contribution
It introduces a large-scale dataset and a decoupled generation framework with innovative modules to enhance fidelity and efficiency in advertising image synthesis.
Findings
Achieves state-of-the-art high-fidelity image generation
Maintains performance on unseen products and real-world images
Offers a scalable, cost-effective alternative to traditional marketing workflows
Abstract
The rapid advancement of Artificial Intelligence Generated Content (AIGC) techniques has unlocked opportunities in generating diverse and compelling advertising images based on referenced product images and textual scene descriptions. This capability substantially reduces human labor and production costs in traditional marketing workflows. However, existing AIGC techniques either demand extensive fine-tuning for each referenced image to achieve high fidelity, or they struggle to maintain fidelity across diverse products, making them impractical for e-commerce and marketing industries. To tackle this limitation, we first construct AdProd-100K, a large-scale advertising image generation dataset. A key innovation in its construction is our dual data augmentation strategy, which fosters robust, 3D-aware representations crucial for realistic and high-fidelity image synthesis. Leveraging this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Video Analysis and Summarization · Image Retrieval and Classification Techniques
