InnoAds-Composer: Efficient Condition Composition for E-Commerce Poster Generation

Yuxin Qin; Ke Cao; Haowei Liu; Ao Ma; Fengheng Li; Honghe Zhu; Zheng Zhang; Run Ling; Wei Feng; Xuanhua He; Zhanjie Zhang; Zhen Guo; Haoyi Bian; Jingjing Lv; Junjie Shen; Ching Law

arXiv:2603.05898·cs.CV·March 9, 2026

Open Access

TL;DR

InnoAds-Composer is a single-stage framework for efficient tri-conditional control of subject, text, and style in e-commerce poster generation. It targets the poor subject fidelity, inaccurate text, and inconsistent style of prior multi-stage pipelines.

Contribution

It introduces importance-based routing of control tokens, a Text Feature Enhancement Module for Chinese text, and a new dataset for joint condition evaluation in poster synthesis.

Findings

01. Outperforms existing methods in quality metrics

02. Maintains low inference latency

03. Provides a new dataset for comprehensive evaluation

Abstract

E-commerce product poster generation aims to automatically synthesize a single image that effectively conveys product information by presenting a subject, text, and a designed style. Recent diffusion models with fine-grained and efficient controllability have advanced product poster synthesis, yet they typically rely on multi-stage pipelines, and simultaneous control over subject, text, and style remains underexplored. Such naive multi-stage pipelines show three issues: poor subject fidelity, inaccurate text, and inconsistent style. To address these issues, we propose InnoAds-Composer, a single-stage framework that enables efficient tri-conditional control over subject, glyph, and style through control tokens. To alleviate the quadratic overhead introduced by naive tri-conditional token concatenation, we perform importance analysis over layers and timesteps and route each condition only to the…
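The quadratic overhead mentioned in the abstract can be illustrated with a small back-of-the-envelope sketch. It counts query-key pairs in full self-attention when all three condition token sets are concatenated at every layer, and compares that with a schedule in which each layer only receives some conditions. The token counts, the number of layers, and the routing schedule below are all hypothetical placeholders. The paper's actual routing (which also varies over timesteps) is not given in this excerpt, so this is only an illustration of the cost argument, not the authors' method.

```python
def attention_pairs(num_tokens: int) -> int:
    """Query-key pairs in full self-attention over num_tokens tokens."""
    return num_tokens * num_tokens


def total_cost(image_tokens, cond_tokens, schedule):
    """Sum attention pairs over layers.

    schedule[i] is the set of condition names fed to layer i
    (a hypothetical routing schedule, not the one from the paper).
    """
    total = 0
    for routed in schedule:
        n = image_tokens + sum(cond_tokens[name] for name in routed)
        total += attention_pairs(n)
    return total


# Hypothetical sizes, chosen only to make the arithmetic easy to follow.
IMAGE_TOKENS = 1024
COND_TOKENS = {"subject": 1024, "glyph": 1024, "style": 1024}
NUM_LAYERS = 4

# Naive concatenation: every layer sees all three conditions.
naive_schedule = [set(COND_TOKENS)] * NUM_LAYERS

# Made-up routed schedule: each layer sees only a subset of conditions.
routed_schedule = [
    {"subject", "glyph"},
    {"subject"},
    {"glyph", "style"},
    {"style"},
]

naive_cost = total_cost(IMAGE_TOKENS, COND_TOKENS, naive_schedule)
routed_cost = total_cost(IMAGE_TOKENS, COND_TOKENS, routed_schedule)

print(f"naive:  {naive_cost} pairs")
print(f"routed: {routed_cost} pairs ({routed_cost / naive_cost:.1%} of naive)")
```

Because the cost grows with the square of the sequence length, dropping even one condition from a layer removes more than its proportional share of the work, which is why selective routing is attractive compared with concatenating everything everywhere.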

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Computer Graphics and Visualization Techniques