Hierarchical Conditional Semi-Paired Image-to-Image Translation For Multi-Task Image Defect Correction On Shopping Websites
Moyan Li, Jinmiao Fu, Shaoyuan Xu, Huidong Liu, Jia Liu, Bryan Wang

TL;DR
This paper introduces a hierarchical, semi-paired image-to-image translation model that effectively corrects multiple product image defects across various types on shopping websites, improving image quality and user experience.
Contribution
The proposed unified model leverages an attention mechanism for hierarchical defect correction and combines paired and unpaired data training, addressing scalability and data quality challenges.
Findings
Reduces FID by 24.6% on eight datasets compared to MoNCE.
Achieves 63.2% FID reduction on a shopping website dataset over WS-I2I.
Effectively corrects multiple defect types across diverse product categories.
Abstract
On shopping websites, product images of low quality negatively affect customer experience. Although there are plenty of work in detecting images with different defects, few efforts have been dedicated to correct those defects at scale. A major challenge is that there are thousands of product types and each has specific defects, therefore building defect specific models is unscalable. In this paper, we propose a unified Image-to-Image (I2I) translation model to correct multiple defects across different product types. Our model leverages an attention mechanism to hierarchically incorporate high-level defect groups and specific defect types to guide the network to focus on defect-related image regions. Evaluated on eight public datasets, our model reduces the Frechet Inception Distance (FID) by 24.6% in average compared with MoNCE, the state-of-the-art I2I method. Unlike public data,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Advanced Image and Video Retrieval Techniques
MethodsFocus
