Hierarchical Conditional Semi-Paired Image-to-Image Translation For   Multi-Task Image Defect Correction On Shopping Websites

Moyan Li; Jinmiao Fu; Shaoyuan Xu; Huidong Liu; Jia Liu; Bryan Wang

arXiv:2309.05883·cs.CV·September 13, 2023

Hierarchical Conditional Semi-Paired Image-to-Image Translation For Multi-Task Image Defect Correction On Shopping Websites

Moyan Li, Jinmiao Fu, Shaoyuan Xu, Huidong Liu, Jia Liu, Bryan Wang

PDF

Open Access

TL;DR

This paper introduces a hierarchical, semi-paired image-to-image translation model that effectively corrects multiple product image defects across various types on shopping websites, improving image quality and user experience.

Contribution

The proposed unified model leverages an attention mechanism for hierarchical defect correction and combines paired and unpaired data training, addressing scalability and data quality challenges.

Findings

01

Reduces FID by 24.6% on eight datasets compared to MoNCE.

02

Achieves 63.2% FID reduction on a shopping website dataset over WS-I2I.

03

Effectively corrects multiple defect types across diverse product categories.

Abstract

On shopping websites, product images of low quality negatively affect customer experience. Although there are plenty of work in detecting images with different defects, few efforts have been dedicated to correct those defects at scale. A major challenge is that there are thousands of product types and each has specific defects, therefore building defect specific models is unscalable. In this paper, we propose a unified Image-to-Image (I2I) translation model to correct multiple defects across different product types. Our model leverages an attention mechanism to hierarchically incorporate high-level defect groups and specific defect types to guide the network to focus on defect-related image regions. Evaluated on eight public datasets, our model reduces the Frechet Inception Distance (FID) by 24.6% in average compared with MoNCE, the state-of-the-art I2I method. Unlike public data,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Advanced Image and Video Retrieval Techniques

MethodsFocus