A Novel BiLevel Paradigm for Image-to-Image Translation
Liqian Ma, Qianru Sun, Bernt Schiele, Luc Van Gool

TL;DR
This paper introduces a BiLevel learning paradigm for image-to-image translation that enhances scene-specific adaptation and image quality by alternating between instance-specific and general-purpose models, especially with limited data.
Contribution
The novel BiLevel paradigm enables efficient scene-specific adaptation in I2I translation by combining instance-specific and general-purpose learning, improving quality and diversity.
Findings
Significantly improves classical I2I translation models like PG2 and Pix2Pix.
Achieves higher image quality and more accurate scene details.
Enables fast adaptation with scarce training data.
Abstract
Image-to-image (I2I) translation is a pixel-level mapping that requires a large number of paired training data and often suffers from the problems of high diversity and strong category bias in image scenes. In order to tackle these problems, we propose a novel BiLevel (BiL) learning paradigm that alternates the learning of two models, respectively at an instance-specific (IS) and a general-purpose (GP) level. In each scene, the IS model learns to maintain the specific scene attributes. It is initialized by the GP model that learns from all the scenes to obtain the generalizable translation knowledge. This GP initialization gives the IS model an efficient starting point, thus enabling its fast adaptation to the new scene with scarce training data. We conduct extensive I2I translation experiments on human face and street view datasets. Quantitative results validate that our approach can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
