SemiNFT: Learning to Transfer Presets from Imitation to Appreciation via Hybrid-Sample Reinforcement Learning
Melany Yang, Yuhang Yu, Diwang Weng, Jinwei Chen, Wei Dong

TL;DR
SemiNFT is a diffusion transformer-based framework that learns to transfer color presets by combining imitation learning with reinforcement learning, resulting in improved aesthetic understanding and zero-shot colorization capabilities.
Contribution
The paper introduces SemiNFT, a novel hybrid learning approach that combines paired training and reinforcement learning with a hybrid reward mechanism for advanced preset transfer.
Findings
Outperforms state-of-the-art preset transfer methods
Demonstrates zero-shot colorization and cross-domain transfer
Achieves sophisticated aesthetic comprehension
Abstract
Photorealistic color retouching plays a vital role in visual content creation, yet manual retouching remains inaccessible to non-experts due to its reliance on specialized expertise. Reference-based methods offer a promising alternative by transferring the preset color of a reference image to a source image. However, these approaches often operate as novice learners, performing global color mappings derived from pixel-level statistics, without a true understanding of semantic context or human aesthetics. To address this issue, we propose SemiNFT, a Diffusion Transformer (DiT)-based retouching framework that mirrors the trajectory of human artistic training: beginning with rigid imitation and evolving into intuitive creation. Specifically, SemiNFT is first taught with paired triplets to acquire basic structural preservation and color mapping skills, and then advanced to reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Image Enhancement Techniques · Generative Adversarial Networks and Image Synthesis
