SemiNFT: Learning to Transfer Presets from Imitation to Appreciation via Hybrid-Sample Reinforcement Learning

Melany Yang; Yuhang Yu; Diwang Weng; Jinwei Chen; Wei Dong

arXiv:2602.08582·cs.CV·February 10, 2026

SemiNFT: Learning to Transfer Presets from Imitation to Appreciation via Hybrid-Sample Reinforcement Learning

Melany Yang, Yuhang Yu, Diwang Weng, Jinwei Chen, Wei Dong

PDF

Open Access

TL;DR

SemiNFT is a diffusion transformer-based framework that learns to transfer color presets by combining imitation learning with reinforcement learning, resulting in improved aesthetic understanding and zero-shot colorization capabilities.

Contribution

The paper introduces SemiNFT, a novel hybrid learning approach that combines paired training and reinforcement learning with a hybrid reward mechanism for advanced preset transfer.

Findings

01

Outperforms state-of-the-art preset transfer methods

02

Demonstrates zero-shot colorization and cross-domain transfer

03

Achieves sophisticated aesthetic comprehension

Abstract

Photorealistic color retouching plays a vital role in visual content creation, yet manual retouching remains inaccessible to non-experts due to its reliance on specialized expertise. Reference-based methods offer a promising alternative by transferring the preset color of a reference image to a source image. However, these approaches often operate as novice learners, performing global color mappings derived from pixel-level statistics, without a true understanding of semantic context or human aesthetics. To address this issue, we propose SemiNFT, a Diffusion Transformer (DiT)-based retouching framework that mirrors the trajectory of human artistic training: beginning with rigid imitation and evolving into intuitive creation. Specifically, SemiNFT is first taught with paired triplets to acquire basic structural preservation and color mapping skills, and then advanced to reinforcement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Image Enhancement Techniques · Generative Adversarial Networks and Image Synthesis