NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image Translation

Serkan Ozturk; Samet Hicsonmez; Pinar Duygulu

arXiv:2511.01517·cs.CV·November 4, 2025

NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image Translation

Serkan Ozturk, Samet Hicsonmez, Pinar Duygulu

PDF

Open Access

TL;DR

NSYNC introduces a contrastive learning framework using negative synthetic images to enhance style capture in text-to-image models, significantly improving stylization quality.

Contribution

The paper proposes a novel contrastive training scheme with negative synthetic data to better capture styles in text-to-image diffusion models.

Findings

01

Improved stylization performance over baseline methods.

02

Effective suppression of trivial style attributes.

03

Quantitative and qualitative enhancements in style transfer.

Abstract

Current text conditioned image generation methods output realistic looking images, but they fail to capture specific styles. Simply finetuning them on the target style datasets still struggles to grasp the style features. In this work, we present a novel contrastive learning framework to improve the stylization capability of large text-to-image diffusion models. Motivated by the astonishing advance in image generation models that makes synthetic data an intrinsic part of model training in various computer vision tasks, we exploit synthetic image generation in our approach. Usually, the generated synthetic data is dependent on the task, and most of the time it is used to enlarge the available real training dataset. With NSYNC, alternatively, we focus on generating negative synthetic sets to be used in a novel contrastive training scheme along with real positive images. In our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Face recognition and analysis