Style-Editor: Text-driven object-centric style editing

Jihun Park; Jongmin Gim; Kyoungmin Lee; Seunghun Lee; Sunghoon Im

arXiv:2408.08461·cs.CV·April 9, 2025

Style-Editor: Text-driven object-centric style editing

Jihun Park, Jongmin Gim, Kyoungmin Lee, Seunghun Lee, Sunghoon Im

PDF

Open Access

TL;DR

Style-Editor is a novel text-driven, object-centric style editing method that uses a new loss function and modules to precisely modify object styles based on textual input without needing segmentation masks.

Contribution

The paper introduces Style-Editor, a new approach for object-centric style editing guided by text, featuring the PCD loss, TMPS, PRS, and ABP loss for improved accuracy and coherence.

Findings

01

Effective text-guided style editing demonstrated through extensive experiments.

02

Achieves precise object-centric modifications aligned with textual descriptions.

03

Maintains background style and structure while editing objects.

Abstract

We present Text-driven object-centric style editing model named Style-Editor, a novel method that guides style editing at an object-centric level using textual inputs. The core of Style-Editor is our Patch-wise Co-Directional (PCD) loss, meticulously designed for precise object-centric editing that are closely aligned with the input text. This loss combines a patch directional loss for text-guided style direction and a patch distribution consistency loss for even CLIP embedding distribution across object regions. It ensures a seamless and harmonious style editing across object regions. Key to our method are the Text-Matched Patch Selection (TMPS) and Pre-fixed Region Selection (PRS) modules for identifying object locations via text, eliminating the need for segmentation masks. Lastly, we introduce an Adaptive Background Preservation (ABP) loss to maintain the original style and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsContrastive Language-Image Pre-training