Let Triggers Control: Frequency-Aware Dropout for Effective Token Control

Junyoung Koh; Hoyeon Moon; Dongha Kim; Seungmin Lee; Sanghyun Park; and Min Song

arXiv:2603.27199·cs.CV·March 31, 2026

Let Triggers Control: Frequency-Aware Dropout for Effective Token Control

Junyoung Koh, Hoyeon Moon, Dongha Kim, Seungmin Lee, Sanghyun Park, and Min Song

PDF

TL;DR

The paper introduces Frequency-Aware Dropout (FAD), a novel regularization technique that enhances trigger token controllability in text-to-image models by reducing co-occurrence entanglement without extra parameters.

Contribution

FAD is a simple, parameter-free dropout method that improves prompt controllability and personalization in diffusion models through co-occurrence analysis and curriculum-inspired scheduling.

Findings

01

FAD improves prompt fidelity and stylistic precision.

02

FAD enhances user-perceived quality in generated images.

03

FAD achieves these gains without additional parameters or architectural changes.

Abstract

Text-to-image models such as Stable Diffusion have achieved unprecedented levels of high-fidelity visual synthesis. As these models advance, personalization of generative models -- commonly facilitated through Low-Rank Adaptation (LoRA) with a dedicated trigger token -- has become a significant area of research. Previous works have naively assumed that fine-tuning with a single trigger token to represent new concepts. However, this often results in poor controllability, where the trigger token alone fails to reliably evoke the intended concept. We attribute this issue to the frequent co-occurrence of the trigger token with the surrounding context during fine-tuning, which entangles their representations and compromises the token's semantic distinctiveness. To disentangle this, we propose Frequency-Aware Dropout (FAD) -- a novel regularization technique that improves prompt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.