Moderating the Generalization of Score-based Generative Model
Wan Jiang, He Wang, Xin Zhang, Dan Guo, Zhaoxin Fan, Yunfeng Diao, Richang Hong

TL;DR
This paper introduces MSGM, a novel score adjustment method for score-based generative models that reduces undesirable content generation without sacrificing image quality, addressing the limitations of traditional machine unlearning methods.
Contribution
The paper proposes the first moderated score-based generative model (MSGM) with a new score adjustment strategy, enhancing control over generated content in SGMs.
Findings
MSGM significantly reduces undesirable content generation.
MSGM maintains high visual quality in normal image generation.
MSGM is compatible with various diffusion architectures and training strategies.
Abstract
Score-based Generative Models (SGMs) have demonstrated remarkable generalization abilities, e.g. generating unseen, but natural data. However, the greater the generalization power, the more likely the unintended generalization, and the more dangerous the abuse. Research on moderated generalization in SGMs remains limited. To fill this gap, we first examine the current 'gold standard' in Machine Unlearning (MU), i.e., re-training the model after removing the undesirable training data, and find it does not work in SGMs. Further analysis of score functions reveals that the MU 'gold standard' does not alter the original score function, which explains its ineffectiveness. Based on this insight, we propose the first Moderated Score-based Generative Model (MSGM), which introduces a novel score adjustment strategy that redirects the score function away from undesirable data during the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health Research Topics · Advanced Statistical Modeling Techniques · Cognitive Science and Mapping
MethodsDiffusion · Inpainting
