PoseGuard: Pose-Guided Generation with Safety Guardrails
Kongxin Wang, Jie Zhang, Peigui Qi, Kunsheng Tang, Tianwei Zhang, Wenbo Zhou

TL;DR
PoseGuard introduces a safety framework for pose-guided video generation that suppresses unsafe outputs while preserving high quality for benign inputs, addressing risks like impersonation and NSFW content.
Contribution
It proposes a dual-objective training strategy with pose-specific LoRA fusion for efficient safety alignment in pose-guided generation.
Findings
Effectively blocks unsafe pose-guided generations
Maintains high fidelity for benign poses
Robust against slight pose variations
Abstract
Pose-guided video generation has become a powerful tool in creative industries, exemplified by frameworks like Animate Anyone. However, conditioning generation on specific poses introduces serious risks, such as impersonation, privacy violations, and NSFW content creation. To address these challenges, we propose , a safety alignment framework for pose-guided generation. PoseGuard is designed to suppress unsafe generations by degrading output quality when encountering malicious poses, while maintaining high-fidelity outputs for benign inputs. We categorize unsafe poses into three representative types: discriminatory gestures such as kneeling or offensive salutes, sexually suggestive poses that lead to NSFW content, and poses imitating copyrighted celebrity movements. PoseGuard employs a dual-objective training strategy combining generation fidelity with safety…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Social Robot Interaction and HRI · Face recognition and analysis
