MotionSwap
Om Patil, Jinesh Modi, Suryabha Mukhopadhyay, Meghaditya Giri, Chhavi Malhotra

TL;DR
This paper improves the SimSwap face swapping framework by integrating attention mechanisms, dynamic loss weighting, and learning rate scheduling, resulting in higher fidelity, identity preservation, and visual quality.
Contribution
The paper introduces architectural and training enhancements to SimSwap, achieving significant performance improvements over the baseline.
Findings
Better identity similarity and attribute consistency
Lower FID scores and improved visual quality
Ablation studies confirming the effectiveness of each enhancement
Abstract
Face swapping technology has gained significant attention in both academic research and commercial applications. This paper presents our implementation and enhancement of SimSwap, an efficient framework for high fidelity face swapping. We introduce several improvements to the original model, including the integration of self and cross-attention mechanisms in the generator architecture, dynamic loss weighting, and cosine annealing learning rate scheduling. These enhancements lead to significant improvements in identity preservation, attribute consistency, and overall visual quality. Our experimental results, spanning 400,000 training iterations, demonstrate progressive improvements in generator and discriminator performance. The enhanced model achieves better identity similarity, lower FID scores, and visibly superior qualitative results compared to the baseline. Ablation studies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · 3D Modeling in Geospatial Applications · 3D Shape Modeling and Analysis
