Cosine-Similarity Methods for Efficient Training and Sampling in High-Dimensional Latent Spaces
Xu Duan, Dongmei Chen

TL;DR
This paper introduces a cosine-similarity-based approach to enhance training and sampling in high-dimensional latent spaces, leading to faster convergence and higher quality image generation.
Contribution
It proposes a novel cosine similarity mechanism for better coupling in latent spaces, improving training efficiency and sample quality in generative models.
Findings
Reduced FID from 11.99 to 8.60 with fine-tuning strategies
Achieved FID of 3.30 in one epoch matching 80-epoch baseline
Accelerated convergence and improved sample fidelity
Abstract
Latent generative models are increasingly shifting from traditional VAEs toward representation autoencoders and semantically aligned latent spaces, which lift images into higher-dimensional feature domains where semantic factors become more separable. Yet these spaces also contain geometric regularities that existing methods do not fully exploit--particularly in the directional relationships between features. We introduce a cosine-similarity-based mechanism that improves both training and sampling by selecting couplings that produce cleaner, less entangled velocity fields. This simple alignment reduces gradient noise, accelerates convergence, and improves sample fidelity. Building on this idea, we develop cosine-similarity-based fine-tuning and time-scheduling strategies that reduce the FID of an 800-epoch RAE from 11.99 to 8.60. Furthermore, by formulating an optimal-transport…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Face recognition and analysis
