Cosine-Similarity Methods for Efficient Training and Sampling in High-Dimensional Latent Spaces

Xu Duan; Dongmei Chen

arXiv:2512.00684·math.OC·December 2, 2025

Cosine-Similarity Methods for Efficient Training and Sampling in High-Dimensional Latent Spaces

Xu Duan, Dongmei Chen

PDF

Open Access

TL;DR

This paper introduces a cosine-similarity-based approach to enhance training and sampling in high-dimensional latent spaces, leading to faster convergence and higher quality image generation.

Contribution

It proposes a novel cosine similarity mechanism for better coupling in latent spaces, improving training efficiency and sample quality in generative models.

Findings

01

Reduced FID from 11.99 to 8.60 with fine-tuning strategies

02

Achieved FID of 3.30 in one epoch matching 80-epoch baseline

03

Accelerated convergence and improved sample fidelity

Abstract

Latent generative models are increasingly shifting from traditional VAEs toward representation autoencoders and semantically aligned latent spaces, which lift images into higher-dimensional feature domains where semantic factors become more separable. Yet these spaces also contain geometric regularities that existing methods do not fully exploit--particularly in the directional relationships between features. We introduce a cosine-similarity-based mechanism that improves both training and sampling by selecting couplings that produce cleaner, less entangled velocity fields. This simple alignment reduces gradient noise, accelerates convergence, and improves sample fidelity. Building on this idea, we develop cosine-similarity-based fine-tuning and time-scheduling strategies that reduce the FID of an 800-epoch RAE from 11.99 to 8.60. Furthermore, by formulating an optimal-transport…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Face recognition and analysis