Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN

Shivangi Nigam; Adarsh Prasad Behera; Shekhar Verma; and P. Nagabhushan

arXiv:2508.03415·cs.CV·August 6, 2025

Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN

Shivangi Nigam, Adarsh Prasad Behera, Shekhar Verma, and P. Nagabhushan

PDF

TL;DR

This paper introduces Fd-CycleGAN, an advanced image translation framework that leverages frequency-aware supervision and local neighborhood encoding to improve the quality, diversity, and efficiency of image-to-image translation tasks.

Contribution

It proposes a novel frequency-guided latent learning approach integrated into CycleGAN, enhancing distribution alignment and translation quality in low-data regimes.

Findings

01

Superior perceptual quality over baseline methods

02

Faster convergence and improved mode diversity

03

Effective in low-data and diverse dataset scenarios

Abstract

This paper presents Fd-CycleGAN, an image-to-image (I2I) translation framework that enhances latent representation learning to approximate real data distributions. Building upon the foundation of CycleGAN, our approach integrates Local Neighborhood Encoding (LNE) and frequency-aware supervision to capture fine-grained local pixel semantics while preserving structural coherence from the source domain. We employ distribution-based loss metrics, including KL/JS divergence and log-based similarity measures, to explicitly quantify the alignment between real and generated image distributions in both spatial and frequency domains. To validate the efficacy of Fd-CycleGAN, we conduct experiments on diverse datasets -- Horse2Zebra, Monet2Photo, and a synthetically augmented Strike-off dataset. Compared to baseline CycleGAN and other state-of-the-art methods, our approach demonstrates superior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.