Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN
Shivangi Nigam, Adarsh Prasad Behera, Shekhar Verma, and P. Nagabhushan

TL;DR
This paper introduces Fd-CycleGAN, an advanced image translation framework that leverages frequency-aware supervision and local neighborhood encoding to improve the quality, diversity, and efficiency of image-to-image translation tasks.
Contribution
It proposes a novel frequency-guided latent learning approach integrated into CycleGAN, enhancing distribution alignment and translation quality in low-data regimes.
Findings
Superior perceptual quality over baseline methods
Faster convergence and improved mode diversity
Effective in low-data and diverse dataset scenarios
Abstract
This paper presents Fd-CycleGAN, an image-to-image (I2I) translation framework that enhances latent representation learning to approximate real data distributions. Building upon the foundation of CycleGAN, our approach integrates Local Neighborhood Encoding (LNE) and frequency-aware supervision to capture fine-grained local pixel semantics while preserving structural coherence from the source domain. We employ distribution-based loss metrics, including KL/JS divergence and log-based similarity measures, to explicitly quantify the alignment between real and generated image distributions in both spatial and frequency domains. To validate the efficacy of Fd-CycleGAN, we conduct experiments on diverse datasets -- Horse2Zebra, Monet2Photo, and a synthetically augmented Strike-off dataset. Compared to baseline CycleGAN and other state-of-the-art methods, our approach demonstrates superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
