Towards Identifiable Unsupervised Domain Translation: A Diversified Distribution Matching Approach

Sagar Shrestha; Xiao Fu

arXiv:2401.09671·cs.LG·August 26, 2025·1 cites

Towards Identifiable Unsupervised Domain Translation: A Diversified Distribution Matching Approach

Sagar Shrestha, Xiao Fu

PDF

Open Access 3 Reviews

TL;DR

This paper addresses the challenge of identifiability in unsupervised domain translation by proposing a diversified distribution matching approach that eliminates measure-preserving automorphisms, leading to more accurate and consistent translations.

Contribution

It introduces an MPA elimination theory and a novel UDT framework that ensures translation identifiability through auxiliary variable-induced distribution matching.

Findings

01

Theoretically proves the non-existence of MPA under certain conditions.

02

Proposes a diversified distribution matching method for UDT.

03

Experimental results support the theoretical claims.

Abstract

Unsupervised domain translation (UDT) aims to find functions that convert samples from one domain (e.g., sketches) to another domain (e.g., photos) without changing the high-level semantic meaning (also referred to as ``content''). The translation functions are often sought by probability distribution matching of the transformed source domain and target domain. CycleGAN stands as arguably the most representative approach among this line of work. However, it was noticed in the literature that CycleGAN and variants could fail to identify the desired translation functions and produce content-misaligned translations. This limitation arises due to the presence of multiple translation functions -- referred to as ``measure-preserving automorphism" (MPA) -- in the solution space of the learning criteria. Despite awareness of such identifiability issues, solutions have remained elusive. This…

Peer Reviews

Decision·ICLR 2024 poster

Reviewer 01Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

The writing of the paper is good with detailed exposition of the problem. It also includes detailed notes on related literature. The paper produces promising qualitative and quantitative results based on experiments. It also gives ample ablation and suggestions on the architecture and parameters involved. The brief declaration of limitations is appreciated.

Weaknesses

The theory under quite strong assumptions tends to be straightforward and does not fully complement what the paper set out to achieve. It revolves around a particular model and due to certain vague notions becomes somewhat vacuous. The empirical results give the paper strength which the theory fails to support. In my opinion, the experiments should be prioritized.

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 4

Strengths

1. It is innovative to introduce auxiliary variables for tackling the MPA issue. I'd like to offer more insights on this approach. Essentially, **supervised domain translation can be seen as a specific instance of their method.** By choosing a specific auxiliary variable, we can tailor each conditional distribution $p(x|u=u_i)$ to hold precisely one sample, $x_i$, with a probability of 1, whereas all other samples in the space have a zero probability. Similarly, we manipulate each conditional di

Weaknesses

Overall, the structure of the theory is clear. However, there are several mistakes that should be corrected: 1. The MPA of the PDF of a gaussian distribution $N(\mu, \sigma)$ should be $h(x) = 2\mu - x$, rather than $h(x) = \mu - x$. 2. Within the "Notation" section of the introduction, "A" ought to be a subset of "Y", not "X". Honestly, it is impractical to check every detail of the proof. The author should ensure the proof's rigor and review it meticulously. Additional suggestion: Assumption

Reviewer 03Rating 8· accept, good paperConfidence 4

Strengths

1. The paper is well written and easy to read. 2. The suggested elimination idea is well-motivated and simple to implement. 3. While *Theorem 1* operates under idealized assumptions, *Theorem 2* makes an attempt to show that the author's proposal is robust under more realistic circumstances. 4. The UDT tasks they experiment on seem challenging enough to be interesting.

Weaknesses

The abstract sounds very specialistic to me. I think the paper might be of interest to a broader audience, but some readers unfamiliar with the jargon might be put off by the abstract.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Residual Connection · Residual Block · GAN Least Squares Loss · Convolution · Sigmoid Activation · Cycle Consistency Loss · Instance Normalization · Tanh Activation