Physics-informed Guided Disentanglement in Generative Networks
Fabio Pizzati, Pietro Cerri, Raoul de Charette

TL;DR
This paper introduces a physics-informed framework for disentangling visual traits in image translation networks, improving controllability and quality by leveraging physical models or neural guidance.
Contribution
It presents a novel, versatile approach to disentanglement using physics models or neural networks, enhancing image translation performance in challenging scenarios.
Findings
Significant qualitative improvements in image translation quality.
Quantitative performance gains demonstrated across multiple scenarios.
Effective disentanglement achieved with physics-guided and neural-guided strategies.
Abstract
Image-to-image translation (i2i) networks suffer from entanglement effects in presence of physics-related phenomena in target domain (such as occlusions, fog, etc), lowering altogether the translation quality, controllability and variability. In this paper, we propose a general framework to disentangle visual traits in target images. Primarily, we build upon collection of simple physics models, guiding the disentanglement with a physical model that renders some of the target traits, and learning the remaining ones. Because physics allows explicit and interpretable outputs, our physical models (optimally regressed on target) allows generating unseen scenarios in a controllable manner. Secondarily, we show the versatility of our framework to neural-guided disentanglement where a generative network is used in place of a physical model in case the latter is not directly accessible.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection
