Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields
Tom Fischer, Pascal Peter, Joachim Weickert, Eddy Ilg

TL;DR
This paper introduces a hybrid model combining PDE-based methods with neural networks for optical flow inpainting, achieving state-of-the-art results with improved accuracy and robustness over existing approaches.
Contribution
The paper presents a novel joint architecture that integrates explicit PDE models with CNNs, enhancing optical flow inpainting performance and generalization.
Findings
Outperforms explicit and data-driven baselines in reconstruction quality
Achieves 11-27% lower endpoint error than explicit methods
Sets new state-of-the-art results for optical flow inpainting from random masks
Abstract
Deep learning has revolutionized the field of computer vision by introducing large scale neural networks with millions of parameters. Training these networks requires massive datasets and leads to intransparent models that can fail to generalize. At the other extreme, models designed from partial differential equations (PDEs) embed specialized domain knowledge into mathematical equations and usually rely on few manually chosen hyperparameters. This makes them transparent by construction and if designed and calibrated carefully, they can generalize well to unseen scenarios. In this paper, we show how to bring model- and data-driven approaches together by combining the explicit PDE-based approaches with convolutional neural networks to obtain the best of both worlds. We illustrate a joint architecture for the task of inpainting optical flow fields and show that the combination of model-…
Peer Reviews
Decision·ICML 2024 Poster
1. The paper successfully combines the strengths of explicit PDE-based models and CNNs, leveraging the interpretability and generalization capabilities of the former and the learning power of the latter. This integration provides an effective architecture for inpainting optical flow fields. 2. The proposed model achieves superior results compared to both explicit and data-driven baselines. The evaluation demonstrates higher reconstruction quality, robustness, and generalization capabilities, ma
1. Although the paper compares the proposed model with explicit and data-driven baselines, it would be beneficial to include a comparison with other recent state-of-the-art methods in inpainting for optical flow fields. This would provide a more comprehensive evaluation and enhance the paper's contribution. 2. The paper assumes prior knowledge of diffusion processes and their application in inpainting. I wonder why diffusion-based inpainting is suitable for flow inpainting? Are there any theore
The particular combination of learned and explicit diffusion computation is novel. The metrics demonstrate accuracy superior to the baselines. The ablation study in Section 4.3 is informative.
The approach has only been demonstrated on one niche application -- optical flow. The paper does mention sparse mask inpainting of images several times, which could be another use case to strengthen the paper. More results would be appreciated too, perhaps on some real-world datasets such as KITTI.
- Good clarity The paper includes sufficient details for understanding the main methods (equations, network architecture details, and implementation details). This helps the reproduction of the method. - Better accuracy over baselines The paper compares its method with several baselines (FlowNetS, WGAIN, EED, and PD) and achieves better accuracy than them.
- Limited evaluation The paper evaluates the method only on one synthetic dataset, Sintel. To ensure the method also works on real-world domains, it would be great to evaluate the method on other datasets such as KITTI, Middlebury, etc. Furthermore, the paper doesn't compare with any previous optical flow inpainting methods (eg., Raad et al, "On Anisotropic Optical Flow Inpainting Algorithms"). Achieving better accuracy than baselines is great, but a comparison with previous work would be als
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Cell Image Analysis Techniques
MethodsInpainting · Diffusion
