DDCNet-Multires: Effective Receptive Field Guided Multiresolution CNN for Dense Prediction
Ali Salehi, Madhusudhanan Balasubramanian

TL;DR
This paper introduces DDCNet-Multires, a compact multiresolution CNN guided by effective receptive field strategies, improving dense optical flow estimation in scenes with large displacements and heterogeneous motion.
Contribution
It extends previous DDCNet designs by cascading sub-nets with decreasing ERF extents to handle complex motion dynamics without adding specialized layers.
Findings
Outperforms previous DDCNet variants in accuracy.
Achieves comparable results to lightweight methods on benchmark datasets.
Remains compact and efficient for dense optical flow tasks.
Abstract
Dense optical flow estimation is challenging when there are large displacements in a scene with heterogeneous motion dynamics, occlusion, and scene homogeneity. Traditional approaches to handle these challenges include hierarchical and multiresolution processing methods. Learning-based optical flow methods typically use a multiresolution approach with image warping when a broad range of flow velocities and heterogeneous motion is present. Accuracy of such coarse-to-fine methods is affected by the ghosting artifacts when images are warped across multiple resolutions and by the vanishing problem in smaller scene extents with higher motion contrast. Previously, we devised strategies for building compact dense prediction networks guided by the effective receptive field (ERF) characteristics of the network (DDCNet). The DDCNet design was intentionally simple and compact allowing it to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Glaucoma and retinal disorders
