Stereo Object Matching Network
Jaesung Choe, Kyungdon Joo, Francois Rameau, In So Kweon

TL;DR
This paper introduces a stereo object matching method that integrates 2D image context and 3D object information within the cost volume, improving depth accuracy especially near object boundaries.
Contribution
The paper proposes two novel strategies, RoISelect and fusion-by-occupancy, to incorporate 3D objectness into the cost volume for enhanced stereo matching.
Findings
Achieves competitive depth estimation on KITTI dataset.
Improves boundary accuracy in depth maps.
Effectively integrates 3D object information into stereo matching.
Abstract
This paper presents a stereo object matching method that exploits both 2D contextual information from images as well as 3D object-level information. Unlike existing stereo matching methods that exclusively focus on the pixel-level correspondence between stereo images within a volumetric space (i.e., cost volume), we exploit this volumetric structure in a different manner. The cost volume explicitly encompasses 3D information along its disparity axis, therefore it is a privileged structure that can encapsulate the 3D contextual information from objects. However, it is not straightforward since the disparity values map the 3D metric space in a non-linear fashion. Thus, we present two novel strategies to handle 3D objectness in the cost volume space: selective sampling (RoISelect) and 2D-3D fusion (fusion-by-occupancy), which allow us to seamlessly incorporate 3D object-level information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Image Enhancement Techniques
