TL;DR
This paper introduces separable convolutions to optimize 3D stereo networks, significantly reducing computational costs and parameters while often improving performance, making them more practical for real-world applications.
Contribution
Proposes plug-&-run separable convolutions that drastically cut operations and parameters in 3D stereo networks without sacrificing accuracy.
Findings
Up to 94% of network operations are due to 3D convolutions.
Achieves up to 7x reduction in operations and 3.5x in parameters.
Performance is maintained or improved with the new convolutions.
Abstract
Deep learning based 3D stereo networks give superior performance compared to 2D networks and conventional stereo methods. However, this improvement in the performance comes at the cost of increased computational complexity, thus making these networks non-practical for the real-world applications. Specifically, these networks use 3D convolutions as a major work horse to refine and regress disparities. In this work first, we show that these 3D convolutions in stereo networks consume up to 94% of overall network operations and act as a major bottleneck. Next, we propose a set of "plug-&-run" separable convolutions to reduce the number of parameters and operations. When integrated with the existing state of the art stereo networks, these convolutions lead up to 7x reduction in number of operations and up to 3.5x reduction in parameters without compromising their performance. In fact these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
