Optical Flow Estimation using a Spatial Pyramid Network
Anurag Ranjan, Michael J. Black

TL;DR
This paper introduces SPyNet, a deep learning-based optical flow estimation method that combines classical pyramid techniques with neural networks, resulting in a simpler, more efficient, and more accurate approach than previous deep learning models like FlowNet.
Contribution
The paper presents a novel spatial pyramid network for optical flow that simplifies the model, reduces parameters by 96%, and improves accuracy over existing deep learning methods.
Findings
SPyNet outperforms FlowNet on standard benchmarks.
The model is significantly smaller and more efficient.
Learned filters resemble classical spatio-temporal filters.
Abstract
We learn to compute optical flow by combining a classical spatial-pyramid formulation with deep learning. This estimates large motions in a coarse-to-fine approach by warping one image of a pair at each pyramid level by the current flow estimate and computing an update to the flow. Instead of the standard minimization of an objective function at each pyramid level, we train one deep network per level to compute the flow update. Unlike the recent FlowNet approach, the networks do not need to deal with large motions; these are dealt with by the pyramid. This has several advantages. First, our Spatial Pyramid Network (SPyNet) is much simpler and 96% smaller than FlowNet in terms of model parameters. This makes it more efficient and appropriate for embedded applications. Second, since the flow at each pyramid level is small (< 1 pixel), a convolutional approach applied to pairs of warped…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques
MethodsConvolution
