TransFlow: Unsupervised Motion Flow by Joint Geometric and Pixel-level Estimation
Stefano Alletto, Davide Abati, Simone Calderara, Rita Cucchiara, Luca, Rigazio

TL;DR
TransFlow is an unsupervised deep learning model that estimates optical flow in ego-centric videos by combining geometric priors with pixel-level refinement, achieving superior generalization and lower error on unseen data.
Contribution
It introduces a two-stage deep architecture that integrates geometric constraints with pixel-level refinement for unsupervised optical flow estimation in driving scenes.
Findings
Outperforms other unsupervised algorithms in optical flow estimation.
Shows better generalization than supervised methods on unseen data.
Achieves a threefold reduction in error on new datasets.
Abstract
We address unsupervised optical flow estimation for ego-centric motion. We argue that optical flow can be cast as a geometrical warping between two successive video frames and devise a deep architecture to estimate such transformation in two stages. First, a dense pixel-level flow is computed with a geometric prior imposing strong spatial constraints. Such prior is typical of driving scenes, where the point of view is coherent with the vehicle motion. We show how such global transformation can be approximated with an homography and how spatial transformer layers can be employed to compute the flow field implied by such transformation. The second stage then refines the prediction feeding a second deeper network. A final reconstruction loss compares the warping of frame X(t) with the subsequent frame X(t+1) and guides both estimates. The model, which we named TransFlow, performs favorably…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Advanced Neural Network Applications
MethodsSpatial Transformer
