Disentangling Architecture and Training for Optical Flow
Deqing Sun, Charles Herrmann, Fitsum Reda, Michael Rubinstein, David, Fleet, William T. Freeman

TL;DR
This paper revisits existing optical flow models with modern training techniques and datasets, revealing significant performance improvements and emphasizing the importance of training details and datasets for generalization.
Contribution
It demonstrates that applying modern training methods to existing models substantially improves their performance and generalization, without developing new architectures.
Findings
PWC-Net and IRR-PWC improved by up to 30% on benchmarks
New RAFT achieves state-of-the-art accuracy with 4.31% Fl-all on KITTI 2015
Training techniques significantly impact optical flow model performance
Abstract
How important are training details and datasets to recent optical flow models like RAFT? And do they generalize? To explore these questions, rather than develop a new model, we revisit three prominent models, PWC-Net, IRR-PWC and RAFT, with a common set of modern training techniques and datasets, and observe significant performance gains, demonstrating the importance and generality of these training details. Our newly trained PWC-Net and IRR-PWC models show surprisingly large improvements, up to 30% versus original published results on Sintel and KITTI 2015 benchmarks. They outperform the more recent Flow1D on KITTI 2015 while being 3x faster during inference. Our newly trained RAFT achieves an Fl-all score of 4.31% on KITTI 2015, more accurate than all published optical flow methods at the time of writing. Our results demonstrate the benefits of separating the contributions of models,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Advanced Vision and Imaging · Advanced Image Processing Techniques
