SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model
Shili Zhou, Ruian He, Weimin Tan, Bo Yan

TL;DR
SAMFlow leverages the Segment Anything Model to improve optical flow estimation by enhancing object integrity, achieving state-of-the-art results on Sintel and KITTI-15 benchmarks.
Contribution
The paper introduces a novel integration of SAM with FlowFormer, including task-specific adaptation modules, to address fragmentation in optical flow estimation.
Findings
Achieves top performance on Sintel and KITTI-15 benchmarks.
Surpasses previous methods in EPE and F1-all metrics.
Demonstrates the effectiveness of large vision models in optical flow tasks.
Abstract
Optical Flow Estimation aims to find the 2D dense motion field between two frames. Due to the limitation of model structures and training datasets, existing methods often rely too much on local clues and ignore the integrity of objects, resulting in fragmented motion estimation. Through theoretical analysis, we find the pre-trained large vision models are helpful in optical flow estimation, and we notice that the recently famous Segment Anything Model (SAM) demonstrates a strong ability to segment complete objects, which is suitable for solving the fragmentation problem. We thus propose a solution to embed the frozen SAM image encoder into FlowFormer to enhance object perception. To address the challenge of in-depth utilizing SAM in non-segmentation tasks like optical flow estimation, we propose an Optical Flow Task-Specific Adaption scheme, including a Context Fusion Module to fuse the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Retinal Imaging and Analysis · Advanced Neural Network Applications
MethodsSegment Anything Model · Fragmentation
