SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment   Anything Model

Shili Zhou; Ruian He; Weimin Tan; Bo Yan

arXiv:2307.16586·cs.CV·December 22, 2023·1 cites

SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model

Shili Zhou, Ruian He, Weimin Tan, Bo Yan

PDF

Open Access 1 Video

TL;DR

SAMFlow leverages the Segment Anything Model to improve optical flow estimation by enhancing object integrity, achieving state-of-the-art results on Sintel and KITTI-15 benchmarks.

Contribution

The paper introduces a novel integration of SAM with FlowFormer, including task-specific adaptation modules, to address fragmentation in optical flow estimation.

Findings

01

Achieves top performance on Sintel and KITTI-15 benchmarks.

02

Surpasses previous methods in EPE and F1-all metrics.

03

Demonstrates the effectiveness of large vision models in optical flow tasks.

Abstract

Optical Flow Estimation aims to find the 2D dense motion field between two frames. Due to the limitation of model structures and training datasets, existing methods often rely too much on local clues and ignore the integrity of objects, resulting in fragmented motion estimation. Through theoretical analysis, we find the pre-trained large vision models are helpful in optical flow estimation, and we notice that the recently famous Segment Anything Model (SAM) demonstrates a strong ability to segment complete objects, which is suitable for solving the fragmentation problem. We thus propose a solution to embed the frozen SAM image encoder into FlowFormer to enhance object perception. To address the challenge of in-depth utilizing SAM in non-segmentation tasks like optical flow estimation, we propose an Optical Flow Task-Specific Adaption scheme, including a Context Fusion Module to fuse the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model· underline

Taxonomy

TopicsAdvanced Vision and Imaging · Retinal Imaging and Analysis · Advanced Neural Network Applications

MethodsSegment Anything Model · Fragmentation