UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model

Shuai Yuan; Lei Luo; Zhuo Hui; Can Pu; Xiaoyu Xiang; Rakesh Ranjan and; Denis Demandolx

arXiv:2405.02608·cs.CV·May 7, 2024

UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model

Shuai Yuan, Lei Luo, Zhuo Hui, Can Pu, Xiaoyu Xiang, Rakesh Ranjan and, Denis Demandolx

PDF

Open Access 1 Repo

TL;DR

UnSAMFlow is an unsupervised optical flow method that incorporates object-level information from the Segment Anything Model, improving boundary accuracy and robustness across domains.

Contribution

It introduces a novel integration of SAM-based semantic augmentation and homography-based smoothness for enhanced unsupervised optical flow estimation.

Findings

01

Outperforms state-of-the-art on KITTI and Sintel datasets

02

Produces sharp object boundaries in optical flow

03

Generalizes well across different domains

Abstract

Traditional unsupervised optical flow methods are vulnerable to occlusions and motion boundaries due to lack of object-level information. Therefore, we propose UnSAMFlow, an unsupervised flow network that also leverages object information from the latest foundation model Segment Anything Model (SAM). We first include a self-supervised semantic augmentation module tailored to SAM masks. We also analyze the poor gradient landscapes of traditional smoothness losses and propose a new smoothness definition based on homography instead. A simple yet effective mask feature module has also been added to further aggregate features on the object level. With all these adaptations, our method produces clear optical flow estimation with sharp boundaries around objects, which outperforms state-of-the-art methods on both KITTI and Sintel datasets. Our method also generalizes well across domains and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/unsamflow
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Retinal Imaging and Analysis · Advanced Vision and Imaging

MethodsSegment Anything Model