TL;DR
DeltaFlow introduces a lightweight multi-frame scene flow estimation framework that efficiently captures temporal motion cues, improves accuracy with novel loss functions, and outperforms existing methods in speed and error reduction.
Contribution
It proposes a novel $ riangle$ scheme for efficient multi-frame feature extraction and introduces Category-Balanced and Instance Consistency Losses to enhance flow accuracy.
Findings
Achieves up to 22% lower error on benchmark datasets.
Runs twice as fast as comparable multi-frame methods.
Demonstrates strong cross-domain generalization.
Abstract
Previous dominant methods for scene flow estimation focus mainly on input from two consecutive frames, neglecting valuable information in the temporal domain. While recent trends shift towards multi-frame reasoning, they suffer from rapidly escalating computational costs as the number of frames grows. To leverage temporal information more efficiently, we propose DeltaFlow (Flow), a lightweight 3D framework that captures motion cues via a scheme, extracting temporal features with minimal computational cost, regardless of the number of frames. Additionally, scene flow estimation faces challenges such as imbalanced object class distributions and motion inconsistency. To tackle these issues, we introduce a Category-Balanced Loss to enhance learning across underrepresented classes and an Instance Consistency Loss to enforce coherent object motion, improving flow accuracy.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
