FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation
Jingyi Tang, Gu Wang, Zeyu Chen, Shengquan Li, Xiu Li, Xiangyang Ji

TL;DR
FAFA is a novel self-supervised framework that leverages frequency-aware flow estimation and synthetic data to improve underwater object pose estimation without requiring real annotations.
Contribution
The paper introduces FAFA, a frequency-aware, self-supervised approach for underwater pose estimation that reduces reliance on real-world annotations and enhances domain adaptation.
Findings
Significant performance improvements over state-of-the-art methods.
Effective domain-invariant feature extraction via FFT-based augmentation.
Successful adaptation to real underwater environments using self-supervision.
Abstract
Although methods for estimating the pose of objects in indoor scenes have achieved great success, the pose estimation of underwater objects remains challenging due to difficulties brought by the complex underwater environment, such as degraded illumination, blurring, and the substantial cost of obtaining real annotations. In response, we introduce FAFA, a Frequency-Aware Flow-Aided self-supervised framework for 6D pose estimation of unmanned underwater vehicles (UUVs). Essentially, we first train a frequency-aware flow-based pose estimator on synthetic data, where an FFT-based augmentation approach is proposed to facilitate the network in capturing domain-invariant features and target domain styles from a frequency perspective. Further, we perform self-supervised training by enforcing flow-aided multi-level consistencies to adapt it to the real-world underwater environment. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUnderwater Vehicles and Communication Systems · Target Tracking and Data Fusion in Sensor Networks · Teleoperation and Haptic Systems
