EgoExo++: Integrating On-demand Exocentric Visuals with 2.5D Ground Surface Estimation for Interactive Teleoperation of Underwater ROVs
Adnan Abdullah, Ruo Chen, Ioannis Rekleitis, and Md Jahidul Islam

TL;DR
EgoExo++ enhances underwater ROV teleoperation by synthesizing exocentric views and 2.5D ground surface estimation from egocentric video, improving navigation accuracy and user experience in complex environments.
Contribution
The paper introduces EgoExo++, a novel system that combines on-demand exocentric view synthesis with ground surface estimation for improved underwater ROV control.
Findings
EgoExo++ achieves 16% faster mission completion times.
Reduces path deviation ratio by 5 times.
Fewer collision events in user studies.
Abstract
Underwater ROVs (Remotely Operated Vehicles) are indispensable for subsea exploration and task execution, yet typical teleoperation engines based on egocentric (first-person) video feeds restrict human operators' field-of-view and limit precise maneuvering in complex, unstructured underwater environments. To address this, we first propose EgoExo, a geometry-driven solution integrated into a visual SLAM pipeline that synthesizes on-demand exocentric (third-person) views from egocentric camera feeds. We further propose EgoExo++, which extends beyond 2D exocentric view synthesis (EgoExo) to augment a piecewise planar 2.5D ground surface estimation on-the-fly. Its anchor-free aerial viewpoint supports ground-relative reasoning, such as clearance and terrain-based navigation marker following. The computations involved are closed-form and rely solely on egocentric views and monocular SLAM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
