EgoExo++: Integrating On-demand Exocentric Visuals with 2.5D Ground Surface Estimation for Interactive Teleoperation of Underwater ROVs

Adnan Abdullah; Ruo Chen; Ioannis Rekleitis; and Md Jahidul Islam

arXiv:2407.00848·cs.RO·April 24, 2026·3 cites

EgoExo++: Integrating On-demand Exocentric Visuals with 2.5D Ground Surface Estimation for Interactive Teleoperation of Underwater ROVs

Adnan Abdullah, Ruo Chen, Ioannis Rekleitis, and Md Jahidul Islam

PDF

TL;DR

EgoExo++ enhances underwater ROV teleoperation by synthesizing exocentric views and 2.5D ground surface estimation from egocentric video, improving navigation accuracy and user experience in complex environments.

Contribution

The paper introduces EgoExo++, a novel system that combines on-demand exocentric view synthesis with ground surface estimation for improved underwater ROV control.

Findings

01

EgoExo++ achieves 16% faster mission completion times.

02

Reduces path deviation ratio by 5 times.

03

Fewer collision events in user studies.

Abstract

Underwater ROVs (Remotely Operated Vehicles) are indispensable for subsea exploration and task execution, yet typical teleoperation engines based on egocentric (first-person) video feeds restrict human operators' field-of-view and limit precise maneuvering in complex, unstructured underwater environments. To address this, we first propose EgoExo, a geometry-driven solution integrated into a visual SLAM pipeline that synthesizes on-demand exocentric (third-person) views from egocentric camera feeds. We further propose EgoExo++, which extends beyond 2D exocentric view synthesis (EgoExo) to augment a piecewise planar 2.5D ground surface estimation on-the-fly. Its anchor-free aerial viewpoint supports ground-relative reasoning, such as clearance and terrain-based navigation marker following. The computations involved are closed-form and rely solely on egocentric views and monocular SLAM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.