CROVIA: Seeing Drone Scenes from Car Perspective via Cross-View Adaptation
Thanh-Dat Truong, Chi Nhan Duong, Ashley Dowling, Son Lam Phung,, Jackson Cothren, Khoa Luu

TL;DR
This paper introduces CROVIA, a novel cross-view adaptation method that leverages geometry constraints and multi-modal networks to improve UAV scene segmentation by transferring knowledge from vehicle view datasets.
Contribution
The work proposes a new geometry-based constraint and a GeiCo loss for cross-view adaptation, enabling effective knowledge transfer without paired data.
Findings
Achieves state-of-the-art results on SYNTHIA to UAVID and GTA5 to UAVID benchmarks.
Introduces new cross-view adaptation benchmarks for UAV scene segmentation.
Demonstrates effective transfer of knowledge from vehicle to UAV perspectives.
Abstract
Understanding semantic scene segmentation of urban scenes captured from the Unmanned Aerial Vehicles (UAV) perspective plays a vital role in building a perception model for UAV. With the limitations of large-scale densely labeled data, semantic scene segmentation for UAV views requires a broad understanding of an object from both its top and side views. Adapting from well-annotated autonomous driving data to unlabeled UAV data is challenging due to the cross-view differences between the two data types. Our work proposes a novel Cross-View Adaptation (CROVIA) approach to effectively adapt the knowledge learned from on-road vehicle views to UAV views. First, a novel geometry-based constraint to cross-view adaptation is introduced based on the geometry correlation between views. Second, cross-view correlations from image space are effectively transferred to segmentation space without any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Domain Adaptation and Few-Shot Learning
