Progressive Transformation Learning for Leveraging Virtual Images in Training
Yi-Ting Shen, Hyungtae Lee, Heesung Kwon, Shuvra Shikhar Bhattacharyya

TL;DR
This paper introduces Progressive Transformation Learning (PTL), a method that iteratively enhances virtual images' realism to improve UAV-based object detection, especially in small data and cross-domain scenarios.
Contribution
The paper proposes a novel PTL framework that progressively augments training data with transformed virtual images, effectively reducing domain gap issues in UAV object detection.
Findings
PTL significantly improves detection performance over baseline methods.
PTL is especially effective in small data and cross-domain settings.
The Mahalanobis distance effectively quantifies domain gap for image selection.
Abstract
To effectively interrogate UAV-based images for detecting objects of interest, such as humans, it is essential to acquire large-scale UAV-based datasets that include human instances with various poses captured from widely varying viewing angles. As a viable alternative to laborious and costly data curation, we introduce Progressive Transformation Learning (PTL), which gradually augments a training dataset by adding transformed virtual images with enhanced realism. Generally, a virtual2real transformation generator in the conditional GAN framework suffers from quality degradation when a large domain gap exists between real and virtual images. To deal with the domain gap, PTL takes a novel approach that progressively iterates the following three steps: 1) select a subset from a pool of virtual images according to the domain gap, 2) transform the selected virtual images to enhance realism,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
