PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation
Deyi Ji, Wenwei Jin, Hongtao Lu, Feng Zhao

TL;DR
PPTFormer introduces a pseudo multi-perspective transformer that enhances UAV image segmentation by simulating multiple perspectives without needing multi-view datasets, achieving state-of-the-art results.
Contribution
The paper proposes a novel pseudo multi-perspective learning approach with a transformer architecture, improving UAV segmentation accuracy without multi-view data.
Findings
Achieves state-of-the-art performance on five UAV datasets.
Effectively simulates UAV perspectives through pseudo multi-perspective attention.
Significantly improves segmentation accuracy over existing methods.
Abstract
The ascension of Unmanned Aerial Vehicles (UAVs) in various fields necessitates effective UAV image segmentation, which faces challenges due to the dynamic perspectives of UAV-captured images. Traditional segmentation algorithms falter as they cannot accurately mimic the complexity of UAV perspectives, and the cost of obtaining multi-perspective labeled datasets is prohibitive. To address these issues, we introduce the PPTFormer, a novel \textbf{P}seudo Multi-\textbf{P}erspective \textbf{T}rans\textbf{former} network that revolutionizes UAV image segmentation. Our approach circumvents the need for actual multi-perspective data by creating pseudo perspectives for enhanced multi-perspective learning. The PPTFormer network boasts Perspective Representation, novel Perspective Prototypes, and a specialized encoder and decoder that together achieve superior segmentation results through Pseudo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Infrared Target Detection Methodologies · Advanced Vision and Imaging
MethodsSoftmax · Attention Is All You Need
