Unconstrained Body Recognition at Altitude and Range: Comparing Four   Approaches

Blake A Myers; Matthew Q Hill; Veda Nandan Gandi; Thomas M Metz; Alice; J O'Toole

arXiv:2502.07130·cs.CV·February 12, 2025

Unconstrained Body Recognition at Altitude and Range: Comparing Four Approaches

Blake A Myers, Matthew Q Hill, Veda Nandan Gandi, Thomas M Metz, Alice, J O'Toole

PDF

Open Access

TL;DR

This paper compares four approaches to long-term body shape-based person identification across various challenging conditions, introducing new models and training methods with extensive datasets.

Contribution

It presents novel body identification models based on Vision Transformers and expands existing ResNet models, trained on a large, diverse dataset for robust long-term identification.

Findings

01

Transformers outperform ResNet models in challenging conditions

02

Models maintain high accuracy over large distances and altitude variations

03

Training on diverse datasets improves long-term identification robustness

Abstract

This study presents an investigation of four distinct approaches to long-term person identification using body shape. Unlike short-term re-identification systems that rely on temporary features (e.g., clothing), we focus on learning persistent body shape characteristics that remain stable over time. We introduce a body identification model based on a Vision Transformer (ViT) (Body Identification from Diverse Datasets, BIDDS) and on a Swin-ViT model (Swin-BIDDS). We also expand on previous approaches based on the Linguistic and Non-linguistic Core ResNet Identity Models (LCRIM and NLCRIM), but with improved training. All models are trained on a large and diverse dataset of over 1.9 million images of approximately 5k identities across 9 databases. Performance was evaluated on standard re-identification benchmark datasets (MARS, MSMT17, Outdoor Gait, DeepChange) and on an unconstrained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHigh Altitude and Hypoxia

MethodsAttention Is All You Need · Average Pooling · Global Average Pooling · Linear Layer · Multi-Head Attention · Max Pooling · Position-Wise Feed-Forward Layer · Adam · Convolution · Softmax