Focus On Details: Online Multi-object Tracking with Diverse Fine-grained Representation
Hao Ren, Shoudong Han, Huilin Ding, Ziwen Zhang, Hongwei Wang, Faquan, Wang

TL;DR
This paper introduces a novel multi-object tracking method that utilizes diverse fine-grained appearance representations, achieved through multi-scale feature alignment and local mask extraction, leading to state-of-the-art results especially in challenging scenarios.
Contribution
It proposes a new fine-grained representation framework with Flow Alignment FPN and Multi-head Part Mask Generator for improved identity discrimination in MOT.
Findings
Achieves state-of-the-art performance on MOT17 and MOT20 datasets.
Significantly outperforms ByteTrack on DanceTrack with 5.0% higher HOTA.
Demonstrates the effectiveness of diverse fine-grained features in challenging tracking scenarios.
Abstract
Discriminative representation is essential to keep a unique identifier for each target in Multiple object tracking (MOT). Some recent MOT methods extract features of the bounding box region or the center point as identity embeddings. However, when targets are occluded, these coarse-grained global representations become unreliable. To this end, we propose exploring diverse fine-grained representation, which describes appearance comprehensively from global and local perspectives. This fine-grained representation requires high feature resolution and precise semantic information. To effectively alleviate the semantic misalignment caused by indiscriminate contextual information aggregation, Flow Alignment FPN (FAFPN) is proposed for multi-scale feature alignment aggregation. It generates semantic flow among feature maps from different resolutions to transform their pixel positions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Air Quality Monitoring and Forecasting · Advanced Neural Network Applications
MethodsTest · 1x1 Convolution · Convolution · Feature Pyramid Network
