# Instance-Aware Representation Learning and Association for Online   Multi-Person Tracking

**Authors:** Hefeng Wu, Yafei Hu, Keze Wang, Hanhui Li, Lin Nie, Hui Cheng

arXiv: 1905.12409 · 2019-05-30

## TL;DR

This paper introduces a deep learning-based online multi-person tracking method that learns instance-aware representations and dynamically adapts to appearance changes, improving robustness over traditional detection-to-association approaches.

## Contribution

It proposes a multi-branch neural network architecture with dynamic instance subnets and an online association mechanism for improved multi-person tracking.

## Key findings

- Outperforms recent online MPT methods on benchmark datasets.
- Effectively handles occlusions and detection errors.
- Adapts to appearance variations through online hard example mining.

## Abstract

Multi-Person Tracking (MPT) is often addressed within the detection-to-association paradigm. In such approaches, human detections are first extracted in every frame and person trajectories are then recovered by a procedure of data association (usually offline). However, their performances usually degenerate in presence of detection errors, mutual interactions and occlusions. In this paper, we present a deep learning based MPT approach that learns instance-aware representations of tracked persons and robustly online infers states of the tracked persons. Specifically, we design a multi-branch neural network (MBN), which predicts the classification confidences and locations of all targets by taking a batch of candidate regions as input. In our MBN architecture, each branch (instance-subnet) corresponds to an individual to be tracked and new branches can be dynamically created for handling newly appearing persons. Then based on the output of MBN, we construct a joint association matrix that represents meaningful states of tracked persons (e.g., being tracked or disappearing from the scene) and solve it by using the efficient Hungarian algorithm. Moreover, we allow the instance-subnets to be updated during tracking by online mining hard examples, accounting to person appearance variations over time. We comprehensively evaluate our framework on a popular MPT benchmark, demonstrating its excellent performance in comparison with recent online MPT methods.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.12409/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1905.12409/full.md

## References

50 references — full list in the complete paper: https://tomesphere.com/paper/1905.12409/full.md

---
Source: https://tomesphere.com/paper/1905.12409