TL;DR
This paper introduces NMS-Loss, a novel training loss that integrates Non-Maximum Suppression into the training process, significantly improving crowded pedestrian detection accuracy.
Contribution
We propose NMS-Loss, enabling end-to-end training of object detectors with NMS, addressing the disconnect between training targets and evaluation metrics in crowded scenes.
Findings
Achieved 5.92% Miss Rate on Caltech dataset
Achieved 10.08% Miss Rate on CityPersons dataset
Outperformed state-of-the-art methods in crowded pedestrian detection
Abstract
Non-Maximum Suppression (NMS) is essential for object detection and affects the evaluation results by incorporating False Positives (FP) and False Negatives (FN), especially in crowd occlusion scenes. In this paper, we raise the problem of weak connection between the training targets and the evaluation metrics caused by NMS and propose a novel NMS-Loss making the NMS procedure can be trained end-to-end without any additional network parameters. Our NMS-Loss punishes two cases when FP is not suppressed and FN is wrongly eliminated by NMS. Specifically, we propose a pull loss to pull predictions with the same target close to each other, and a push loss to push predictions with different targets away from each other. Experimental results show that with the help of NMS-Loss, our detector, namely NMS-Ped, achieves impressive results with Miss Rate of 5.92% on Caltech dataset and 10.08% on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
