POPCat: Propagation of particles for complex annotation tasks
Adam Srebrnjak Yang, Dheeraj Khanna, and John S. Zelek

TL;DR
POPcat introduces a semi-supervised, particle-tracking-based pipeline that efficiently generates annotations for complex multi-object video datasets, significantly improving detection metrics on challenging benchmarks.
Contribution
It presents a novel semi-supervised annotation method leveraging particle tracking and temporal features, enhancing annotation efficiency and accuracy for complex video datasets.
Findings
Achieved up to 24.5% improvement in recall on GMOT-40
Improved mAP50 by 9.6% on AnimalTrack
Enhanced detection metrics on Visdrone-2019
Abstract
Novel dataset creation for all multi-object tracking, crowd-counting, and industrial-based videos is arduous and time-consuming when faced with a unique class that densely populates a video sequence. We propose a time efficient method called POPCat that exploits the multi-target and temporal features of video data to produce a semi-supervised pipeline for segmentation or box-based video annotation. The method retains the accuracy level associated with human level annotation while generating a large volume of semi-supervised annotations for greater generalization. The method capitalizes on temporal features through the use of a particle tracker to expand the domain of human-provided target points. This is done through the use of a particle tracker to reassociate the initial points to a set of images that follow the labeled frame. A YOLO model is then trained with this generated data, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
