Accelerated Video Annotation driven by Deep Detector and Tracker
Eric Price, Aamir Ahmad

TL;DR
This paper introduces a fast video annotation method combining deep learning-based detector and tracker to improve accuracy and reduce manual effort in labeling moving objects in videos.
Contribution
It presents a novel annotation approach that integrates SSD and RE$^3$ for more reliable and efficient object annotation in videos, reducing drift and manual supervision.
Findings
Significantly reduces annotation drift compared to traditional methods.
Decreases manual supervision time required for video annotation.
Validated on drone videos with improved annotation accuracy.
Abstract
Annotating object ground truth in videos is vital for several downstream tasks in robot perception and machine learning, such as for evaluating the performance of an object tracker or training an image-based object detector. The accuracy of the annotated instances of the moving objects on every image frame in a video is crucially important. Achieving that through manual annotations is not only very time consuming and labor intensive, but is also prone to high error rate. State-of-the-art annotation methods depend on manually initializing the object bounding boxes only in the first frame and then use classical tracking methods, e.g., adaboost, or kernelized correlation filters, to keep track of those bounding boxes. These can quickly drift, thereby requiring tedious manual supervision. In this paper, we propose a new annotation method which leverages a combination of a learning-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
