CrowdMOT: Crowdsourcing Strategies for Tracking Multiple Objects in Videos
Samreen Anjum, Chi Lin, Danna Gurari

TL;DR
CrowdMOT is a crowdsourcing platform designed to improve multi-object tracking in videos by optimizing task decomposition and annotation sharing, achieving higher quality results than current systems.
Contribution
Introduces CrowdMOT, a novel crowdsourcing framework that explores task design strategies to enhance annotation quality in multi-object tracking.
Findings
Task decomposition affects annotation quality.
Sharing previous annotations improves accuracy.
Strategies outperform existing crowdsourcing methods.
Abstract
Crowdsourcing is a valuable approach for tracking objects in videos in a more scalable manner than possible with domain experts. However, existing frameworks do not produce high quality results with non-expert crowdworkers, especially for scenarios where objects split. To address this shortcoming, we introduce a crowdsourcing platform called CrowdMOT, and investigate two micro-task design decisions: (1) whether to decompose the task so that each worker is in charge of annotating all objects in a sub-segment of the video versus annotating a single object across the entire video, and (2) whether to show annotations from previous workers to the next individuals working on the task. We conduct experiments on a diversity of videos which show both familiar objects (aka - people) and unfamiliar objects (aka - cells). Our results highlight strategies for efficiently collecting higher quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Visual Attention and Saliency Detection · Tactile and Sensory Interactions
