Fast Hand Detection in Collaborative Learning Environments
Sravani Teeparthi, Venkatesh Jatla, Marios S. Pattichis, Sylvia, Celedon Pattichis, Carlos LopezLeiva

TL;DR
This paper presents a long-term hand detection method for collaborative learning videos that effectively handles occlusions and appearance changes, achieving high accuracy and real-time performance.
Contribution
It introduces a novel approach combining object detection, time projection, clustering, and region removal for robust long-term hand detection.
Findings
Achieved 72% AP at 0.5 IoU with standard detection.
Improved AP to 81% using data augmentation.
Runs at 4.7x real-time with 80% reduction in false positives.
Abstract
Long-term object detection requires the integration of frame-based results over several seconds. For non-deformable objects, long-term detection is often addressed using object detection followed by video tracking. Unfortunately, tracking is inapplicable to objects that undergo dramatic changes in appearance from frame to frame. As a related example, we study hand detection over long video recordings in collaborative learning environments. More specifically, we develop long-term hand detection methods that can deal with partial occlusions and dramatic changes in appearance. Our approach integrates object-detection, followed by time projections, clustering, and small region removal to provide effective hand detection over long videos. The hand detector achieved average precision (AP) of 72% at 0.5 intersection over union (IoU). The detection results were improved to 81% by using our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
