Long Term Object Detection and Tracking in Collaborative Learning Environments
Sravani Teeparthi

TL;DR
This paper presents a reliable, real-time object detection and tracking system for long videos in collaborative learning environments, achieving high accuracy and speed improvements over previous methods.
Contribution
Developed a high-precision, fast object detection and tracking system for long videos, validated on diverse sessions, with significant speed and accuracy enhancements.
Findings
Keyboard detector achieved 92% AP at 0.5 IoU.
Combined system runs at 21X real-time speed with high accuracy.
Hand detector achieved 81% AP with 80% false-positive reduction.
Abstract
Human activity recognition in videos is a challenging problem that has drawn a lot of interest, particularly when the goal requires the analysis of a large video database. AOLME project provides a collaborative learning environment for middle school students to explore mathematics, computer science, and engineering by processing digital images and videos. As part of this project, around 2200 hours of video data was collected for analysis. Because of the size of the dataset, it is hard to analyze all the videos of the dataset manually. Thus, there is a huge need for reliable computer-based methods that can detect activities of interest. My thesis is focused on the development of accurate methods for detecting and tracking objects in long videos. All the models are validated on videos from 7 different sessions, ranging from 45 minutes to 90 minutes. The keyboard detector achieved a very…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · IoT-based Smart Home Systems · Human Pose and Action Recognition
