Facial Recognition in Collaborative Learning Videos
Phuong Tran, Marios Pattichis, Sylvia Celed\'on-Pattichis, Carlos, L\'opezLeiva

TL;DR
This paper presents a fast and accurate face recognition system tailored for collaborative learning videos, effectively handling occlusions, pose variations, and long durations by leveraging past detection data and prototype face collections.
Contribution
The authors introduce a dynamic face recognition system that improves speed and accuracy in challenging collaborative learning video scenarios, outperforming baseline methods.
Findings
Achieved 86.2% accuracy, surpassing baseline's 70.8%.
Recognition was 28.1 times faster than the baseline.
Effectively handled occlusion and pose variation challenges.
Abstract
Face recognition in collaborative learning videos presents many challenges. In collaborative learning videos, students sit around a typical table at different positions to the recording camera, come and go, move around, get partially or fully occluded. Furthermore, the videos tend to be very long, requiring the development of fast and accurate methods. We develop a dynamic system of recognizing participants in collaborative learning systems. We address occlusion and recognition failures by using past information about the face detection history. We address the need for detecting faces from different poses and the need for speed by associating each participant with a collection of prototype faces computed through sampling or K-means clustering. Our results show that the proposed system is proven to be very fast and accurate. We also compare our system against a baseline system that uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition · Hand Gesture Recognition Systems
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
