Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition
Asma Baobaid, Mahmoud Meribout

TL;DR
This paper presents a framework that leverages all hardware engines in edge GPUs for concurrent video face detection and recognition, improving throughput and power efficiency in real-time applications.
Contribution
It introduces a unified, automated approach to utilize all GPU hardware engines simultaneously for face detection, recognition, and decoding tasks at the edge.
Findings
Higher throughput achieved on NVIDIA edge Orin GPU
Power consumption reduced by around 5%
Performance improves with multiple video streams
Abstract
Video face detection and recognition in public places at the edge is required in several applications, such as security reinforcement and contactless access to authorized venues. This paper aims to maximize the simultaneous usage of hardware engines available in edge GPUs nowadays by leveraging the concurrency and pipelining of tasks required for face detection and recognition. This also includes the video decoding task, which is required in most face monitoring applications as the video streams are usually carried via Gbps Ethernet network. This constitutes an improvement over previous works where the tasks are usually allocated to a single engine due to the lack of a unified and automated framework that simultaneously explores all hardware engines. In addition, previously, the input faces were usually embedded in still images or within raw video streams that overlook the burst delay…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Parallel Computing and Optimization Techniques · Face recognition and analysis
