Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video   Face Detection and Recognition

Asma Baobaid; Mahmoud Meribout

arXiv:2505.04502·cs.CV·May 8, 2025

Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition

Asma Baobaid, Mahmoud Meribout

PDF

Open Access

TL;DR

This paper presents a framework that leverages all hardware engines in edge GPUs for concurrent video face detection and recognition, improving throughput and power efficiency in real-time applications.

Contribution

It introduces a unified, automated approach to utilize all GPU hardware engines simultaneously for face detection, recognition, and decoding tasks at the edge.

Findings

01

Higher throughput achieved on NVIDIA edge Orin GPU

02

Power consumption reduced by around 5%

03

Performance improves with multiple video streams

Abstract

Video face detection and recognition in public places at the edge is required in several applications, such as security reinforcement and contactless access to authorized venues. This paper aims to maximize the simultaneous usage of hardware engines available in edge GPUs nowadays by leveraging the concurrency and pipelining of tasks required for face detection and recognition. This also includes the video decoding task, which is required in most face monitoring applications as the video streams are usually carried via Gbps Ethernet network. This constitutes an improvement over previous works where the tasks are usually allocated to a single engine due to the lack of a unified and automated framework that simultaneously explores all hardware engines. In addition, previously, the input faces were usually embedded in still images or within raw video streams that overlook the burst delay…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Parallel Computing and Optimization Techniques · Face recognition and analysis