Score: 0

Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition

Published: May 7, 2025 | arXiv ID: 2505.04502v1

By: Asma Baobaid, Mahmoud Meribout

Potential Business Impact:

Makes cameras find and know faces faster.

Business Areas:

Image Recognition Data and Analytics, Software

Video face detection and recognition in public places at the edge is required in several applications, such as security reinforcement and contactless access to authorized venues. This paper aims to maximize the simultaneous usage of hardware engines available in edge GPUs nowadays by leveraging the concurrency and pipelining of tasks required for face detection and recognition. This also includes the video decoding task, which is required in most face monitoring applications as the video streams are usually carried via Gbps Ethernet network. This constitutes an improvement over previous works where the tasks are usually allocated to a single engine due to the lack of a unified and automated framework that simultaneously explores all hardware engines. In addition, previously, the input faces were usually embedded in still images or within raw video streams that overlook the burst delay caused by the decoding stage. The results on real-life video streams suggest that simultaneously using all the hardware engines available in the recent NVIDIA edge Orin GPU, higher throughput, and a slight saving of power consumption of around 300 mW, accounting for around 5%, have been achieved while satisfying the real-time performance constraint. The performance gets even higher by considering several video streams simultaneously. Further performance improvement could have been obtained if the number of shuffle layers that were created by the tensor RT framework for the face recognition task was lower. Thus, the paper suggests some hardware improvements to the existing edge GPU processors to enhance their performance even higher.

Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration

CV and Pattern Recognition

Makes cameras find and know faces faster, cheaper.

7 May 2025 0

86%

Edge GPU Aware Multiple AI Model Pipeline for Accelerated MRI Reconstruction and Analysis

Hardware Architecture

Makes MRI scans and diagnoses much faster.

2 Oct 2025 0

86%

Boosting performance of computer vision applications through embedded GPUs on the edge

CV and Pattern Recognition

Makes phone apps with cool pictures run faster.

3 Nov 2025 0

View PDF Login to Bookmark

Country of Origin

🇦🇪 United Arab Emirates

Page Count

10 pages

Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition

Makes cameras find and know faces faster.

Technical Abstract

Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration

Edge GPU Aware Multiple AI Model Pipeline for Accelerated MRI Reconstruction and Analysis

Boosting performance of computer vision applications through embedded GPUs on the edge