Edge GPU Aware Multiple AI Model Pipeline for Accelerated MRI Reconstruction and Analysis
By: Ashiyana Abdul Majeed, Mahmoud Meribout, Safa Mohammed Sali
Potential Business Impact:
Makes MRI scans and diagnoses much faster.
Advancements in AI have greatly enhanced the medical imaging process, making it quicker to diagnose patients. However, very few have investigated the optimization of a multi-model system with hardware acceleration. As specialized edge devices emerge, the efficient use of their accelerators is becoming increasingly crucial. This paper proposes a hardware-accelerated method for simultaneous reconstruction and diagnosis of \ac{MRI} from \ac{CT} images. Real-time performance of achieving a throughput of nearly 150 frames per second was achieved by leveraging hardware engines available in modern NVIDIA edge GPU, along with scheduling techniques. This includes the GPU and the \ac{DLA} available in both Jetson AGX Xavier and Jetson AGX Orin, which were considered in this paper. The hardware allocation of different layers of the multiple AI models was done in such a way that the ideal time between the hardware engines is reduced. In addition, the AI models corresponding to the \ac{GAN} model were fine-tuned in such a way that no fallback execution into the GPU engine is required without compromising accuracy. Indeed, the accuracy corresponding to the fine-tuned edge GPU-aware AI models exhibited an accuracy enhancement of 5\%. A further hardware allocation of two fine-tuned GPU-aware GAN models proves they can double the performance over the original model, leveraging adequate partitioning on the NVIDIA Jetson AGX Xavier and Orin devices. The results prove the effectiveness of employing hardware-aware models in parallel for medical image analysis and diagnosis.
Similar Papers
Hardware Acceleration in Portable MRIs: State of the Art and Future Prospects
Hardware Architecture
Makes portable MRI scans faster and better.
Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration
CV and Pattern Recognition
Makes cameras find and know faces faster, cheaper.
Accelerated Digital Twin Learning for Edge AI: A Comparison of FPGA and Mobile GPU
Distributed, Parallel, and Cluster Computing
Makes digital body copies learn faster, cheaper.