Score: 0

Real-Time Semantic Segmentation of Aerial Images Using an Embedded U-Net: A Comparison of CPU, GPU, and FPGA Workflows

Published: March 7, 2025 | arXiv ID: 2503.08700v1

By: Julien Posso , Hugo Kieffer , Nicolas Menga and more

Potential Business Impact:

Lets drones quickly understand what they see.

Business Areas:

Image Recognition Data and Analytics, Software

This study introduces a lightweight U-Net model optimized for real-time semantic segmentation of aerial images, targeting the efficient utilization of Commercial Off-The-Shelf (COTS) embedded computing platforms. We maintain the accuracy of the U-Net on a real-world dataset while significantly reducing the model's parameters and Multiply-Accumulate (MAC) operations by a factor of 16. Our comprehensive analysis covers three hardware platforms (CPU, GPU, and FPGA) and five different toolchains (TVM, FINN, Vitis AI, TensorFlow GPU, and cuDNN), assessing each on metrics such as latency, power consumption, memory footprint, energy efficiency, and FPGA resource usage. The results highlight the trade-offs between these platforms and toolchains, with a particular focus on the practical deployment challenges in real-world applications. Our findings demonstrate that while the FPGA with Vitis AI emerges as the superior choice due to its performance, energy efficiency, and maturity, it requires specialized hardware knowledge, emphasizing the need for a balanced approach in selecting embedded computing solutions for semantic segmentation tasks

Efficient FPGA-accelerated Convolutional Neural Networks for Cloud Detection on CubeSats

Signal Processing

Makes small satellites see clouds in space.

4 Apr 2025 1

88%

Real Time FPGA Based CNNs for Detection, Classification, and Tracking in Autonomous Systems: State of the Art Designs and Optimizations

Hardware Architecture

Makes cameras understand things faster and with less power.

4 Sep 2025 1

87%

Red grape detection with accelerated artificial neural networks in the FPGA's programmable logic

CV and Pattern Recognition

Makes robots see and move much faster.

3 Jul 2025 0

View PDF Login to Bookmark

Page Count

11 pages

Real-Time Semantic Segmentation of Aerial Images Using an Embedded U-Net: A Comparison of CPU, GPU, and FPGA Workflows

Lets drones quickly understand what they see.

Technical Abstract

Efficient FPGA-accelerated Convolutional Neural Networks for Cloud Detection on CubeSats

Real Time FPGA Based CNNs for Detection, Classification, and Tracking in Autonomous Systems: State of the Art Designs and Optimizations

Red grape detection with accelerated artificial neural networks in the FPGA's programmable logic