hls4ml: A Flexible, Open-Source Platform for Deep Learning Acceleration on Reconfigurable Hardware
By: Jan-Frederik Schulte, Benjamin Ramhorst, Chang Sun and more
Potential Business Impact:
Makes smart computer programs run super fast.
We present hls4ml, a free and open-source platform that translates machine learning (ML) models from modern deep learning frameworks into high-level synthesis (HLS) code that can be integrated into full designs for field-programmable gate arrays (FPGAs) or application-specific integrated circuits (ASICs). With its flexible and modular design, hls4ml supports a large number of deep learning frameworks and can target HLS compilers from several vendors, including Vitis HLS, Intel oneAPI, and Catapult HLS. Together with a wider ecosystem for software-hardware co-design, hls4ml has enabled the acceleration of ML inference in a wide range of commercial and scientific applications where low latency, low resource usage, and low power consumption are critical. In this paper, we describe the structure and functionality of the hls4ml platform. We discuss the overarching design considerations for the generated HLS code and present selected performance results.
Similar Papers
ForgeHLS: A Large-Scale, Open-Source Dataset for High-Level Synthesis
Hardware Architecture
Creates better computer chips from code.
wa-hls4ml: A Benchmark and Surrogate Models for hls4ml Resource and Latency Estimation
Machine Learning (CS)
Predicts computer chip needs for AI tasks.
TimelyHLS: LLM-Based Timing-Aware and Architecture-Specific FPGA HLS Optimization
Cryptography and Security
Makes computer chips faster and smaller automatically.