Real-Time In-Network Machine Learning on P4-Programmable FPGA SmartNICs with Fixed-Point Arithmetic and Taylor
By: Mohammad Firas Sada , John J. Graham , Mahidhar Tatineni and more
Potential Business Impact:
Lets computers learn from network traffic instantly.
As machine learning (ML) applications become integral to modern network operations, there is an increasing demand for network programmability that enables low-latency ML inference for tasks such as Quality of Service (QoS) prediction and anomaly detection in cybersecurity. ML models provide adaptability through dynamic weight adjustments, making Programming Protocol-independent Packet Processors (P4)-programmable FPGA SmartNICs an ideal platform for investigating In-Network Machine Learning (INML). These devices offer high-throughput, low-latency packet processing and can be dynamically reconfigured via the control plane, allowing for flexible integration of ML models directly at the network edge. This paper explores the application of the P4 programming paradigm to neural networks and regression models, where weights and biases are stored in control plane table lookups. This approach enables flexible programmability and efficient deployment of retrainable ML models at the network edge, independent of core infrastructure at the switch level.
Similar Papers
INSIGHT: A Survey of In-Network Systems for Intelligent, High-Efficiency AI and Topology Optimization
Networking and Internet Architecture
Computers learn faster inside the network.
FENIX: Enabling In-Network DNN Inference with FPGA-Enhanced Programmable Switches
Networking and Internet Architecture
Makes internet traffic analysis super fast and accurate.
Securing P4 Programs by Information Flow Control
Cryptography and Security
Keeps secret data safe in computer networks.