Towards Practical and Usable In-network Classification
By: Di Zhu, Jianxi Chen, Hyojoon Kim
In-network machine learning enables real-time classification directly on network hardware, offering consistently low inference latency. However, current solutions are limited by strict hardware constraints, scarce on-device resources, and poor usability, making them impractical for ML developers and cloud operators. To this end, we propose ACORN, an end-to-end system that automates the distributed deployment of practical machine learning models across the network. ACORN provides a fully automated pipeline that loads and deploys Python ML models on network devices using an optimized deployment plan from an ILP planner. To support larger models under hardware constraints and allow runtime programmability, ACORN adopts a novel data plane representation for Decision Tree, Random Forest, and Support Vector Machine models. We implement ACORN prototype in P4 and run it on real programmable hardware. Our evaluation shows ACORN can deploy classification ML models with 2-4x more features than state-of-the-art solutions, while imposing negligible overhead on network performance and traffic. We will make our data plane program, model translator, optimizer, and all related scripts publicly available.
Similar Papers
Intrusion Detection on Resource-Constrained IoT Devices with Hardware-Aware ML and DL
Networking and Internet Architecture
Finds computer spies on small devices.
Real-time Continual Learning on Intel Loihi 2
Machine Learning (CS)
Lets AI learn new things without forgetting old ones.
Joint Optimization of DNN Model Caching and Request Routing in Mobile Edge Computing
Networking and Internet Architecture
Makes phone apps faster by splitting up smart programs.