Improving Internet Traffic Matrix Prediction via Time Series Clustering
By: Martha Cash, Alexander Wyglinski
Potential Business Impact:
Helps network operators route traffic more efficiently by predicting traffic demand more accurately.
We present a novel framework that leverages time series clustering to improve internet traffic matrix (TM) prediction using deep learning (DL) models. Traffic flows within a TM often exhibit diverse temporal behaviors, which can hinder prediction accuracy when training a single model across all flows. To address this, we propose two clustering strategies, source clustering and histogram clustering, that group flows with similar temporal patterns prior to model training. Clustering creates more homogeneous data subsets, enabling models to capture underlying patterns more effectively and generalize better than global prediction approaches that fit a single model to the entire TM. Compared to existing TM prediction methods, our method reduces RMSE by up to 92% for Abilene and 75% for GÉANT. In routing scenarios, our clustered predictions also reduce maximum link utilization (MLU) bias by 18% and 21%, respectively, demonstrating the practical benefits of clustering when TMs are used for network optimization.
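The idea of grouping flows by temporal behavior before training can be illustrated with a small sketch. The abstract does not specify how histogram clustering is computed, so everything below is an assumption made for illustration: each flow is summarized by a normalized histogram of its own values, and the histograms are grouped with a plain k-means using a deterministic farthest-first initialization. The function names (`flow_histogram`, `kmeans`, `cluster_flows`) and the synthetic flows are hypothetical and are not taken from the paper.

```python
import math


def flow_histogram(series, bins=10):
    """Normalized histogram of one flow, scaled by the flow's own min/max (assumed feature)."""
    lo, hi = min(series), max(series)
    width = (hi - lo) / bins or 1.0  # constant flows get a dummy width
    counts = [0] * bins
    for v in series:
        counts[min(int((v - lo) / width), bins - 1)] += 1
    return [c / len(series) for c in counts]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_first_centers(points, k):
    """Deterministic init: start at the first point, then repeatedly add the farthest point."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centers)))
    return centers


def kmeans(points, k, iters=20):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    centers = farthest_first_centers(points, k)
    labels = []
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda c: sq_dist(p, centers[c])) for p in points]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def cluster_flows(tm, k, bins=10):
    """Group flow ids of a TM (dict: flow id -> list of volumes) by histogram similarity."""
    flow_ids = sorted(tm)
    labels = kmeans([flow_histogram(tm[f], bins) for f in flow_ids], k)
    groups = {}
    for f, lab in zip(flow_ids, labels):
        groups.setdefault(lab, []).append(f)
    return groups


# Synthetic TM (hypothetical data): three smooth daily-cycle flows and three spiky flows.
tm = {}
for i in range(3):
    tm[f"diurnal_{i}"] = [100 + 50 * math.sin(2 * math.pi * t / 24 + i) for t in range(96)]
    tm[f"bursty_{i}"] = [500.0 if t % 24 == i else 10.0 for t in range(96)]

groups = cluster_flows(tm, k=2)
for label, flows in sorted(groups.items()):
    print(label, flows)
```

In a pipeline of the kind the abstract describes, each resulting group would then be used to train its own prediction model, rather than fitting one model to all flows. That training step is omitted here.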