From Data to Decision: A Multi-Stage Framework for Class Imbalance Mitigation in Optical Network Failure Analysis
By: Yousuf Moiz Ali , Jaroslaw E. Prilepsky , Nicola Sambo and more
Potential Business Impact:
Finds internet problems faster and more accurately.
Machine learning-based failure management in optical networks has gained significant attention in recent years. However, severe class imbalance, where normal instances vastly outnumber failure cases, remains a considerable challenge. While pre- and in-processing techniques have been widely studied, post-processing methods are largely unexplored. In this work, we present a direct comparison of pre-, in-, and post-processing approaches for class imbalance mitigation in failure detection and identification using an experimental dataset. For failure detection, post-processing methods-particularly Threshold Adjustment-achieve the highest F1 score improvement (up to 15.3%), while Random Under-Sampling provides the fastest inference. In failure identification, GenAI methods deliver the most substantial performance gains (up to 24.2%), whereas post-processing shows limited impact in multi-class settings. When class overlap is present and latency is critical, over-sampling methods such as the SMOTE are most effective; without latency constraints, Meta-Learning yields the best results. In low-overlap scenarios, Generative AI approaches provide the highest performance with minimal inference time.
Similar Papers
Improving Slow Transfer Predictions: Generative Methods Compared
Machine Learning (CS)
Helps computers guess slow data transfers faster.
Cyber Security Data Science: Machine Learning Methods and their Performance on Imbalanced Datasets
Machine Learning (CS)
Finds computer threats faster by trying different tricks.
Enhancing IoT Cyber Attack Detection in the Presence of Highly Imbalanced Data
Machine Learning (CS)
Finds hidden internet dangers in busy networks.