Segmentation over Complexity: Evaluating Ensemble and Hybrid Approaches for Anomaly Detection in Industrial Time Series
By: Emilio Mastriani , Alessandro Costa , Federico Incardona and more
Potential Business Impact:
Finds problems in machines before they break.
In this study, we investigate the effectiveness of advanced feature engineering and hybrid model architectures for anomaly detection in a multivariate industrial time series, focusing on a steam turbine system. We evaluate the impact of change point-derived statistical features, clustering-based substructure representations, and hybrid learning strategies on detection performance. Despite their theoretical appeal, these complex approaches consistently underperformed compared to a simple Random Forest + XGBoost ensemble trained on segmented data. The ensemble achieved an AUC-ROC of 0.976, F1-score of 0.41, and 100% early detection within the defined time window. Our findings highlight that, in scenarios with highly imbalanced and temporally uncertain data, model simplicity combined with optimized segmentation can outperform more sophisticated architectures, offering greater robustness, interpretability, and operational utility.
Similar Papers
Improving Anomaly Detection in Industrial Time Series: The Role of Segmentation and Heterogeneous Ensemble
Machine Learning (CS)
Finds problems in factory machines early.
Hybrid Ensemble Method for Detecting Cyber-Attacks in Water Distribution Systems Using the BATADAL Dataset
Cryptography and Security
Finds computer attacks on water pipes.
Multivariate Time Series Anomaly Detection in Industry 5.0
Machine Learning (CS)
Finds factory problems even with messy data.