Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios
By: Mohammad Rafid Ul Islam, Prasad Tadepalli, Alan Fern
Potential Business Impact:
Fixes broken data for smarter computers.
Missing values in multivariate time series data can harm machine learning performance and introduce bias. These gaps arise from sensor malfunctions, blackouts, and human error, and are typically addressed by data imputation. Previous work has tackled imputation under random missingness, complete blackouts, and forecasting scenarios. This paper addresses a more general missing pattern, which we call "partial blackout," where a subset of features is missing for consecutive time steps. We introduce a two-stage imputation process that uses self-attention and diffusion processes to model feature and temporal correlations. Notably, our model handles missing data during training, so it can be trained on incomplete datasets and still impute reliably. Our experiments on benchmark datasets and two real-world time series datasets demonstrate that our model outperforms state-of-the-art methods in partial blackout scenarios and scales better.
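To make the "partial blackout" pattern concrete, here is a minimal sketch of an observation mask in which a chosen subset of features is hidden for a run of consecutive time steps. The function name `partial_blackout_mask` and its parameters are hypothetical and chosen for illustration only; they are not taken from the paper, and the paper's own masking procedure may differ.

```python
def partial_blackout_mask(n_steps, n_features, blackout_features, start, length):
    """Return mask[t][f], True where a value is observed.

    Hypothetical helper (not from the paper): the features listed in
    `blackout_features` are hidden for `length` consecutive time steps
    beginning at time step `start`. All other entries stay observed.
    """
    if start < 0 or length < 0 or start + length > n_steps:
        raise ValueError("blackout window must lie inside the series")
    hidden = set(blackout_features)
    if not hidden <= set(range(n_features)):
        raise ValueError("blackout_features must be valid feature indices")
    return [
        [not (f in hidden and start <= t < start + length) for f in range(n_features)]
        for t in range(n_steps)
    ]


# Example: 6 time steps, 4 features; features 1 and 3 drop out for steps 2..4.
mask = partial_blackout_mask(
    n_steps=6, n_features=4, blackout_features=[1, 3], start=2, length=3
)
for row in mask:
    # 'x' marks an observed value, '.' marks a missing one.
    print("".join("x" if observed else "." for observed in row))
```

Under this sketch, passing every feature index in `blackout_features` hides whole time steps, which corresponds to a complete blackout, so the partial case can be read as the more general pattern.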
Similar Papers
Imputation of Missing Data in Smooth Pursuit Eye Movements Using a Self-Attention-based Deep Learning Approach
Machine Learning (CS)
Fixes broken eye-tracking data for better health checks.
Diffusion Transformers for Imputation: Statistical Efficiency and Uncertainty Quantification
Machine Learning (CS)
Fixes missing data in charts and graphs.
Filling the Missings: Spatiotemporal Data Imputation by Conditional Diffusion
Machine Learning (CS)
Fixes broken data for weather and traffic.