Data Compression for Time Series Modelling: A Case Study of Smart Grid Demand Forecasting
By: Mikkel Bue Lykkegaard , Svend Vendelbo Nielsen , Akanksha Upadhyay and more
Potential Business Impact:
Shrinks energy data without losing prediction power.
Efficient time series forecasting is essential for smart energy systems, enabling accurate predictions of energy demand, renewable resource availability, and grid stability. However, the growing volume of high-frequency data from sensors and IoT devices poses challenges for storage and transmission. This study explores Discrete Wavelet Transform (DWT)-based data compression as a solution to these challenges while ensuring forecasting accuracy. A case study of a seawater supply system in Hirtshals, Denmark, operating under dynamic weather, operational schedules, and seasonal trends, is used for evaluation. Biorthogonal wavelets of varying orders were applied to compress data at different rates. Three forecasting models - Ordinary Least Squares (OLS), XGBoost, and the Time Series Dense Encoder (TiDE) - were tested to assess the impact of compression on forecasting performance. Lossy compression rates up to $r_{\mathrm{lossy}} = 0.999$ were analyzed, with the Normalized Mutual Information (NMI) metric quantifying the relationship between compression and information retention. Results indicate that wavelet-based compression can retain essential features for accurate forecasting when applied carefully. XGBoost proved highly robust to compression artifacts, maintaining stable performance across diverse compression rates. In contrast, OLS demonstrated sensitivity to smooth wavelets and high compression rates, while TiDE showed some variability but remained competitive. This study highlights the potential of wavelet-based compression for scalable, efficient data management in smart energy systems without sacrificing forecasting accuracy. The findings are relevant to other fields requiring high-frequency time series forecasting, including climate modeling, water supply systems, and industrial operations.
Similar Papers
Lossless Compression of Time Series Data: A Comparative Study
Information Theory
Makes storing and sending data much smaller.
Lossless Compression: A New Benchmark for Time Series Model Evaluation
Machine Learning (CS)
Tests computer models by how well they shrink data.
Semantic Compression for Word and Sentence Embeddings using Discrete Wavelet Transform
Computation and Language
Makes computer language understanding smaller, faster, better.