Score: 0

Hidden Leaks in Time Series Forecasting: How Data Leakage Affects LSTM Evaluation Across Configurations and Validation Strategies

Published: December 7, 2025 | arXiv ID: 2512.06932v1

By: Salma Albelali, Moataz Ahmed

Potential Business Impact:

Fixes computer predictions so they don't cheat.

Business Areas:

Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Deep learning models, particularly Long Short-Term Memory (LSTM) networks, are widely used in time series forecasting due to their ability to capture complex temporal dependencies. However, evaluation integrity is often compromised by data leakage, a methodological flaw in which input-output sequences are constructed before dataset partitioning, allowing future information to unintentionally influence training. This study investigates the impact of data leakage on performance, focusing on how validation design mediates leakage sensitivity. Three widely used validation techniques (2-way split, 3-way split, and 10-fold cross-validation) are evaluated under both leaky (pre-split sequence generation) and clean conditions, with the latter mitigating leakage risk by enforcing temporal separation during data splitting prior to sequence construction. The effect of leakage is assessed using RMSE Gain, which measures the relative increase in RMSE caused by leakage, computed as the percentage difference between leaky and clean setups. Empirical results show that 10-fold cross-validation exhibits RMSE Gain values of up to 20.5% at extended lag steps. In contrast, 2-way and 3-way splits demonstrate greater robustness, typically maintaining RMSE Gain below 5% across diverse configurations. Moreover, input window size and lag step significantly influence leakage sensitivity: smaller windows and longer lags increase the risk of leakage, whereas larger windows help reduce it. These findings underscore the need for configuration-aware, leakage-resistant evaluation pipelines to ensure reliable performance estimation.

Evaluating the Sensitivity of BiLSTM Forecasting Models to Sequence Length and Input Noise

Machine Learning (CS)

Makes computer predictions more accurate by fixing bad data.

7 Dec 2025 0

86%

Attention-Enhanced LSTM Modeling for Improved Temperature and Rainfall Forecasting in Bangladesh

Machine Learning (CS)

Predicts weather changes better for Bangladesh.

12 Oct 2025 0

86%

Forecasting Russian Equipment Losses Using Time Series and Deep Learning Models

Machine Learning (CS)

Predicts war equipment losses using computer smarts.

9 Sep 2025 0

View PDF Login to Bookmark

Country of Origin

🇸🇦 Saudi Arabia

Page Count

16 pages

Hidden Leaks in Time Series Forecasting: How Data Leakage Affects LSTM Evaluation Across Configurations and Validation Strategies

Fixes computer predictions so they don't cheat.

Technical Abstract

Evaluating the Sensitivity of BiLSTM Forecasting Models to Sequence Length and Input Noise

Attention-Enhanced LSTM Modeling for Improved Temperature and Rainfall Forecasting in Bangladesh

Forecasting Russian Equipment Losses Using Time Series and Deep Learning Models