Score: 0

Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Forecasting Models

Published: October 6, 2025 | arXiv ID: 2510.04900v1

By: Nick Janßen, Melanie Schaller, Bodo Rosenhahn

Potential Business Impact:

Tests computer predictions on fake data.

Business Areas:

Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Understanding the robustness of deep learning models for multivariate long-term time series forecasting (M-LTSF) remains challenging, as evaluations typically rely on real-world datasets with unknown noise properties. We propose a simulation-based evaluation framework that generates parameterizable synthetic datasets, where each dataset instance corresponds to a different configuration of signal components, noise types, signal-to-noise ratios, and frequency characteristics. These configurable components aim to model real-world multivariate time series data without the ambiguity of unknown noise. This framework enables fine-grained, systematic evaluation of M-LTSF models under controlled and diverse scenarios. We benchmark four representative architectures S-Mamba (state-space), iTransformer (transformer-based), R-Linear (linear), and Autoformer (decomposition-based). Our analysis reveals that all models degrade severely when lookback windows cannot capture complete periods of seasonal patters in the data. S-Mamba and Autoformer perform best on sawtooth patterns, while R-Linear and iTransformer favor sinusoidal signals. White and Brownian noise universally degrade performance with lower signal-to-noise ratio while S-Mamba shows specific trend-noise and iTransformer shows seasonal-noise vulnerability. Further spectral analysis shows that S-Mamba and iTransformer achieve superior frequency reconstruction. This controlled approach, based on our synthetic and principle-driven testbed, offers deeper insights into model-specific strengths and limitations through the aggregation of MSE scores and provides concrete guidance for model selection based on signal characteristics and noise conditions.

A Multi-scale Representation Learning Framework for Long-Term Time Series Forecasting

Machine Learning (CS)

Predicts future events more accurately.

13 May 2025 0

89%

TSGym: Design Choices for Deep Multivariate Time-Series Forecasting

Machine Learning (CS)

Builds smarter computer predictions for changing data.

21 Sep 2025 2

88%

MFRS: A Multi-Frequency Reference Series Approach to Scalable and Accurate Time-Series Forecasting

Machine Learning (CS)

Predicts future events by finding hidden patterns.

11 Mar 2025 2

View PDF Login to Bookmark

Page Count

13 pages

Benchmarking M-LTSF: Frequency and Noise-Based Evaluation of Multivariate Long Time Series Forecasting Models

Tests computer predictions on fake data.

Technical Abstract

A Multi-scale Representation Learning Framework for Long-Term Time Series Forecasting

TSGym: Design Choices for Deep Multivariate Time-Series Forecasting

MFRS: A Multi-Frequency Reference Series Approach to Scalable and Accurate Time-Series Forecasting