Score: 0

DDTime: Dataset Distillation with Spectral Alignment and Information Bottleneck for Time-Series Forecasting

Published: November 20, 2025 | arXiv ID: 2511.16715v1

By: Yuqi Li , Kuiye Ding , Chuanguang Yang and more

Potential Business Impact:

Makes computer predictions faster with less data.

Business Areas:

Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Time-series forecasting is fundamental across many domains, yet training accurate models often requires large-scale datasets and substantial computational resources. Dataset distillation offers a promising alternative by synthesizing compact datasets that preserve the learning behavior of full data. However, extending dataset distillation to time-series forecasting is non-trivial due to two fundamental challenges: 1.temporal bias from strong autocorrelation, which leads to distorted value-term alignment between teacher and student models; and 2.insufficient diversity among synthetic samples, arising from the absence of explicit categorical priors to regularize trajectory variety. In this work, we propose DDTime, a lightweight and plug-in distillation framework built upon first-order condensation decomposition. To tackle Challenge 1, it revisits value-term alignment through temporal statistics and introduces a frequency-domain alignment mechanism to mitigate autocorrelation-induced bias, ensuring spectral consistency and temporal fidelity. To address Challenge 2, we further design an inter-sample regularization inspired by the information bottleneck principle, which enhances diversity and maximizes information density across synthetic trajectories. The combined objective is theoretically compatible with a wide range of condensation paradigms and supports stable first-order optimization. Extensive experiments on 20 benchmark datasets and diverse forecasting architectures demonstrate that DDTime consistently outperforms existing distillation methods, achieving about 30% relative accuracy gains while introducing about 2.49% computational overhead. All code and distilled datasets will be released.

Temporal Saliency-Guided Distillation: A Scalable Framework for Distilling Video Datasets

CV and Pattern Recognition

Makes big video files much smaller for computers.

27 May 2025 1

89%

TGDD: Trajectory Guided Dataset Distillation with Balanced Distribution

CV and Pattern Recognition

Makes computer learning faster and better.

2 Dec 2025 2

89%

TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation

Machine Learning (CS)

Makes smart computer predictions faster and smaller.

20 Feb 2025 0

View PDF Login to Bookmark

Page Count

36 pages

DDTime: Dataset Distillation with Spectral Alignment and Information Bottleneck for Time-Series Forecasting

Makes computer predictions faster with less data.

Technical Abstract

Temporal Saliency-Guided Distillation: A Scalable Framework for Distilling Video Datasets

TGDD: Trajectory Guided Dataset Distillation with Balanced Distribution

TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation