Score: 1

Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models

Published: March 14, 2025 | arXiv ID: 2503.11411v1

By: Xu Liu , Taha Aksu , Juncheng Liu and more

BigTech Affiliations: Salesforce Research

Potential Business Impact:

Creates fake data to train smart computer programs.

Business Areas:
Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Time series analysis is crucial for understanding dynamics of complex systems. Recent advances in foundation models have led to task-agnostic Time Series Foundation Models (TSFMs) and Large Language Model-based Time Series Models (TSLLMs), enabling generalized learning and integrating contextual information. However, their success depends on large, diverse, and high-quality datasets, which are challenging to build due to regulatory, diversity, quality, and quantity constraints. Synthetic data emerge as a viable solution, addressing these challenges by offering scalable, unbiased, and high-quality alternatives. This survey provides a comprehensive review of synthetic data for TSFMs and TSLLMs, analyzing data generation strategies, their role in model pretraining, fine-tuning, and evaluation, and identifying future research directions.

Country of Origin
🇺🇸 United States

Page Count
11 pages

Category
Computer Science:
Machine Learning (CS)