Score: 1

FaCTR: Factorized Channel-Temporal Representation Transformers for Efficient Time Series Forecasting

Published: June 5, 2025 | arXiv ID: 2506.05597v1

By: Yash Vijay, Harini Subramanyan

BigTech Affiliations: C3.ai

Potential Business Impact:

Forecasts multivariate time series more accurately with a far smaller, cheaper-to-run model.

Business Areas:
Data and Analytics, Software

While Transformers excel in language and vision, where inputs are semantically rich and exhibit univariate dependency structures, their architectural complexity yields diminishing returns in time series forecasting. Time series data is characterized by low per-timestep information density and complex dependencies across channels and covariates, and therefore requires conditioning on structured variable interactions. To address this mismatch and the resulting overparameterization, we propose FaCTR, a lightweight spatiotemporal Transformer with an explicitly structural design. FaCTR injects dynamic, symmetric cross-channel interactions, modeled via a low-rank Factorization Machine, into temporally contextualized patch embeddings through a learnable gating mechanism. It further encodes static and dynamic covariates for multivariate conditioning. Despite its compact design, FaCTR achieves state-of-the-art performance on eleven public forecasting benchmarks spanning both short-term and long-term horizons, with its largest variant using only about 400K parameters, on average 50x smaller than competitive spatiotemporal Transformer baselines. In addition, its structured design enables interpretability through cross-channel influence scores, an essential requirement for real-world decision-making. Finally, FaCTR supports self-supervised pretraining, positioning it as a compact yet versatile foundation for downstream time series tasks.
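To make the core mechanism concrete, below is a minimal PyTorch sketch of the idea the abstract describes: a low-rank Factorization Machine scores symmetric channel-pair interactions, and a learnable gate blends the channel-mixed features back into the patch embeddings. All names, shapes, and hyperparameters (`LowRankChannelMixer`, `rank`, the residual-gating form) are assumptions for illustration, not the authors' implementation.

```python
import torch
import torch.nn as nn


class LowRankChannelMixer(nn.Module):
    """Hypothetical sketch of FaCTR-style cross-channel mixing.

    A low-rank Factorization Machine assigns each channel a latent
    factor; inner products of factors give symmetric interaction
    scores, and a learnable gate injects the mixed features into the
    temporally contextualized patch embeddings.
    """

    def __init__(self, num_channels: int, d_model: int, rank: int = 8):
        super().__init__()
        # One rank-dimensional latent factor per channel (FM-style).
        self.factors = nn.Parameter(torch.randn(num_channels, rank) * 0.02)
        # Learnable gate deciding how much mixed signal to admit.
        self.gate = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, patches, d_model)
        # Symmetric FM scores A[i, j] = <v_i, v_j>; row-wise softmax
        # normalizes how much each channel attends to the others.
        scores = self.factors @ self.factors.T        # (C, C), symmetric
        weights = torch.softmax(scores, dim=-1)       # row-normalized
        mixed = torch.einsum("ij,bjpd->bipd", weights, x)
        # Gated residual injection into the patch embeddings.
        g = torch.sigmoid(self.gate(x))
        return x + g * mixed
```

The `weights` matrix in this sketch is also where the interpretability claim would come from: its entries can be read directly as cross-channel influence scores, and the single rank-`r` factor table keeps the parameter count low, consistent with the paper's compactness argument.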

Country of Origin
πŸ‡ΊπŸ‡Έ United States

Page Count
27 pages

Category
Computer Science:
Machine Learning (CS)