Forecasting Clicks in Digital Advertising: Multimodal Inputs and Interpretable Outputs
By: Briti Gangopadhyay, Zhao Wang, Shingo Takamatsu
Potential Business Impact:
Predicts ad clicks better using words and numbers.
Forecasting click volume is a key task in digital advertising, influencing both revenue and campaign strategy. Traditional time series models rely solely on numerical data, often overlooking rich contextual information embedded in textual elements, such as keyword updates. We present a multimodal forecasting framework that combines click data with textual logs from real-world ad campaigns and generates human-interpretable explanations alongside numeric predictions. Reinforcement learning is used to improve comprehension of textual information and enhance fusion of modalities. Experiments on a large-scale industry dataset show that our method outperforms baselines in both accuracy and reasoning quality.
Similar Papers
Adaptive Information Routing for Multimodal Time Series Forecasting
Machine Learning (CS)
Helps predict prices by reading news.
Multimodal Language Models with Modality-Specific Experts for Financial Forecasting from Interleaved Sequences of Text and Time Series
Computational Engineering, Finance, and Science
Predicts stock prices better using news and numbers.
Text Reinforcement for Multimodal Time Series Forecasting
Computation and Language
Makes predictions better by improving text.