C2:Cross learning module enhanced decision transformer with Constraint-aware loss for auto-bidding
By: Jinren Ding , Xuejian Xu , Shen Jiang and more
Potential Business Impact:
Teaches computers to bid smarter in online ads.
Decision Transformer (DT) shows promise for generative auto-bidding by capturing temporal dependencies, but suffers from two critical limitations: insufficient cross-correlation modeling among state, action, and return-to-go (RTG) sequences, and indiscriminate learning of optimal/suboptimal behaviors. To address these, we propose C2, a novel framework enhancing DT with two core innovations: (1) a Cross Learning Block (CLB) via cross-attention to strengthen inter-sequence correlation modeling; (2) a Constraint-aware Loss (CL) incorporating budget and Cost-Per-Acquisition (CPA) constraints for selective learning of optimal trajectories. Extensive offline evaluations on the AuctionNet dataset demonstrate consistent performance gains (up to 3.2% over state-of-the-art method) across diverse budget settings; ablation studies verify the complementary synergy of CLB and CL, confirming C2's superiority in auto-bidding. The code for reproducing our results is available at: https://github.com/Dingjinren/C2.
Similar Papers
C2:Cross learning module enhanced decision transformer with Constraint-aware loss for auto-bidding
Machine Learning (CS)
Helps online ads bid smarter and save money.
HALO: Hindsight-Augmented Learning for Online Auto-Bidding
Machine Learning (CS)
Helps online ads reach the right people cheaper.
Robust Adversarial Reinforcement Learning in Stochastic Games via Sequence Modeling
Machine Learning (CS)
Makes smart robots safer from tricky challenges.