C2:Cross learning module enhanced decision transformer with Constraint-aware loss for auto-bidding
By: Jinren Ding , Xuejian Xu , Shen Jiang and more
Potential Business Impact:
Helps online ads bid smarter and save money.
Decision Transformer (DT) shows promise for generative auto-bidding by capturing temporal dependencies, but suffers from two critical limitations: insufficient cross-correlation modeling among state, action, and return-to-go (RTG) sequences, and indiscriminate learning of optimal/suboptimal behaviors. To address these, we propose C2, a novel framework enhancing DT with two core innovations: (1) a Cross Learning Block (CLB) via cross-attention to strengthen inter-sequence correlation modeling; (2) a Constraint-aware Loss (CL) incorporating budget and Cost-Per-Acquisition (CPA) constraints for selective learning of optimal trajectories. Extensive offline evaluations on the AuctionNet dataset demonstrate consistent performance gains (up to 3.23\% over state-of-the-art GAVE) across diverse budget settings; ablation studies verify the complementary synergy of CLB and CL, confirming C2's superiority in auto-bidding. The code for reproducing our results is available at: https://github.com/Dingjinren/C2.
Similar Papers
C2:Cross learning module enhanced decision transformer with Constraint-aware loss for auto-bidding
Machine Learning (CS)
Teaches computers to bid smarter in online ads.
HALO: Hindsight-Augmented Learning for Online Auto-Bidding
Machine Learning (CS)
Helps online ads reach the right people cheaper.
Q-Regularized Generative Auto-Bidding: From Suboptimal Trajectories to Optimal Policies
Machine Learning (CS)
Helps online ads make more money for sellers.