Zenith: Scaling up Ranking Models for Billion-scale Livestreaming Recommendation
By: Ruifeng Zhang , Zexi Huang , Zikai Wang and more
Potential Business Impact:
Makes online videos recommend better for you.
Accurately capturing feature interactions is essential in recommender systems, and recent trends show that scaling up model capacity could be a key driver for next-level predictive performance. While prior work has explored various model architectures to capture multi-granularity feature interactions, relatively little attention has been paid to efficient feature handling and scaling model capacity without incurring excessive inference latency. In this paper, we address this by presenting Zenith, a scalable and efficient ranking architecture that learns complex feature interactions with minimal runtime overhead. Zenith is designed to handle a few high-dimensional Prime Tokens with Token Fusion and Token Boost modules, which exhibits superior scaling laws compared to other state-of-the-art ranking methods, thanks to its improved token heterogeneity. Its real-world effectiveness is demonstrated by deploying the architecture to TikTok Live, a leading online livestreaming platform that attracts billions of users globally. Our A/B test shows that Zenith achieves +1.05%/-1.10% in online CTR AUC and Logloss, and realizes +9.93% gains in Quality Watch Session / User and +8.11% in Quality Watch Duration / User.
Similar Papers
RankMixer: Scaling Up Ranking Models in Industrial Recommenders
Information Retrieval
Makes recommendation systems faster and cheaper.
RankMixer: Scaling Up Ranking Models in Industrial Recommenders
Information Retrieval
Makes online suggestions faster and better.
STORE: Semantic Tokenization, Orthogonal Rotation and Efficient Attention for Scaling Up Ranking Models
Information Retrieval
Makes online recommendations faster and smarter.