RADAR: Recall Augmentation through Deferred Asynchronous Retrieval
By: Amit Jaspal, Qian Dang, Ajantha Ramineni
Potential Business Impact:
Finds better videos for you faster.
Modern large-scale recommender systems employ multi-stage ranking funnel (Retrieval, Pre-ranking, Ranking) to balance engagement and computational constraints (latency, CPU). However, the initial retrieval stage, often relying on efficient but less precise methods like K-Nearest Neighbors (KNN), struggles to effectively surface the most engaging items from billion-scale catalogs, particularly distinguishing highly relevant and engaging candidates from merely relevant ones. We introduce Recall Augmentation through Deferred Asynchronous Retrieval (RADAR), a novel framework that leverages asynchronous, offline computation to pre-rank a significantly larger candidate set for users using the full complexity ranking model. These top-ranked items are stored and utilized as a high-quality retrieval source during online inference, bypassing online retrieval and pre-ranking stages for these candidates. We demonstrate through offline experiments that RADAR significantly boosts recall (2X Recall@200 vs DNN retrieval baseline) by effectively combining a larger retrieved candidate set with a more powerful ranking model. Online A/B tests confirm a +0.8% lift in topline engagement metrics, validating RADAR as a practical and effective method to improve recommendation quality under strict online serving constraints.
Similar Papers
RADAR: Retrieval-Augmented Detector with Adversarial Refinement for Robust Fake News Detection
Computation and Language
Finds fake news better by teaching computers to argue.
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
Artificial Intelligence
Teaches robots to learn from past actions.
Retrieval Augmented Decision-Making: A Requirements-Driven, Multi-Criteria Framework for Structured Decision Support
Artificial Intelligence
Helps people make smart choices from complex papers.