Addressing Personalized Bias for Unbiased Learning to Rank
By: Zechun Niu, Lang Mei, Liu Yang and more
Potential Business Impact:
Helps search engines show better results for everyone.
Unbiased learning to rank (ULTR), which aims to learn unbiased ranking models from biased user behavior logs, plays an important role in Web search. Previous research on ULTR has studied a variety of biases in users' clicks, such as position bias, presentation bias, and outlier bias. However, existing work often assumes that the behavior logs are collected from an "average" user, neglecting differences among users in their search and browsing behaviors. In this paper, we introduce personalized factors into the ULTR framework, which we term the user-aware ULTR problem. Through a formal causal analysis of this problem, we demonstrate that existing user-oblivious methods are biased when users differ in their preferences over queries and in their personalized propensities of examining documents. To address this personalized bias, we propose a novel user-aware inverse-propensity-score estimator for learning-to-rank objectives. Specifically, our approach models the distribution of user browsing behaviors for each query and aggregates user-weighted examination probabilities to determine propensities. We theoretically prove that the user-aware estimator is unbiased under some mild assumptions and has lower variance than the straightforward approach of calculating a user-dependent propensity for each impression. Finally, we empirically verify the effectiveness of our user-aware estimator by conducting extensive experiments on two semi-synthetic datasets and a real-world dataset.
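The aggregation step described in the abstract can be illustrated with a minimal sketch. This is one reading of the abstract, not the paper's implementation: it assumes the propensity for a query at rank k is the sum over users of P(user | query) times that user's examination probability at rank k, and that clicked documents are then reweighted by the inverse of this aggregated propensity. The function names (`aggregated_propensity`, `ips_loss`) and all numbers are hypothetical.

```python
import numpy as np

def aggregated_propensity(user_probs, exam_probs):
    """Aggregate user-weighted examination probabilities for one query.

    user_probs: shape (U,), assumed P(user | query), summing to 1.
    exam_probs: shape (U, K), assumed P(examine rank k | user).
    Returns an array of shape (K,) with one propensity per rank.
    """
    user_probs = np.asarray(user_probs, dtype=float)
    exam_probs = np.asarray(exam_probs, dtype=float)
    # Weighted average over users: sum_u P(u | q) * P(examine k | u)
    return user_probs @ exam_probs

def ips_loss(clicks, per_doc_loss, propensities):
    """Inverse-propensity-scored loss: clicked documents only,
    each weighted by 1 / propensity of its rank."""
    clicks = np.asarray(clicks, dtype=float)
    per_doc_loss = np.asarray(per_doc_loss, dtype=float)
    propensities = np.asarray(propensities, dtype=float)
    return float(np.sum(clicks * per_doc_loss / propensities))

# Hypothetical query issued by two user types with different browsing depth.
user_probs = np.array([0.75, 0.25])          # illustrative P(user | query)
exam_probs = np.array([[1.0, 0.5, 0.25],     # a user who rarely scrolls
                       [1.0, 0.9, 0.8]])     # a user who reads further down
propensities = aggregated_propensity(user_probs, exam_probs)

clicks = [1, 0, 1]                # clicks observed at ranks 1 and 3
per_doc_loss = [0.2, 0.5, 0.3]    # illustrative per-document ranking losses
loss = ips_loss(clicks, per_doc_loss, propensities)
```

In this sketch every impression of the query shares the same aggregated propensity per rank, whereas the per-impression alternative mentioned in the abstract would divide by the individual user's row of `exam_probs`. Small individual examination probabilities (such as 0.25 at rank 3) then produce large weights, which gives some intuition for the variance difference the abstract claims.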
Similar Papers
LLMs for estimating positional bias in logged interaction data
Information Retrieval
Makes online lists show better, fairer results.
RewardRank: Optimizing True Learning-to-Rank Utility
Information Retrieval
Shows online stores what shoppers really want.
User Invariant Preference Learning for Multi-Behavior Recommendation
Information Retrieval
Finds what you *really* like for better suggestions.