Source Coverage and Citation Bias in LLM-based vs. Traditional Search Engines
By: Peixian Zhang , Qiming Ye , Zifan Peng and more
LLM-based Search Engines (LLM-SEs) introduces a new paradigm for information seeking. Unlike Traditional Search Engines (TSEs) (e.g., Google), these systems summarize results, often providing limited citation transparency. The implications of this shift remain largely unexplored, yet raises key questions regarding trust and transparency. In this paper, we present a large-scale empirical study of LLM-SEs, analyzing 55,936 queries and the corresponding search results across six LLM-SEs and two TSEs. We confirm that LLM-SEs cites domain resources with greater diversity than TSEs. Indeed, 37% of domains are unique to LLM-SEs. However, certain risks still persist: LLM-SEs do not outperform TSEs in credibility, political neutrality and safety metrics. Finally, to understand the selection criteria of LLM-SEs, we perform a feature-based analysis to identify key factors influencing source choice. Our findings provide actionable insights for end users, website owners, and developers.
Similar Papers
Evolving Paradigms in Task-Based Search and Learning: A Comparative Analysis of Traditional Search Engine with LLM-Enhanced Conversational Search System
Information Retrieval
Helps people learn better using smart computer answers.
When Content is Goliath and Algorithm is David: The Style and Semantic Effects of Generative Search Engine
Information Retrieval
Helps AI search engines show better, faster results.
Auditing LLM Editorial Bias in News Media Exposure
Computers and Society
AI news tools show different opinions than Google.