Evaluating Learned Query Performance Prediction Models at LinkedIn: Challenges, Opportunities, and Findings

Published: April 24, 2025 | arXiv ID: 2504.17181v1

By: Chujun Song, Slim Bouguerra, Erik Krogen, and more

BigTech Affiliations: LinkedIn

Potential Business Impact:

Helps computers guess how long database tasks take.

Business Areas:

Predictive Analytics, Artificial Intelligence, Data and Analytics, Software

Recent advancements in learning-based query performance prediction models have demonstrated remarkable efficacy. However, these models are predominantly validated on synthetic datasets focused on cardinality or latency estimation. This paper explores the application of these models to LinkedIn's complex real-world OLAP queries executed on Trino, addressing four primary research questions: (1) How do these models perform on real-world industrial data with limited information? (2) Can these models generalize to new tasks, such as CPU time prediction and classification? (3) What additional information available from the query plan could these models use to enhance their performance? (4) What are the theoretical performance limits of these models given the available data? To address these questions, we evaluate several models, including TLSTM, TCNN, QueryFormer, and XGBoost, against the industrial query workload at LinkedIn, and extend our analysis to CPU time regression and classification tasks. We also propose a multi-task learning approach to incorporate underutilized operator-level metrics that could enhance model understanding. Additionally, we empirically analyze the inherent upper bound on the performance these models can achieve.

Redefining Cost Estimation in Database Systems: The Role of Execution Plan Features and Machine Learning

Databases

Helps computers guess how long database tasks take.

7 Oct 2025 0

87%

Are We Really Measuring Progress? Transferring Insights from Evaluating Recommender Systems to Temporal Link Prediction

Machine Learning (CS)

Makes computer predictions more trustworthy.

14 Jun 2025 1

86%

Deep Learning Model Acceleration and Optimization Strategies for Real-Time Recommendation Systems

Information Retrieval

Makes online recommendations faster and better.

13 Jun 2025 0

Country of Origin

🇺🇸 United States

Page Count

14 pages
