A Query Optimization Method Utilizing Large Language Models
By: Zhiming Yao , Haoyang Li , Jing Zhang and more
Potential Business Impact:
Makes computer searches find answers much faster.
Query optimization is a critical task in database systems, focused on determining the most efficient way to execute a query from an enormous set of possible strategies. Traditional approaches rely on heuristic search methods and cost predictions, but these often struggle with the complexity of the search space and inaccuracies in performance estimation, leading to suboptimal plan choices. This paper presents LLMOpt, a novel framework that leverages Large Language Models (LLMs) to address these challenges through two innovative components: (1) LLM for Plan Candidate Generation (LLMOpt(G)), which eliminates heuristic search by utilizing the reasoning abilities of LLMs to directly generate high-quality query plans, and (2) LLM for Plan Candidate Selection (LLMOpt(S)), a list-wise cost model that compares candidates globally to enhance selection accuracy. To adapt LLMs for query optimization, we propose fine-tuning pre-trained models using optimization data collected offline. Experimental results on the JOB, JOB-EXT, and Stack benchmarks show that LLMOpt(G) and LLMOpt(S) outperform state-of-the-art methods, including PostgreSQL, BAO, and HybridQO. Notably, LLMOpt(S) achieves the best practical performance, striking a balance between plan quality and inference efficiency.
Similar Papers
Large Scale Multi-Task Bayesian Optimization with Large Language Models
Machine Learning (CS)
AI learns from past jobs to do new ones better.
Exploiting Edited Large Language Models as General Scientific Optimizers
Optimization and Control
Helps computers solve hard science problems better.
Training-Free Query Optimization via LLM-Based Plan Similarity
Databases
Speeds up computer searches by learning from past ones.