This is Going to Sound Crazy, But What If We Used Large Language Models to Boost Automatic Database Tuning Algorithms By Leveraging Prior History? We Will Find Better Configurations More Quickly Than Retraining From Scratch!
By: William Zhang, Wan Shen Lim, Andrew Pavlo
Potential Business Impact:
Helps computer programs run faster when things change.
Tuning database management systems (DBMSs) is challenging due to trillions of possible configurations and evolving workloads. Recent advances in tuning have led to breakthroughs in optimizing over the possible configurations. However, because of their design and their inability to leverage query-level historical insights, existing automated tuners struggle to adapt and re-optimize the DBMS when the environment changes (e.g., workload drift, cross-schema transfer). This paper presents the Booster framework, which assists existing tuners in adapting to such changes. Booster structures historical artifacts into query-configuration contexts, prompts large language models (LLMs) to suggest configurations for each query based on relevant contexts, and then composes the query-level suggestions into a holistic configuration with beam search. Using multiple OLAP workloads, we evaluate Booster's ability to assist different state-of-the-art tuners (e.g., cost-based, machine-learning-based, and LLM-based) in adapting to environment changes. By composing recommendations derived from query-level insights, Booster helps tuners discover configurations that are up to 74% better, in up to 4.7x less time, than the alternative approach of continuing to tune from historical configurations.
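The composition step described above can be illustrated with a minimal sketch. It is not Booster's implementation: the abstract does not specify how candidates are scored or merged, so the `beam_compose` function, the `toy_cost` scoring function, the knob names, and the rule that a later suggestion overrides an earlier one on conflict are all assumptions made for illustration. Here each query contributes a list of candidate knob settings (standing in for LLM suggestions), and beam search keeps only the `beam_width` cheapest merged configurations after each query.

```python
from typing import Callable, Dict, List

Config = Dict[str, int]  # knob name -> value (hypothetical knobs)


def beam_compose(
    suggestions: List[List[Config]],
    cost: Callable[[Config], float],
    beam_width: int = 2,
) -> Config:
    """Merge per-query suggestions into one configuration via beam search."""
    beam: List[Config] = [{}]  # start from an empty (all-default) configuration
    for per_query in suggestions:
        candidates: List[Config] = []
        for partial in beam:
            for suggestion in per_query:
                # Assumed merge rule: the newer suggestion wins on conflicts.
                merged = {**partial, **suggestion}
                if merged not in candidates:
                    candidates.append(merged)
        if not candidates:
            continue  # no suggestions for this query: keep the beam as-is
        candidates.sort(key=cost)
        beam = candidates[:beam_width]  # prune to the cheapest few
    return min(beam, key=cost)


def toy_cost(config: Config) -> float:
    """Stand-in for a workload cost estimate; lower is better (assumed)."""
    work_mem = config.get("work_mem_mb", 4)
    workers = config.get("parallel_workers", 0)
    return abs(work_mem - 64) + 10 * abs(workers - 4)


# Two queries, each with two candidate suggestions.
per_query_suggestions = [
    [{"work_mem_mb": 16}, {"work_mem_mb": 64}],
    [{"parallel_workers": 2}, {"parallel_workers": 4, "work_mem_mb": 32}],
]
best = beam_compose(per_query_suggestions, toy_cost, beam_width=2)
print(best)
```

In a real tuner the cost function would presumably come from optimizer estimates or measured runtimes rather than a closed-form expression, and a wider beam trades more evaluations for a lower chance of pruning a good combination early.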
Similar Papers
Automated Algorithm Design for Auto-Tuning Optimizers
Machine Learning (CS)
Makes computer programs run much faster automatically.
Can Large Language Models Be Query Optimizer for Relational Databases?
Databases
Makes computer databases find information faster.
LLM4Hint: Leveraging Large Language Models for Hint Recommendation in Offline Query Optimization
Databases
Helps computers understand data questions faster.