Dynamic Design of Machine Learning Pipelines via Metalearning
By: Edesio Alcobaça, André C. P. L. F. de Carvalho
Potential Business Impact:
Makes smart computer programs learn faster and better.
Automated machine learning (AutoML) has democratized the design of machine-learning-based systems by automating model selection, hyperparameter tuning, and feature engineering. However, the high computational cost of traditional search and optimization strategies, such as Random Search, Particle Swarm Optimization, and Bayesian Optimization, remains a significant challenge. Moreover, AutoML systems typically explore a large search space, which can lead to overfitting. This paper introduces a metalearning method for dynamically designing search spaces for AutoML systems. The proposed method uses historical metaknowledge to select promising regions of the search space, accelerating the optimization process. In the experiments conducted for this study, the proposed method reduced runtime by 89% in Random Search and reduced the search space (1.8/13 preprocessors and 4.3/16 classifiers) without significantly compromising predictive performance. The proposed method also showed competitive performance when adapted to Auto-Sklearn, reducing its search space. Finally, the study offers insights into meta-feature selection, meta-model explainability, and the trade-offs inherent in search space reduction strategies.
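The idea of using historical metaknowledge to narrow the search space before optimization can be illustrated with a minimal sketch. This is not the authors' method: the nearest-neighbour lookup over meta-features, the `HISTORY` table, the classifier names, the `reduce_search_space` and `random_search` helpers, and the toy scoring function are all hypothetical and chosen only to show the general shape of the approach.

```python
import math
import random

# Hypothetical metaknowledge: for each past dataset, a meta-feature vector
# and the classifiers that performed well on it (illustrative values only).
HISTORY = [
    ((0.1, 0.9), {"random_forest", "gradient_boosting"}),
    ((0.2, 0.8), {"random_forest", "svm"}),
    ((0.9, 0.1), {"knn", "naive_bayes"}),
    ((0.8, 0.2), {"knn", "logistic_regression"}),
]

# Hypothetical full search space of candidate classifiers.
FULL_SPACE = [
    "random_forest", "gradient_boosting", "svm",
    "knn", "naive_bayes", "logistic_regression",
]


def reduce_search_space(meta_features, history, k=2):
    """Return the union of classifiers that did well on the k most similar past datasets."""
    nearest = sorted(history, key=lambda item: math.dist(meta_features, item[0]))[:k]
    selected = set()
    for _, good_classifiers in nearest:
        selected |= good_classifiers
    return sorted(selected)


def random_search(space, evaluate, n_iter, seed=0):
    """Plain random search: sample candidates from `space` and keep the best score."""
    rng = random.Random(seed)
    best, best_score = None, float("-inf")
    for _ in range(n_iter):
        candidate = rng.choice(space)
        score = evaluate(candidate)
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score


# Usage: describe a new dataset by its meta-features, shrink the space, then search.
new_dataset_meta_features = (0.15, 0.85)
reduced_space = reduce_search_space(new_dataset_meta_features, HISTORY, k=2)

# Toy stand-in for training and validating a pipeline (an assumption, not a real metric).
toy_score = lambda name: len(name) / 100.0
best, best_score = random_search(reduced_space, toy_score, n_iter=10)
```

In this sketch the optimizer only samples from the reduced space, so each evaluation is spent on candidates that worked on similar datasets. The trade-off is that a classifier excluded by the metaknowledge can never be selected, however well it might have performed.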
Similar Papers
Meta-Learning and Synthetic Data for Automated Pretraining and Finetuning
Machine Learning (CS)
Helps computers learn faster with less data.
An experimental survey and Perspective View on Meta-Learning for Automated Algorithms Selection and Parametrization
Machine Learning (CS)
Helps computers pick the best tools for jobs.
XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML
Computation and Language
Makes AI learn faster and use less power.