Score: 2

An Empirical Evaluation of Modern MLOps Frameworks

Published: January 28, 2026 | arXiv ID: 2601.20415v1

By: Jon Marcos-Mercadé, Unai Lopez-Novoa, Mikel Egaña Aranguren

Potential Business Impact:

Helps pick the best AI tools for jobs.

Business Areas:
Machine Learning Artificial Intelligence, Data and Analytics, Software

Given the increasing adoption of AI solutions in professional environments, it is necessary for developers to be able to make informed decisions about the current tool landscape. This work empirically evaluates various MLOps (Machine Learning Operations) tools to facilitate the management of the ML model lifecycle: MLflow, Metaflow, Apache Airflow, and Kubeflow Pipelines. The tools are evaluated by assessing the criteria of Ease of installation, Configuration flexibility, Interoperability, Code instrumentation complexity, result interpretability, and Documentation when implementing two common ML scenarios: Digit classifier with MNIST and Sentiment classifier with IMDB and BERT. The evaluation is completed by providing weighted results that lead to practical conclusions on which tools are best suited for different scenarios.

Country of Origin
🇪🇸 Spain


Page Count
30 pages

Category
Computer Science:
Software Engineering