Predicting the Performance of Scientific Workflow Tasks for Cluster Resource Management: An Overview of the State of the Art
By: Jonathan Bader , Kathleen West , Soeren Becker and more
Potential Business Impact:
Helps computers guess how long jobs will take.
Scientific workflow management systems support large-scale data analysis on cluster infrastructures. For this, they interact with resource managers which schedule workflow tasks onto cluster nodes. In addition to workflow task descriptions, resource managers rely on task performance estimates such as main memory consumption and runtime to efficiently manage cluster resources. Such performance estimates should be automated, as user-based task performance estimates are error-prone. In this book chapter, we describe key characteristics of methods for workflow task runtime and memory prediction, provide an overview and a detailed comparison of state-of-the-art methods from the literature, and discuss how workflow task performance prediction is useful for scheduling, energy-efficient and carbon-aware computing, and cost prediction.
Similar Papers
Machine Learning-Driven Predictive Resource Management in Complex Science Workflows
Distributed, Parallel, and Cluster Computing
Predicts computer needs for science experiments.
Energy-Aware Workflow Execution: An Overview of Techniques for Saving Energy and Emissions in Scientific Compute Clusters
Distributed, Parallel, and Cluster Computing
Makes science computers use less energy and pollution.
Exploring the Potential of Carbon-Aware Execution for Scientific Workflows
Distributed, Parallel, and Cluster Computing
Saves energy and cuts pollution from science computers.