A Systematic Evaluation of the Potential of Carbon-Aware Execution for Scientific Workflows
By: Kathleen West , Youssef Moawad , Fabian Lehmann and more
Potential Business Impact:
Saves energy by running science tasks smarter.
Scientific workflows are widely used to automate scientific data analysis and often involve computationally intensive processing of large datasets on compute clusters. As such, their execution tends to be long-running and resource-intensive, resulting in substantial energy consumption and, depending on the energy mix, carbon emissions. Meanwhile, a wealth of carbon-aware computing methods have been proposed, yet little work has focused specifically on scientific workflows, even though they present a substantial opportunity for carbon-aware computing because they are often significantly delay tolerant, efficiently interruptible, highly scalable and widely heterogeneous. In this study, we first exemplify the problem of carbon emissions associated with running scientific workflows, and then show the potential for carbon-aware workflow execution. For this, we estimate the carbon footprint of seven real-world Nextflow workflows executed on different cluster infrastructures using both average and marginal carbon intensity data. Furthermore, we systematically evaluate the impact of carbon-aware temporal shifting, and the pausing and resuming of the workflow. Moreover, we apply resource scaling to workflows and workflow tasks. Finally, we report the potential reduction in overall carbon emissions, with temporal shifting capable of decreasing emissions by over 80%, and resource scaling capable of decreasing emissions by 67%.
Similar Papers
Exploring the Potential of Carbon-Aware Execution for Scientific Workflows
Distributed, Parallel, and Cluster Computing
Saves energy and cuts pollution from science computers.
Energy-Aware Workflow Execution: An Overview of Techniques for Saving Energy and Emissions in Scientific Compute Clusters
Distributed, Parallel, and Cluster Computing
Makes science computers use less energy and pollution.
Quantifying the Carbon Reduction of DAG Workloads: A Job Shop Scheduling Perspective
Distributed, Parallel, and Cluster Computing
Smarter computer jobs save energy and cut pollution.