Exploring the Potential of Carbon-Aware Execution for Scientific Workflows
By: Kathleen West , Fabian Lehmann , Vasilis Bountris and more
Potential Business Impact:
Saves energy and cuts pollution from science computers.
Scientific workflows are widely used to automate scientific data analysis and often involve processing large quantities of data on compute clusters. As such, their execution tends to be long-running and resource intensive, leading to significant energy consumption and carbon emissions. Meanwhile, a wealth of carbon-aware computing methods have been proposed, yet little work has focused specifically on scientific workflows, even though they present a substantial opportunity for carbon-aware computing because they are inherently delay tolerant, efficiently interruptible, and highly scalable. In this study, we demonstrate the potential for carbon-aware workflow execution. For this, we estimate the carbon footprint of two real-world Nextflow workflows executed on cluster infrastructure. We use a linear power model for energy consumption estimates and real-world average and marginal CI data for two regions. We evaluate the impact of carbon-aware temporal shifting, pausing and resuming, and resource scaling. Our findings highlight significant potential for reducing emissions of workflows and workflow tasks.
Similar Papers
A Systematic Evaluation of the Potential of Carbon-Aware Execution for Scientific Workflows
Distributed, Parallel, and Cluster Computing
Saves energy by running science tasks smarter.
Energy-Aware Workflow Execution: An Overview of Techniques for Saving Energy and Emissions in Scientific Compute Clusters
Distributed, Parallel, and Cluster Computing
Makes science computers use less energy and pollution.
Quantifying the Carbon Reduction of DAG Workloads: A Job Shop Scheduling Perspective
Distributed, Parallel, and Cluster Computing
Smarter computer jobs save energy and cut pollution.