Addressing Reproducibility Challenges in HPC with Continuous Integration
By: Valérie Hayot-Sasson , Nathaniel Hudson , André Bauer and more
Potential Business Impact:
Makes computer programs easier to check and reuse.
The high-performance computing (HPC) community has adopted incentive structures to motivate reproducible research, with major conferences awarding badges to papers that meet reproducibility requirements. Yet, many papers do not meet such requirements. The uniqueness of HPC infrastructure and software, coupled with strict access requirements, may limit opportunities for reproducibility. In the absence of resource access, we believe that regular documented testing, through continuous integration (CI), coupled with complete provenance information, can be used as a substitute. Here, we argue that better HPC-compliant CI solutions will improve reproducibility of applications. We present a survey of reproducibility initiatives and describe the barriers to reproducibility in HPC. To address existing limitations, we present a GitHub Action, CORRECT, that enables secure execution of tests on remote HPC resources. We evaluate CORRECT's usability across three different types of HPC applications, demonstrating the effectiveness of using CORRECT for automating and documenting reproducibility evaluations.
Similar Papers
Report on Challenges of Practical Reproducibility for Systems and HPC Computer Science
Distributed, Parallel, and Cluster Computing
Makes computer science experiments easier to repeat.
Usability Evaluation of Cloud for HPC Applications
Distributed, Parallel, and Cluster Computing
Tests if cloud computers work for science.
Towards an Optimized Benchmarking Platform for CI/CD Pipelines
Distributed, Parallel, and Cluster Computing
Find software problems faster, saving computer power.