Generating CodeMeta through declarative mapping rules: An open-ended approach using ShExML
By: Herminio García-González
Potential Business Impact:
Makes research software easier to find and use.
Nowadays, software is one of the cornerstones when conducting research in several scientific fields which employ computer-based methodologies to answer new research questions. However, for these experiments to be completely reproducible, research software should comply with the FAIR principles, yet its metadata can be represented following different data models and spread across different locations. In order to bring some cohesion to the field, CodeMeta was proposed as a vocabulary to represent research software metadata in a unified and standardised manner. While existing tools can help users to generate CodeMeta files for some specific use cases, they fall short on flexibility and adaptability. Hence, in this work, I propose the use of declarative mapping rules to generate CodeMeta files, illustrated through the implementation of three crosswalks in ShExML which are then expanded and merged to cover the generation of CodeMeta files for two existing research software artefacts. Moreover, the outputs are validated using SHACL and ShEx and the whole generation workflow is automated requiring minimal user intervention upon a new version release. This work can, therefore, be used as an example upon which other developers can include a CodeMeta generation workflow in their repositories, facilitating the adoption of CodeMeta and, ultimately, increasing research software FAIRness.
Similar Papers
Generating CodeMeta using declarative mapping rules: An open-ended approach using ShExML
Digital Libraries
Makes science computer programs easier to share.
SMECS: A Software Metadata Extraction and Curation Software
Software Engineering
Helps scientists easily share their computer programs.
AI-assisted JSON Schema Creation and Mapping
Software Engineering
Helps anyone create data rules with words.