Score: 0

Managing, Analyzing and Sharing Research Data with Gen3 Data Commons

Published: August 7, 2025 | arXiv ID: 2508.04944v1

By: Craig Barnes , Kyle Burton , Michael S. Fitzsimons and more

Potential Business Impact:

Lets scientists easily share and study huge amounts of data.

Gen3 is an open-source data platform for building data commons. A data commons is a cloud-based data platform for managing, analyzing, and sharing data with a research community. Gen3 has been used to build over a dozen data commons that in aggregate contain over 28 PB of data and 64 million FAIR data objects. To set up a Gen3 data commons, you first define a data model. Gen3 then autogenerates 1) a data portal for searching and exploring data in the commons; 2) a data portal for submitting data to the commons; and 3) FAIR APIs for accessing the data programmatically. Gen3 is built over a small number of standards-based software services, which are designed to support current and future Gen3 components so that Gen3 can interoperate with other data platforms and data ecosystems.

Page Count
20 pages

Category
Computer Science:
Distributed, Parallel, and Cluster Computing