Beyond Citations: A Cross-Domain Metric for Dataset Impact and Shareability
By: Smitha Muthya Sudheendra , Zhongxing Zhang , Wenwen Cao and more
Potential Business Impact:
Measures how much research data is actually used.
The scientific community increasingly relies on open data sharing, yet existing metrics inadequately capture the true impact of datasets as research outputs. Traditional measures, such as the h-index, focus on publications and citations but fail to account for dataset accessibility, reuse, and cross-disciplinary influence. We propose the X-index, a novel author-level metric that quantifies the value of data contributions through a two-step process: (i) computing a dataset-level value score (V-score) that integrates breadth of reuse, FAIRness, citation impact, and transitive reuse depth, and (ii) aggregating V-scores into an author-level X-index. Using datasets from computational social science, medicine, and crisis communication, we validate our approach against expert ratings, achieving a strong correlation. Our results demonstrate that the X-index provides a transparent, scalable, and low-cost framework for assessing data-sharing practices and incentivizing open science. The X-index encourages sustainable data-sharing practices and gives institutions, funders, and platforms a tangible way to acknowledge the lasting influence of research datasets.
Similar Papers
Distinguishing True Influence from Hyperprolificity with Citation Distance
Digital Libraries
Measures scientists' true impact, not just how much they publish.
The Community Index: A More Comprehensive Approach to Assessing Scholarly Impact
Digital Libraries
Measures research impact better by valuing diverse teams.
Linking Data Citation to Repository Visibility: An Empirical Study
Digital Libraries
Makes research data easier to find and cite.