Score: 0

Beyond Citations: A Cross-Domain Metric for Dataset Impact and Shareability

Published: November 17, 2025 | arXiv ID: 2511.12966v1

By: Smitha Muthya Sudheendra , Zhongxing Zhang , Wenwen Cao and more

Potential Business Impact:

Measures how much research data is actually used.

Business Areas:
Content Discovery Content and Publishing, Media and Entertainment

The scientific community increasingly relies on open data sharing, yet existing metrics inadequately capture the true impact of datasets as research outputs. Traditional measures, such as the h-index, focus on publications and citations but fail to account for dataset accessibility, reuse, and cross-disciplinary influence. We propose the X-index, a novel author-level metric that quantifies the value of data contributions through a two-step process: (i) computing a dataset-level value score (V-score) that integrates breadth of reuse, FAIRness, citation impact, and transitive reuse depth, and (ii) aggregating V-scores into an author-level X-index. Using datasets from computational social science, medicine, and crisis communication, we validate our approach against expert ratings, achieving a strong correlation. Our results demonstrate that the X-index provides a transparent, scalable, and low-cost framework for assessing data-sharing practices and incentivizing open science. The X-index encourages sustainable data-sharing practices and gives institutions, funders, and platforms a tangible way to acknowledge the lasting influence of research datasets.

Country of Origin
🇺🇸 United States

Page Count
8 pages

Category
Computer Science:
Computers and Society