Score: 0

Towards a general diffusion-based information quality assessment model

Published: August 19, 2025 | arXiv ID: 2508.13927v3

By: Anthony Lopes Temporao , Mickael Temporão , Corentin Vande Kerckhove and more

Potential Business Impact:

Find good information by how it spreads.

Business Areas:
Reputation Information Technology

The rapid and unregulated dissemination of information in the digital era has amplified the global "infodemic," complicating the identification of high quality information. We present a lightweight, interpretable and non-invasive framework for assessing information quality based solely on diffusion dynamics, demonstrated here in the context of academic publications. Using a heterogeneous dataset of 29,264 sciences, technology, engineering, mathematics (STEM) and social science papers from ArnetMiner and OpenAlex, we model the diffusion network of each paper as a set of three theoretically motivated features: diversity, timeliness, and salience. A Generalized Additive Model (GAM) trained on these features achieved Pearson correlations of 0.834 for next-year citation gain and up to 95.62% accuracy in predicting high-impact papers. Feature relevance studies reveal timeliness and salience as the most robust predictors, while diversity offers less stable benefits in the academic setting but may be more informative in social media contexts. The framework's transparency, domain-agnostic design, and minimal feature requirements position it as a scalable tool for global information quality assessment, opening new avenues for moving beyond binary credibility labels toward richer, diffusion-informed evaluation metrics.

Country of Origin
🇧🇪 Belgium

Page Count
24 pages

Category
Computer Science:
Social and Information Networks