Towards the Formalization of a Trustworthy AI for Mining Interpretable Models explOiting Sophisticated Algorithms
By: Riccardo Guidotti, Martina Cinquini, Marta Marchiori Manerba, and more
Potential Business Impact:
Makes AI fair, privacy-preserving, and understandable.
Interpretable-by-design models are crucial for fostering trust, accountability, and the safe adoption of automated decision-making in real-world applications. In this paper we lay the formal groundwork for MIMOSA (Mining Interpretable Models explOiting Sophisticated Algorithms), a comprehensive framework for generating predictive models that balance interpretability with performance while embedding key ethical properties. We formally define the supervised learning setting across diverse decision-making tasks and data types, including tabular data, time series, images, text, transactions, and trajectories. We characterize three major families of interpretable models: feature-importance-based, rule-based, and instance-based models. For each family, we analyze its interpretability dimensions, reasoning mechanisms, and complexity. Beyond interpretability, we formalize three critical ethical properties, namely causality, fairness, and privacy, providing formal definitions, evaluation metrics, and verification procedures for each. We then examine the inherent trade-offs between these properties and discuss how privacy requirements, fairness constraints, and causal reasoning can be embedded within interpretable pipelines. By evaluating ethical measures during model generation, the framework establishes the theoretical foundations for developing AI systems that are not only accurate and interpretable but also fair, privacy-preserving, and causally aware, i.e., trustworthy.
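To make the abstract's central idea concrete, here is a minimal, hypothetical Python sketch, not the paper's MIMOSA implementation: it fits an interpretable-by-design rule model (a depth-limited decision tree) and evaluates one standard group-fairness metric, demographic parity difference, of the kind the paper says can be measured during model generation. The synthetic data, feature names, and metric choice are illustrative assumptions.

# Hypothetical sketch (not the authors' code): an interpretable model
# plus one fairness metric evaluated at generation time.
import numpy as np
from sklearn.tree import DecisionTreeClassifier, export_text

rng = np.random.default_rng(0)

# Synthetic tabular data: two features and a binary protected attribute A.
X = rng.normal(size=(1000, 2))
protected = rng.integers(0, 2, size=1000)  # A: group membership (assumed)
y = (X[:, 0] + 0.5 * protected + rng.normal(scale=0.5, size=1000) > 0).astype(int)

# Interpretable-by-design: a shallow tree whose rules can be printed and read.
model = DecisionTreeClassifier(max_depth=3, random_state=0)
model.fit(np.column_stack([X, protected]), y)
print(export_text(model, feature_names=["x0", "x1", "protected"]))

# Demographic parity difference: |P(yhat=1 | A=0) - P(yhat=1 | A=1)|.
yhat = model.predict(np.column_stack([X, protected]))
dpd = abs(yhat[protected == 0].mean() - yhat[protected == 1].mean())
print(f"demographic parity difference: {dpd:.3f}")

In a pipeline of the sort the abstract describes, a metric like dpd would be computed alongside accuracy while candidate models are generated, so that fairness (and, analogously, privacy and causal criteria) constrains model selection rather than being audited after the fact.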
Similar Papers
Foundations of Interpretable Models
Machine Learning (CS)
Makes AI easier to understand and build.
Machine Learning for Medicine Must Be Interpretable, Shareable, Reproducible and Accountable by Design
Machine Learning (CS)
Makes medical AI trustworthy and understandable.
Beyond single-model XAI: aggregating multi-model explanations for enhanced trustworthiness
Machine Learning (CS)
Makes AI decisions easier to trust.