Refining Filter Global Feature Weighting for Fully-Unsupervised Clustering
By: Fabian Galis, Darian Onchis
Potential Business Impact:
Helps computers find hidden groups in data.
In the context of unsupervised learning, effective clustering plays a vital role in revealing patterns and insights from unlabeled data. However, the success of clustering algorithms often depends on the relevance and contribution of individual features, which can differ between datasets. This paper explores feature weighting for clustering and presents new weighting strategies, including methods based on SHAP (SHapley Additive exPlanations), a technique commonly used to provide explainability in supervised machine learning tasks. Rather than using SHAP values solely for explainability, we use them to weight features and thereby improve the clustering process itself in unsupervised scenarios. Our empirical evaluations across five benchmark datasets and clustering methods demonstrate that SHAP-based feature weighting can enhance unsupervised clustering quality, achieving up to a 22.69% improvement over other weighting methods (from 0.586 to 0.719 in terms of the Adjusted Rand Index). Additionally, the situations in which the weighted data boosts results are highlighted and thoroughly explored, offering insight for practical applications.
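The abstract does not spell out the weighting pipeline, so the snippet below is only a minimal sketch of the general idea: derive per-feature weights from the SHAP values of a surrogate classifier trained on pseudo-labels from an initial k-means run, then re-cluster on the weighted features. The surrogate random forest, the mean-|SHAP| weighting scheme, and the wine dataset are illustrative assumptions rather than the authors' exact method; the Adjusted Rand Index comparison mirrors the metric cited in the abstract.

```python
import numpy as np
import shap
from sklearn.cluster import KMeans
from sklearn.datasets import load_wine
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import adjusted_rand_score
from sklearn.preprocessing import StandardScaler

# Benchmark data; the true labels y are used only to compute the ARI.
X, y = load_wine(return_X_y=True)
X = StandardScaler().fit_transform(X)

# 1. Baseline clustering on the unweighted features.
base_labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

# 2. Surrogate classifier fit on the pseudo-labels, so SHAP can attribute
#    how much each feature contributes to the cluster assignments.
surrogate = RandomForestClassifier(n_estimators=200, random_state=0)
surrogate.fit(X, base_labels)
shap_values = shap.TreeExplainer(surrogate).shap_values(X)

# 3. Collapse the attributions into one non-negative weight per feature:
#    mean |SHAP| over samples and classes, rescaled to sum to n_features.
sv = np.asarray(shap_values)
if sv.ndim == 3 and sv.shape[-1] == X.shape[1]:
    abs_shap = np.abs(sv).mean(axis=(0, 1))   # shape (n_classes, n_samples, n_features)
else:
    abs_shap = np.abs(sv).mean(axis=(0, 2))   # shape (n_samples, n_features, n_classes)
weights = abs_shap / abs_shap.sum() * X.shape[1]

# 4. Re-cluster on the SHAP-weighted feature space and compare.
weighted_labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X * weights)

print(f"ARI, unweighted:    {adjusted_rand_score(y, base_labels):.3f}")
print(f"ARI, SHAP-weighted: {adjusted_rand_score(y, weighted_labels):.3f}")
```

Training a supervised explainer on pseudo-labels is one common way to bring SHAP into a fully unsupervised setting; the branching in step 3 only accommodates the two output layouts that different shap versions return for multiclass models.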
Similar Papers
Shapley-Inspired Feature Weighting in $k$-means with No Additional Hyperparameters
Machine Learning (CS)
Finds important patterns by ignoring bad data.
FORCE: Feature-Oriented Representation with Clustering and Explanation
Machine Learning (CS)
Helps computers learn hidden patterns for better predictions.
SHAP-Based Supervised Clustering for Sample Classification and the Generalized Waterfall Plot
Machine Learning (CS)
Shows why computers make certain decisions.