Joint Learning of Unsupervised Multi-view Feature and Instance Co-selection with Cross-view Imputation
By: Yuxin Cai, Yanyong Huang, Jinyuan Chang and more
Feature and instance co-selection, which aims to reduce both feature dimensionality and sample size by identifying the most informative features and instances, has attracted considerable attention in recent years. However, when dealing with unlabeled incomplete multi-view data, where some samples are missing in certain views, existing methods typically first impute the missing data and then concatenate all views into a single dataset for subsequent co-selection. Such a strategy treats co-selection and missing data imputation as two independent processes, overlooking potential interactions between them. The inter-sample relationships gleaned from co-selection can aid imputation, which in turn enhances co-selection performance. Additionally, simply merging multi-view data fails to capture the complementary information among views, ultimately limiting co-selection effectiveness. To address these issues, we propose a novel co-selection method, termed Joint learning of Unsupervised multI-view feature and instance Co-selection with cross-viEw imputation (JUICE). JUICE first reconstructs incomplete multi-view data using available observations, bringing missing data recovery and feature and instance co-selection together in a unified framework. Then, JUICE leverages cross-view neighborhood information to learn inter-sample relationships and further refine the imputation of missing values during reconstruction. This enables the selection of more representative features and instances. Extensive experiments demonstrate that JUICE outperforms state-of-the-art methods.
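To make the idea of cross-view neighborhood imputation concrete, here is a minimal sketch in Python. It is not the JUICE algorithm, whose objective and optimization are not given in the abstract. It only illustrates the general principle that a sample missing in one view can be estimated from its nearest neighbors in another view where it is observed. The function name `cross_view_knn_impute`, the boolean `observed` mask, the parameter `k`, and the toy data are all assumptions made for illustration.

```python
import numpy as np


def cross_view_knn_impute(views, observed, k=1):
    """Illustrative cross-view k-NN imputation (hypothetical helper, not JUICE).

    views:    list of (n, d_v) arrays, one per view; rows of missing samples
              may hold any placeholder value (they are never read).
    observed: (n, V) boolean array; observed[i, v] is True when sample i
              is available in view v.
    """
    imputed = [np.asarray(X, dtype=float).copy() for X in views]
    for v, X in enumerate(imputed):
        for i in np.flatnonzero(~observed[:, v]):
            estimates = []
            for u, Xu in enumerate(views):
                if u == v or not observed[i, u]:
                    continue
                # Candidate neighbors must be observed in both views u and v.
                cand = np.flatnonzero(observed[:, u] & observed[:, v])
                if cand.size == 0:
                    continue
                # Find the k nearest candidates to sample i in view u.
                dist = np.linalg.norm(np.asarray(Xu)[cand] - np.asarray(Xu)[i], axis=1)
                nearest = cand[np.argsort(dist)[:k]]
                # Average those neighbors' features in the target view v.
                estimates.append(views[v][nearest].mean(axis=0))
            if estimates:
                X[i] = np.mean(estimates, axis=0)
    return imputed


# Toy data: 4 samples, 2 views; sample 3 is missing in view 1.
view0 = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
view1 = np.array([[1.0], [1.2], [9.0], [0.0]])  # last row is a placeholder
observed = np.array([[True, True], [True, True], [True, True], [True, False]])

filled = cross_view_knn_impute([view0, view1], observed, k=1)
```

In this toy case, sample 3 is closest to sample 2 in view 0, so its missing entry in view 1 is filled with sample 2's value. A joint method as described in the abstract would instead learn the inter-sample relationships and the imputation together with the selection of features and instances, rather than fixing the neighbors up front.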
Similar Papers
Cross-view Joint Learning for Mixed-Missing Multi-view Unsupervised Feature Selection
Machine Learning (CS)
Finds important data even when some is missing.
Simple Yet Effective Selective Imputation for Incomplete Multi-view Clustering
CV and Pattern Recognition
Groups similar things even with missing info.
A Semi-supervised Generative Model for Incomplete Multi-view Data Integration with Missing Labels
Machine Learning (CS)
Helps computers learn from incomplete data.