A Semi-supervised Generative Model for Incomplete Multi-view Data Integration with Missing Labels

Published: August 15, 2025 | arXiv ID: 2508.11180v1

By: Yiyang Shen, Weiran Wang

Potential Business Impact:

Helps computers learn from incomplete data.

Multi-view learning is widely applied to real-life datasets, such as multi-omics biological data, but it often suffers from both missing views and missing labels. Prior probabilistic approaches based on the information bottleneck (IB) principle addressed the missing-view problem by using a product-of-experts scheme to aggregate representations from the views that are present, and they outperformed deterministic classifiers. However, the IB framework is inherently fully supervised and cannot leverage unlabeled data. In this work, we propose a semi-supervised generative model that utilizes both labeled and unlabeled samples in a unified framework. Our method maximizes the likelihood of unlabeled samples to learn a latent space that is shared with the IB objective on labeled data. We also perform cross-view mutual information maximization in the latent space to enhance the extraction of shared information across views. Compared to existing approaches, our model achieves better predictive and imputation performance on both image and multi-omics data with missing views and limited labeled samples.
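
The product-of-experts aggregation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each present view yields a diagonal Gaussian over the latent space, that a standard normal prior acts as an extra expert (a common convention, not stated in the abstract), and the name `product_of_experts` is hypothetical. Missing views are simply skipped, which is how this scheme handles incomplete samples.

```python
from typing import List, Optional, Sequence, Tuple

# (mean, variance) per latent dimension for one view's Gaussian expert
Gaussian = Tuple[List[float], List[float]]


def product_of_experts(
    experts: Sequence[Optional[Gaussian]],
    latent_dim: int,
    include_prior: bool = True,  # assumption: N(0, 1) prior used as an extra expert
) -> Gaussian:
    """Fuse diagonal-Gaussian experts; None marks a missing view and is skipped."""
    # A product of Gaussians adds precisions (1 / variance) and
    # precision-weighted means, dimension by dimension.
    precision = [1.0 if include_prior else 0.0] * latent_dim
    weighted_mean = [0.0] * latent_dim

    for expert in experts:
        if expert is None:  # view not observed for this sample
            continue
        mean, var = expert
        for d in range(latent_dim):
            precision[d] += 1.0 / var[d]
            weighted_mean[d] += mean[d] / var[d]

    if any(p == 0.0 for p in precision):
        raise ValueError("no expert available: all views missing and prior disabled")

    fused_var = [1.0 / p for p in precision]
    fused_mean = [m * v for m, v in zip(weighted_mean, fused_var)]
    return fused_mean, fused_var


# Three views; the second one is missing for this sample.
views = [([1.0, 0.0], [1.0, 0.5]), None, ([3.0, 2.0], [1.0, 0.5])]
fused_mean, fused_var = product_of_experts(views, latent_dim=2)
```

Because every additional expert adds precision, the fused posterior becomes narrower as more views are observed, and it falls back to the prior when none are.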

Interpretable Generative and Discriminative Learning for Multimodal and Incomplete Clinical Data

Machine Learning (Stat)

Helps doctors understand sick people better.

10 Oct 2025 0

88%

Informative missingness and its implications in semi-supervised learning

Machine Learning (Stat)

Teaches computers with less data, better results.

4 Dec 2025 0

88%

Cross-view Joint Learning for Mixed-Missing Multi-view Unsupervised Feature Selection

Machine Learning (CS)

Finds important data even when some is missing.

15 Nov 2025 2

Country of Origin

🇺🇸 United States

Page Count

9 pages
