Score: 0

No Data? No Problem: Robust Vision-Tabular Learning with Missing Values

Published: December 22, 2025 | arXiv ID: 2512.19602v1

By: Marta Hasny , Laura Daza , Keno Bressem and more

Large-scale medical biobanks provide imaging data complemented by extensive tabular information, such as demographics or clinical measurements. However, this abundance of tabular attributes does not reflect real-world datasets, where only a subset of attributes may be available. This discrepancy calls for methods that can leverage all the tabular data during training while remaining robust to missing values at inference. To address this challenge, we propose RoVTL (Robust Vision-Tabular Learning), a framework designed to handle any level of tabular data availability, from 0% to 100%. RoVTL comprises two key stages: contrastive pretraining, where we introduce tabular attribute missingness as data augmentation to promote robustness, and downstream task tuning using a gated cross-attention module for multimodal fusion. During fine-tuning, we employ a novel Tabular More vs. Fewer loss that ranks performance based on the amount of available tabular data. Combined with disentangled gradient learning, this enables consistent performance across all tabular data completeness scenarios. We evaluate RoVTL on cardiac MRI scans from the UK Biobank, demonstrating superior robustness to missing tabular data compared to prior methods. Furthermore, RoVTL successfully generalizes to an external cardiac MRI dataset for multimodal disease classification, and extends to the natural images domain, achieving robust performance on a car advertisements dataset. The code is available at https://github.com/marteczkah/RoVTL.

TGV: Tabular Data-Guided Learning of Visual Cardiac Representations

CV and Pattern Recognition

Helps doctors see patient differences in heart scans.

19 Mar 2025 0

88%

Unleashing the Power of Image-Tabular Self-Supervised Learning via Breaking Cross-Tabular Barriers

CV and Pattern Recognition

Helps doctors diagnose diseases better across hospitals.

16 Dec 2025 3

87%

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

CV and Pattern Recognition

Teaches computers to read tables without examples.

1 Dec 2025 2

View PDF Login to Bookmark

No Data? No Problem: Robust Vision-Tabular Learning with Missing Values

Technical Abstract

TGV: Tabular Data-Guided Learning of Visual Cardiac Representations

Unleashing the Power of Image-Tabular Self-Supervised Learning via Breaking Cross-Tabular Barriers

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition