HMVI: Unifying Heterogeneous Attributes with Natural Neighbors for Missing Value Inference
By: Xiaopeng Luo, Zexi Tan, Zhuowei Wang
Potential Business Impact:
Fills in missing data to make computers smarter.
Missing value imputation is a fundamental challenge in machine intelligence, heavily dependent on data completeness. Current imputation methods often handle numerical and categorical attributes independently, overlooking critical interdependencies among heterogeneous features. To address these limitations, we propose a novel imputation approach that explicitly models cross-type feature dependencies within a unified framework. Our method leverages both complete and incomplete instances to ensure accurate and consistent imputation in tabular data. Extensive experimental results demonstrate that the proposed approach achieves superior performance over existing techniques and significantly enhances downstream machine learning tasks, providing a robust solution for real-world systems with missing data.
Similar Papers
MissHDD: Hybrid Deterministic Diffusion for Hetrogeneous Incomplete Data Imputation
Machine Learning (CS)
Fixes missing info in computer charts.
An Interdisciplinary and Cross-Task Review on Missing Data Imputation
Machine Learning (Stat)
Fixes broken data for better computer decisions.
IVGAE: Handling Incomplete Heterogeneous Data with a Variational Graph Autoencoder
Machine Learning (CS)
Fixes missing info in computer tables.