A PCA-based Data Prediction Method
By: Peteris Daugulis , Vija Vagale , Emiliano Mancini and more
Potential Business Impact:
Fills in missing numbers in data sets.
The problem of choosing appropriate values for missing data is often encountered in the data science. We describe a novel method containing both traditional mathematics and machine learning elements for prediction (imputation) of missing data. This method is based on the notion of distance between shifted linear subspaces representing the existing data and candidate sets. The existing data set is represented by the subspace spanned by its first principal components. Solutions for the case of the Euclidean metric are given.
Similar Papers
Quantum-Inspired Optimization Process for Data Imputation
Quantum Physics
Fixes missing health data for better health predictions.
An Interdisciplinary and Cross-Task Review on Missing Data Imputation
Machine Learning (Stat)
Fixes broken data for better computer decisions.
Kernel Representation and Similarity Measure for Incomplete Data
Machine Learning (CS)
Finds patterns in messy, missing information.