Water Quality Data Imputation via A Fast Latent Factorization of Tensors with PID-based Optimizer
By: Qian Liu, Lan Wang, Bing Yang and more
Potential Business Impact:
Fixes bad water data for better decisions.
Water quality data can provide substantial decision support for water resource utilization and pollution prevention. However, water quality data contain numerous missing values caused by unavoidable factors such as sensor failure, which biases hydrological analysis and undermines accurate support for environmental governance decisions. Latent Factorization of Tensors (LFT) with Stochastic Gradient Descent (SGD) has proven to be an efficient imputation method. However, a standard SGD-based LFT model commonly suffers from slow convergence, which impairs its efficiency. To tackle this issue, this paper proposes a Fast Latent Factorization of Tensors (FLFT) model. It builds an adjusted instance error into SGD by using a nonlinear PID controller to incorporate the past, current, and future information of the prediction error, thereby improving the convergence rate. Compared with state-of-the-art models on real-world datasets, experimental results indicate that the FLFT model achieves a better convergence rate and higher accuracy.
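The idea of feeding a PID-adjusted error into SGD can be illustrated with a minimal sketch. This is not the paper's FLFT model: it assumes a plain linear PID rule (the paper uses a nonlinear one), a rank-R CP-style factorization, and hypothetical names and gain values (`pid_adjusted_error`, `train_lft_pid`, `kp`, `ki`, `kd`). The proportional term stands for the current error, the integral term for accumulated past errors, and the derivative term for the error trend, used here as a rough proxy for future information.

```python
import numpy as np


def pid_adjusted_error(e, state, kp=1.0, ki=0.1, kd=0.1):
    """Return a linear-PID-adjusted error for one observed entry.

    state is a two-element list [integral, previous_error] that is updated
    in place. On the first visit previous_error is 0, so the derivative
    term equals e. (Assumption: linear PID, not the paper's nonlinear form.)
    """
    integral = state[0] + e       # I: accumulated past errors
    derivative = e - state[1]     # D: change since the previous visit
    state[0], state[1] = integral, e
    return kp * e + ki * integral + kd * derivative


def train_lft_pid(entries, shape, rank=4, lr=0.01, reg=0.01, epochs=20,
                  seed=0, kp=1.0, ki=0.1, kd=0.1):
    """SGD on observed tensor entries, using the adjusted error in each update.

    entries: list of (i, j, k, value) tuples for the known cells only.
    shape:   sizes of the three tensor modes.
    """
    rng = np.random.default_rng(seed)
    A, B, C = (rng.uniform(0.0, 0.5, size=(n, rank)) for n in shape)
    states = [[0.0, 0.0] for _ in entries]   # one PID state per known entry
    for _ in range(epochs):
        for n, (i, j, k, y) in enumerate(entries):
            a, b, c = A[i].copy(), B[j].copy(), C[k].copy()
            e = y - np.sum(a * b * c)         # raw instance error
            e_adj = pid_adjusted_error(e, states[n], kp, ki, kd)
            # Standard regularized SGD step, with e replaced by e_adj.
            A[i] += lr * (e_adj * b * c - reg * a)
            B[j] += lr * (e_adj * a * c - reg * b)
            C[k] += lr * (e_adj * a * b - reg * c)
    return A, B, C


def predict(A, B, C, i, j, k):
    """Estimate the tensor cell (i, j, k), e.g. a missing measurement."""
    return float(np.sum(A[i] * B[j] * C[k]))
```

In this sketch a missing reading is imputed by calling `predict` on its indices after training. Setting `ki=0` and `kd=0` with `kp=1` reduces the update to ordinary SGD, which makes the two easy to compare.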
Similar Papers
Latent Tensor Factorization with Nonlinear PID Control for Missing Data Recovery in Non-Intrusive Load Monitoring
Machine Learning (CS)
Fixes smart meter data errors faster and better.
Academic Network Representation via Prediction-Sampling Incorporated Tensor Factorization
Machine Learning (CS)
Finds hidden science connections to predict future discoveries.
A Proportional-Integral Controller-Incorporated SGD Algorithm for High Efficient Latent Factor Analysis
Machine Learning (CS)
Learns faster from big data by remembering past lessons.