Enhancing Differentially Private Linear Regression via Public Second-Moment
By: Zilong Cao, Hai Zhang
Potential Business Impact:
Makes private data analysis more accurate and safe.
Leveraging information from public data has become increasingly crucial in enhancing the utility of differentially private (DP) methods. Traditional DP approaches often require adding noise based solely on private data, which can significantly degrade utility. In this paper, we address this limitation in the context of the ordinary least squares estimator (OLSE) of linear regression based on sufficient statistics perturbation (SSP) under the unbounded data assumption. We propose a novel method that involves transforming private data using the public second-moment matrix to compute a transformed SSP-OLSE, whose second-moment matrix yields a better condition number and improves the OLSE accuracy and robustness. We derive theoretical error bounds about our method and the standard SSP-OLSE to the non-DP OLSE, which reveal the improved robustness and accuracy achieved by our approach. Experiments on synthetic and real-world datasets demonstrate the utility and effectiveness of our method.
Similar Papers
Near-Optimal Private Linear Regression via Iterative Hessian Mixing
Machine Learning (CS)
Makes private data safe for computer learning.
Statistical Inference for Differentially Private Stochastic Gradient Descent
Machine Learning (Stat)
Makes private data safe for computer learning.
Lower Bounds for Public-Private Learning under Distribution Shift
Machine Learning (CS)
Combines data to learn better, but only if it's similar.