An Easily Tunable Approach to Robust and Sparse High-Dimensional Linear Regression
By: Takeyuki Sasai, Hironori Fujisawa
Potential Business Impact:
Finds hidden patterns even with messy data.
Sparse linear regression methods such as the Lasso require a tuning parameter that depends on the noise variance, which is typically unknown and difficult to estimate in practice. In the presence of heavy-tailed noise or adversarial outliers, the problem becomes more challenging. In this paper, we propose an estimator for robust and sparse linear regression that eliminates the need for explicit prior knowledge of the noise scale. Our method builds on the Huber loss and incorporates an iterative scheme that alternates between coefficient estimation and adaptive noise calibration via median-of-means. The approach is theoretically grounded and achieves sharp non-asymptotic error bounds under both sub-Gaussian and heavy-tailed noise assumptions. Moreover, the proposed method accommodates arbitrary outlier contamination in the response without requiring prior knowledge of the number of outliers or the sparsity level. Like previous robust estimators, our procedure avoids tuning parameters tied to the noise scale or sparsity; relative to them, it achieves comparable error bounds when the number of outliers is unknown and improved bounds when it is known. In particular, the improved bounds match the known minimax lower bounds up to constant factors.
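To make the alternating scheme concrete, below is a minimal NumPy sketch of the general idea, not the authors' exact estimator: proximal-gradient descent on an l1-penalized Huber loss, re-estimating the noise scale each iteration by applying median-of-means to the squared residuals. The Huber threshold constant 1.345, the penalty scaling lam * sigma, the block count, and the step size are all illustrative assumptions.

```python
import numpy as np


def huber_grad(r, tau):
    # psi-function (derivative) of the Huber loss: identity on
    # [-tau, tau], clipped to +/-tau outside.
    return np.clip(r, -tau, tau)


def soft_threshold(v, t):
    # Proximal operator of the l1 penalty (soft-thresholding).
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def median_of_means(x, n_blocks, rng):
    # Shuffle, split into blocks, average each block, and return
    # the median of the block means.
    blocks = np.array_split(rng.permutation(x), n_blocks)
    return np.median([b.mean() for b in blocks])


def robust_sparse_huber(X, y, lam, n_iter=100, n_blocks=10, seed=0):
    # Illustrative sketch: alternate (1) robust noise calibration and
    # (2) a proximal-gradient step on the penalized Huber objective.
    rng = np.random.default_rng(seed)
    n, p = X.shape
    step = n / np.linalg.norm(X, 2) ** 2  # 1/L for the smooth Huber part
    beta = np.zeros(p)
    sigma = 1.0  # crude initial noise-scale guess
    for _ in range(n_iter):
        r = y - X @ beta
        # (1) adaptive calibration: robust scale from squared residuals.
        sigma = np.sqrt(max(median_of_means(r ** 2, n_blocks, rng), 1e-12))
        # (2) coefficient update: the Huber threshold and the l1 level
        # both track the current scale estimate, so no noise-variance
        # tuning parameter is supplied by hand.
        grad = -(X.T @ huber_grad(r, 1.345 * sigma)) / n
        beta = soft_threshold(beta - step * grad, step * lam * sigma)
    return beta, sigma


# Toy demo: heavy-tailed noise plus a handful of gross outliers.
rng = np.random.default_rng(1)
n, p, s = 200, 400, 5
X = rng.standard_normal((n, p))
beta_true = np.zeros(p)
beta_true[:s] = 3.0
y = X @ beta_true + rng.standard_t(df=2, size=n)  # heavy-tailed noise
y[:10] += 50.0                                    # contaminated responses
beta_hat, sigma_hat = robust_sparse_huber(X, y, lam=3.0)
print("large coefficients at indices:", np.flatnonzero(np.abs(beta_hat) > 0.5))
```

The median-of-means step is what gives the calibration its robustness: a bounded number of contaminated responses can corrupt only a few block means, and the median of the block means ignores them.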
Similar Papers
Heavy Lasso: sparse penalized regression under heavy-tailed noise via data-augmented soft-thresholding
Methodology
Makes computer models better with messy data.
Understanding Robust Machine Learning for Nonparametric Regression with Heavy-Tailed Noise
Machine Learning (CS)
Makes computers learn from messy, unreliable data.
Efficient Group Lasso Regularized Rank Regression with Data-Driven Parameter Determination
Machine Learning (Stat)
Makes computer predictions more trustworthy with bad data.