Regularized least squares learning with heavy-tailed noise is minimax optimal
By: Mattes Mollenhauer, Nicole Mücke, Dimitri Meunier and more
Potential Business Impact:
Makes machine learning work better with messy, heavy-tailed data.
This paper examines the performance of ridge regression in reproducing kernel Hilbert spaces in the presence of noise that possesses only a finite number of higher moments. We establish excess risk bounds consisting of subgaussian and polynomial terms based on the well-known integral operator framework. The dominant subgaussian component allows us to achieve convergence rates that have previously only been derived under subexponential noise, a prevalent assumption in related work from the last two decades. These rates are optimal under standard eigenvalue decay conditions, demonstrating the asymptotic robustness of regularized least squares against heavy-tailed noise. Our derivations are based on a Fuk-Nagaev inequality for Hilbert-space-valued random variables.
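For context, a minimal sketch of the standard regularized least squares (kernel ridge regression) estimator in a reproducing kernel Hilbert space, written in LaTeX notation; the symbols H, lambda, and n are generic choices here and the paper's own notation may differ:

\hat{f}_\lambda = \operatorname*{arg\,min}_{f \in \mathcal{H}} \; \frac{1}{n} \sum_{i=1}^{n} \big( f(x_i) - y_i \big)^2 + \lambda \, \| f \|_{\mathcal{H}}^2

The excess risk bounds described in the abstract then control the gap between the risk of this estimator and the best attainable risk as the sample size n grows and the regularization parameter lambda is tuned accordingly.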
Similar Papers
Understanding Robust Machine Learning for Nonparametric Regression with Heavy-Tailed Noise
Machine Learning (CS)
Makes computers learn from messy, unreliable data.
Minimax Optimal Robust Sparse Regression with Heavy-Tailed Designs: A Gradient-Based Approach
Methodology
Makes computer learning work with messy data.
Heavy Lasso: sparse penalized regression under heavy-tailed noise via data-augmented soft-thresholding
Methodology
Makes computer models better with messy data.