Score: 1

On Misspecified Error Distributions in Bayesian Functional Clustering: Consequences and Remedies

Published: October 20, 2025 | arXiv ID: 2510.17215v1

By: Fumiya Iwashige , Tomoya Wakayama , Shonosuke Sugasawa and more

Potential Business Impact:

Finds hidden groups in data better.

Business Areas:

Big Data Data and Analytics

Nonparametric Bayesian approaches provide a flexible framework for clustering without pre-specifying the number of groups, yet they are well known to overestimate the number of clusters, especially for functional data. We show that a fundamental cause of this phenomenon lies in misspecification of the error structure: errors are conventionally assumed to be independent across observed points in Bayesian functional models. Through high-dimensional clustering theory, we demonstrate that ignoring the underlying correlation leads to excess clusters regardless of the flexibility of prior distributions. Guided by this theory, we propose incorporating the underlying correlation structures via Gaussian processes and also present its scalable approximation with principled hyperparameter selection. Numerical experiments illustrate that even simple clustering based on Dirichlet processes performs well once error dependence is properly modeled.

Total Robustness in Bayesian Nonlinear Regression for Measurement Error Problems under Model Misspecification

Methodology

Makes computer predictions more trustworthy with bad data.

3 Oct 2025 0

88%

Implementing Errors on Errors: Bayesian vs Frequentist

High Energy Physics - Phenomenology

Fixes science numbers when they don't match.

10 May 2025 0

88%

Total Robustness in Bayesian Nonlinear Regression for Measurement Error Problems under Model Misspecification

Methodology

Makes computer predictions more accurate with messy data.

3 Oct 2025 1

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

39 pages

On Misspecified Error Distributions in Bayesian Functional Clustering: Consequences and Remedies

Finds hidden groups in data better.

Technical Abstract

Total Robustness in Bayesian Nonlinear Regression for Measurement Error Problems under Model Misspecification

Implementing Errors on Errors: Bayesian vs Frequentist

Total Robustness in Bayesian Nonlinear Regression for Measurement Error Problems under Model Misspecification