Additive Density Regression
By: Eva-Maria Maier , Alexander Fottner , Sonja Greven and more
Potential Business Impact:
Helps understand income differences using math.
We present a structured additive regression approach to model conditional densities given scalar covariates, where only samples of the conditional distributions are observed. This links our approach to distributional regression models for scalar data. The model is formulated in a Bayes Hilbert space -- preserving nonnegativity and integration to one under summation and scalar multiplication -- with respect to an arbitrary finite measure. This allows to consider, amongst others, continuous, discrete and mixed densities. Our theoretical results include asymptotic existence, uniqueness, consistency, and asymptotic normality of the penalized maximum likelihood estimator, as well as confidence regions and inference for the (effect) densities. For estimation, we propose to maximize the penalized log-likelihood corresponding to an appropriate multinomial, or equivalently, Poisson regression model, which we show to approximate the original penalized maximum likelihood problem. We apply our framework to a motivating gender economic data set from the German Socio-Economic Panel Study (SOEP), analyzing the distribution of the woman's share in a couple's total labor income given covariate effects for year, place of residence and age of the youngest child. As the income share is a continuous variable having discrete point masses at zero and one for single-earner couples, the corresponding densities are of mixed type.
Similar Papers
Transforming Conditional Density Estimation Into a Single Nonparametric Regression Task
Machine Learning (Stat)
Helps computers guess missing information more accurately.
Robust Variable Selection in High-dimensional Nonparametric Additive Model
Methodology
Finds important patterns even with messy data.
Convex Mixed-Integer Programming for Causal Additive Models with Optimization and Statistical Guarantees
Methodology
Finds hidden connections between things from data.