Simultaneous Heterogeneity and Reduced-rank Learning for Multivariate Response Regression
By: Jie Wu , Bo Zhang , Daoji Li and more
Potential Business Impact:
Finds hidden groups in mixed data.
Heterogeneous data are now ubiquitous in many applications in which correctly identifying the subgroups from a heterogeneous population is critical. Although there is an increasing body of literature on subgroup detection, existing methods mainly focus on the univariate response setting. In this paper, we propose a joint heterogeneity and reduced-rank learning framework to simultaneously identify the subgroup structure and estimate the covariate effects for heterogeneous multivariate response regression. In particular, our approach uses rank-constrained pairwise fusion penalization and conducts the subgroup analysis without requiring prior knowledge regarding the individual subgroup memberships. We implement the proposed approach by an alternating direction method of multipliers (ADMM) algorithm and show its convergence. We also establish the asymptotic properties for the resulting estimators under mild and interpretable conditions. A predictive information criterion is proposed to select the rank of the coefficient matrix with theoretical support. The effectiveness of the proposed approach is demonstrated through simulation studies and a real data application.
Similar Papers
High-dimensional low-rank matrix regression with unknown latent structures
Methodology
Finds patterns in data from many people.
Regularized Reduced Rank Regression for mixed predictor and response variables
Methodology
Finds important patterns in messy, big data.
Higher Order Reduced Rank Regression
Machine Learning (Stat)
Finds hidden patterns in complex data.