High Dimensional Sparse Canonical Correlation Analysis for Elliptical Symmetric Distributions
By: Chengde Qian, Yanhong Liu, Long Feng
Potential Business Impact:
Finds hidden connections in messy, big data.
This paper proposes a robust high-dimensional sparse canonical correlation analysis (CCA) method for investigating linear relationships between two high-dimensional random vectors, focusing on elliptical symmetric distributions. Traditional CCA methods, based on sample covariance matrices, struggle in high-dimensional settings, particularly when data exhibit heavy-tailed distributions. To address this, we introduce the spatial-sign covariance matrix as a robust estimator, combined with a sparsity-inducing penalty to efficiently estimate canonical correlations. Theoretical analysis shows that our method is consistent and robust under mild conditions, converging at an optimal rate even in the presence of heavy tails. Simulation studies demonstrate that our approach outperforms existing sparse CCA methods, particularly under heavy-tailed distributions. A real-world application further confirms the method's robustness and efficiency in practice. Our work provides a novel solution for high-dimensional canonical correlation analysis, offering significant advantages over traditional methods in terms of both stability and performance.
Similar Papers
Sparse canonical correlation analysis for multiple measurements with latent trajectories
Methodology
Finds hidden patterns in changing health data.
Two new approaches to multiple canonical correlation analysis for repeated measures data
Methodology
Finds hidden connections in complex, changing data.
Two approaches to multiple canonical correlation analysis for repeated measures data
Methodology
Finds hidden connections in complex, changing data.