Structural Effect and Spectral Enhancement of High-Dimensional Regularized Linear Discriminant Analysis
By: Yonghan Zhang , Zhangni Pu , Lu Yan and more
Potential Business Impact:
Improves computer sorting of data, even with many details.
Regularized linear discriminant analysis (RLDA) is a widely used tool for classification and dimensionality reduction, but its performance in high-dimensional scenarios is inconsistent. Existing theoretical analyses of RLDA often lack clear insight into how data structure affects classification performance. To address this issue, we derive a non-asymptotic approximation of the misclassification rate and thus analyze the structural effect and structural adjustment strategies of RLDA. Based on this, we propose the Spectral Enhanced Discriminant Analysis (SEDA) algorithm, which optimizes the data structure by adjusting the spiked eigenvalues of the population covariance matrix. By developing a new theoretical result on eigenvectors in random matrix theory, we derive an asymptotic approximation on the misclassification rate of SEDA. The bias correction algorithm and parameter selection strategy are then obtained. Experiments on synthetic and real datasets show that SEDA achieves higher classification accuracy and dimensionality reduction compared to existing LDA methods.
Similar Papers
Spatial Sign based Direct Sparse Linear Discriminant Analysis for High Dimensional Data
Methodology
Helps computers sort data better, even when it's messy.
Transfer learning via Regularized Linear Discriminant Analysis
Machine Learning (Stat)
Improves computer guesses using other computer guesses.
Spectrally-Corrected and Regularized QDA Classifier for Spiked Covariance Model
Machine Learning (CS)
Helps computers sort data better, even when it's tricky.