Completing and studentising Spearman's correlation in the presence of ties
By: Landon Hurley
Non-parametric correlation coefficients have been widely used for analysing arbitrary random variables upon common populations, when requiring an explicit error distribution to be known is an unacceptable assumption. We examine an \(\ell_{2}\) representation of a correlation coefficient (Emond and Mason, 2002) from the perspective of a statistical estimator upon random variables, and verify a number of interesting and highly desirable mathematical properties, mathematically similar to the Whitney embedding of a Hilbert space into the \(\ell_{2}\)-norm space. In particular, we show here that, in comparison to the traditional Spearman (1904) \(ρ\), the proposed Kemeny \(ρ_κ\) correlation coefficient satisfies Gauss-Markov conditions in the presence or absence of ties, thereby allowing both discrete and continuous marginal random variables. We also prove under standard regularity conditions a number of desirable scenarios, including the construction of a null hypothesis distribution which is Student-t distributed, parallel to standard practice with Pearson's r, but without requiring either continuous random variables nor particular Gaussian errors. Simulations in particular focus upon highly kurtotic data, with highly nominal empirical coverage consistent with theoretical expectation.
Similar Papers
Measures and Models of Non-Monotonic Dependence
Methodology
Finds hidden patterns in data, even weird ones.
Asymptotic Inference for Rank Correlations
Methodology
Finds how strongly two things are linked.
Standardization of Weighted Ranking Correlation Coefficients
Methodology
Makes weighted rankings fair and understandable.