Abstract
The Mahalanobis distance is a well-known criterion which may be used for detecting outliers in multivariate data However, there are some discrepancies about which critical values are suitable for this purpose. Following a comparison with Wilks's method, this paper shows that the previously recommended (p(n-1)/(n-p))Fp,n-p are unsuitable, and p(n-1)2Fp,n-p-1/n(n-p-1+pFp,n-p-1) are the correct critical values when searching for a single outlier. The importance of which critical values should be used is illustrated when searching for a single outlier in a clinical laboratory data set containing 10 patients and five variables. The jackknifed Mahalanobis distance is also discussed and the relevant critical values are given. Finally, upper bounds for the usual Mahalanobis distance and the jackknifed version are discussed.
Original language | English |
---|---|
Pages (from-to) | 73-81 |
Number of pages | 9 |
Journal | Journal of the Royal Statistical Society. Series C: Applied Statistics |
Volume | 45 |
Issue number | 1 |
DOIs | |
Publication status | Published - 1996 |
Keywords
- Critical values
- Jackknifed Mahalanobis distance
- Mahalanobis distance
- Multivariate outliers