JOURNAL ARTICLE

Detecting outliers in multivariate data while controlling false alarm rate

André Achim

Year: 2012 Journal:   Tutorials in Quantitative Methods for Psychology Vol: 8 (2)Pages: 108-121   Publisher: University of Ottawa

Abstract

Outlier identification often implies inspecting each z-transformed variable and adding a Mahalanobis D 2 . Multiple outliers may mask each other by increasing variance estimates. Caroni & Prescott (1992) proposed a extension of Rosner’s (1983) technique to circumvent masking, taking sample size into account to keep the false alarm risk below, say, α = .05. Simulations studies here compare the single approach to multiple-univariate plus multivariate tests, each at a Bonferroni corrected α level, in terms of power at detecting outliers. Results suggest the former is better only up to about 12 variables. Macros in an Excel spreadsheet implement these techniques. The impetus of the present work was to identify, in the context of a graduate course in statistics, sound statistical procedures to recommend for the examination of data for the detection of outliers, assuming normal distributions . The basic consideration is that the statistical criterion beyond which a piece of data would be considered an outlier must take into account both the number of cases (subjects) inspected as well as the number of variables examined if the variables are inspected one by one. This is required to adequately control the risk of falsely rejecting at least one case that actually belongs to the population. In particular, a fixed critical z-score, irrespective of number of variables or of sample size, can hardly be recommended. Beyond controlling for false alarm (FA) rate,

Keywords:
Outlier Univariate Statistics Bonferroni correction Sample size determination Multivariate statistics Anomaly detection Context (archaeology) Mahalanobis distance Constant false alarm rate Computer science Mathematics Data mining Artificial intelligence

Metrics

4
Cited By
0.32
FWCI (Field Weighted Citation Impact)
5
Refs
0.64
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Statistical Methods and Models
Physical Sciences →  Mathematics →  Statistics and Probability
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Statistical Process Monitoring
Social Sciences →  Decision Sciences →  Statistics, Probability and Uncertainty

Related Documents

JOURNAL ARTICLE

Detecting Outliers in Multivariate Laboratory Data

Harry Southworth

Journal:   Journal of Biopharmaceutical Statistics Year: 2008 Vol: 18 (6)Pages: 1178-1183
JOURNAL ARTICLE

Removing Outliers from 3D Macrotexture Data by Controlling False Discovery Rate

Vincent I. BongioanniSamer W. KatichaGerardo W. Flintsch

Journal:   Journal of Transportation Engineering Part B Pavements Year: 2019 Vol: 145 (3)Pages: 04019016-04019016
JOURNAL ARTICLE

DETECTING MULTIVARIATE OUTLIERS IN ARTEFACT COMPOSITIONAL DATA*

M.J. Baxter

Journal:   Archaeometry Year: 1999 Vol: 41 (2)Pages: 321-338
JOURNAL ARTICLE

Detecting outliers in multivariate data and visualization-R scripts

Sung‐Soo Kim

Journal:   Korean Journal of Applied Statistics Year: 2018 Vol: 31 (4)Pages: 517-528
© 2026 ScienceGate Book Chapters — All rights reserved.