JOURNAL ARTICLE

Robust Multivariate Outlier Detection Methods for Environmental Data

Ibrahim AlameddineMelissa A. KenneyRussell J. GosnellKenneth H. Reckhow

Year: 2010 Journal:   Journal of Environmental Engineering Vol: 136 (11)Pages: 1299-1304   Publisher: American Society of Civil Engineers

Abstract

Outliers are an inevitable concern that needs to be identified and dealt with whenever one analyzes a large data set. Today's water quality data are often collected on different scales, encompass several sites, monitor several correlated parameters, involve a multitude of individuals from several agencies, and span over several years. As such, the ability to identify outliers, which may affect the results of the analysis, is crucial. This note presents several statistical techniques that have been developed to deal with this problem, with particular emphasis on robust multivariate methods. These techniques are capable of isolating outliers while overcoming the effects of masking that can hinder the effectiveness of common outlier detection techniques such as Mahalanobis distances (MD). This note uses a comprehensive national metadata set on lake water quality as a case study to analyze the effectiveness of three robust outlier detection techniques, namely, the minimum covariance determinant (MCD), the minimum volume ellipsoid (MVE), and M-estimators. The note compares the results generated from these three techniques to assess the severity of each method when it comes to labeling observations as outliers. The results demonstrate the limitations of using MD to analyze multidimensional water quality data. The analysis also highlighted the differences between the three robust multivariate methods, whereby the MVE method was found to be the most severe when it came to outlier detection, while the MCD was the most lenient. Of the three robust multivariate outlier detection methods analyzed, the M-estimator proved to be the most flexible because it allowed for downweighting rather than censoring many borderline outlier observations.

Keywords:
Outlier Anomaly detection Mahalanobis distance Multivariate statistics Data mining Computer science Data set Robust statistics Set (abstract data type) Estimator Covariance Data quality Multivariate analysis Statistics Artificial intelligence Metric (unit) Mathematics Machine learning Engineering

Metrics

28
Cited By
1.21
FWCI (Field Weighted Citation Impact)
35
Refs
0.78
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Statistical Methods and Models
Physical Sciences →  Mathematics →  Statistics and Probability
Water Quality and Pollution Assessment
Physical Sciences →  Environmental Science →  Water Science and Technology
Advanced Statistical Process Monitoring
Social Sciences →  Decision Sciences →  Statistics, Probability and Uncertainty

Related Documents

JOURNAL ARTICLE

Robust Outlier Detection Method For Multivariate Spatial Data

Sweta ShuklaS. Lalitha

Journal:   National Academy Science Letters Year: 2021 Vol: 44 (6)Pages: 551-554
JOURNAL ARTICLE

Outlier Detection for Compositional Data Using Robust Methods

Peter FilzmoserKarel Hron

Journal:   Mathematical Geosciences Year: 2008 Vol: 40 (3)Pages: 233-248
JOURNAL ARTICLE

Outlier detection in multivariate data

K. Senthamarai KannanK. Manoj

Journal:   Applied Mathematical Sciences Year: 2015 Vol: 9 Pages: 2317-2324
© 2026 ScienceGate Book Chapters — All rights reserved.