JOURNAL ARTICLE

Incremental Methods for Detecting Outliers from Multivariate Data Stream

Abstract

Outlier detection is one of the most important data mining techniques. It has broad applications like fraud detection, credit approval, computer network intrusion detection, anti-money laundering, etc. The basis of outlier detection is to identify data points which are “different” or “far away” from the rest of the data points in the given dataset. Traditional outlier detection method is based on statistical analysis. However, this traditional method has an inherent drawback—it requires the availability of the entire dataset. In practice, especially in the real time data feed application, it is not so realistic to wait for all the data because fresh data are streaming in very quickly. Outlier detection is hence done in batches. However two drawbacks may arise: relatively long processing time because of the massive size, and the result may be outdated soon between successive updates. In this paper, we propose several novel incremental methods to process the real time data effectively for outlier detection. For the experiment, we test three types of mechanisms for analyzing the dataset, namely Global Analysis, Cumulative Analysis and Lightweight Analysis with Sliding Window. The experiment dataset is “household power consumption” which is a popular benchmarking data for Massive Online Analysis.

Keywords:
Computer science Anomaly detection Outlier Data mining Benchmarking Sliding window protocol Intrusion detection system Data stream Process (computing) Artificial intelligence Window (computing)

Metrics

2
Cited By
0.97
FWCI (Field Weighted Citation Impact)
12
Refs
0.81
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Detecting Outliers in Multivariate Laboratory Data

Harry Southworth

Journal:   Journal of Biopharmaceutical Statistics Year: 2008 Vol: 18 (6)Pages: 1178-1183
JOURNAL ARTICLE

DETECTING MULTIVARIATE OUTLIERS IN ARTEFACT COMPOSITIONAL DATA*

M.J. Baxter

Journal:   Archaeometry Year: 1999 Vol: 41 (2)Pages: 321-338
JOURNAL ARTICLE

Detecting outliers in multivariate data and visualization-R scripts

Sung‐Soo Kim

Journal:   Korean Journal of Applied Statistics Year: 2018 Vol: 31 (4)Pages: 517-528
JOURNAL ARTICLE

Eigenstructure-Based Angle for Detecting Outliers in Multivariate Data

Nazrina Nazrina

Journal:   Sains Malaysiana Year: 2014 Vol: 43 (12)Pages: 1973-1977
© 2026 ScienceGate Book Chapters — All rights reserved.