JOURNAL ARTICLE

A Diagnostic Procedure for High-Dimensional Data Streams via Missed Discovery Rate Control

Wendong LiDongdong XiangFugee TsungXiaolong Pu

Year: 2019 Journal:   Technometrics Vol: 62 (1)Pages: 84-100   Publisher: Taylor & Francis

Abstract

Monitoring complex systems involving high-dimensional data streams (HDS) provides quick real-time detection of abnormal changes of system performance, but accurate and efficient diagnosis of the streams responsible has also become increasingly important in many data-rich statistical process control applications. Existing diagnostic procedures, designed for low/moderate dimensional multivariate process, may miss too much important information in the out-of-control streams with a high signal-to-noise ratio (SNR) or waste too many resources finding useless in-control streams with a low SNR. In addition, these procedures do not differentiate between streams according to their severity. In this article, we formulate the diagnosis problem of HDS as a multiple testing problem and provide a computationally fast diagnostic procedure to control the weighted missed discovery rate (wMDR) at some satisfactory level. The proposed procedure overcomes the limitations of conventional diagnostic procedures by controlling the wMDR and minimizing the expected number of false positives as well. We show theoretically that the proposed procedure is asymptotically valid and optimal in a certain sense. Simulation studies and a real data analysis from a semiconductor manufacturing process show that the proposed procedure works very well in practice.

Keywords:
Data stream mining Computer science False positive paradox Data mining Process (computing) False discovery rate Statistical process control False positives and false negatives Multiple comparisons problem STREAMS Control (management) Artificial intelligence Statistics Mathematics

Metrics

43
Cited By
2.95
FWCI (Field Weighted Citation Impact)
52
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Statistical Process Monitoring
Social Sciences →  Decision Sciences →  Statistics, Probability and Uncertainty
Fault Detection and Control Systems
Physical Sciences →  Engineering →  Control and Systems Engineering
Advanced Statistical Methods and Models
Physical Sciences →  Mathematics →  Statistics and Probability
© 2026 ScienceGate Book Chapters — All rights reserved.