JOURNAL ARTICLE

Hit screening with multivariate robust outlier detection

Abstract

Hit screening, which involves the identification of compounds or targets capable of modulating disease-relevant processes, is an important step in drug discovery. Some assays, such as image-based high-content screenings, produce complex multivariate readouts. To fully exploit the richness of such data, advanced analytical methods that go beyond the conventional univariate approaches should be employed. In this work, we tackle the problem of hit identification in multivariate assays. As with univariate assays, a hit from a multivariate assay can be defined as a candidate that yields an assay value sufficiently far away in distance from the mean or central value of inactives. Viewed another way, a hit is an outlier from the distribution of inactives. A method was developed for identifying multivariate hit in high-dimensional data sets based on principal components and robust Mahalanobis distance (the multivariate analogue to the Z- or T -statistic). The proposed method, termed mROUT (multivariate robust outlier detection), demonstrates superior performance over other techniques in the literature in terms of maintaining Type I error, false discovery rate and true discovery rate in simulation studies. The performance of mROUT is also illustrated on a CRISPR knockout data set from in-house phenotypic screening programme.

Keywords:
Univariate Multivariate statistics Mahalanobis distance Outlier Computer science Multivariate analysis False discovery rate Anomaly detection Statistic Data mining Identification (biology) Set (abstract data type) Data set Artificial intelligence Pattern recognition (psychology) Statistics Computational biology Mathematics Machine learning Biology

Metrics

1
Cited By
4.56
FWCI (Field Weighted Citation Impact)
54
Refs
0.82
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Cell Image Analysis Techniques
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Biophysics
Advanced Statistical Methods and Models
Physical Sciences →  Mathematics →  Statistics and Probability
Computational Drug Discovery Methods
Physical Sciences →  Computer Science →  Computational Theory and Mathematics

Related Documents

JOURNAL ARTICLE

Robust Outlier Detection Method For Multivariate Spatial Data

Sweta ShuklaS. Lalitha

Journal:   National Academy Science Letters Year: 2021 Vol: 44 (6)Pages: 551-554
JOURNAL ARTICLE

Robust Multivariate Outlier Detection Methods for Environmental Data

Ibrahim AlameddineMelissa A. KenneyRussell J. GosnellKenneth H. Reckhow

Journal:   Journal of Environmental Engineering Year: 2010 Vol: 136 (11)Pages: 1299-1304
JOURNAL ARTICLE

Multivariate Outlier Detection and Robust Covariance Matrix Estimation

Daniel PeñaFrancisco J. Prieto

Journal:   Technometrics Year: 2001 Vol: 43 (3)Pages: 286-310
JOURNAL ARTICLE

Robust Multivariate Outlier Labeling

Dyah Erny HerwindiatiMaman A. DjauhariMuhammad Mashuri

Journal:   Communications in Statistics - Simulation and Computation Year: 2007 Vol: 36 (6)Pages: 1287-1294
© 2026 ScienceGate Book Chapters — All rights reserved.