JOURNAL ARTICLE

Prediction of drug target interaction based on under sampling strategy and random forest algorithm

Chen FengZhigang ZhaoZheng Yi RenKun LuYang YuWenyan Wang

Year: 2025 Journal:   PLoS ONE Vol: 20 (3)Pages: e0318420-e0318420   Publisher: Public Library of Science

Abstract

Drug target interactions (DTIs) play a crucial role in drug discovery and development. The prediction of DTIs based on computational method can effectively assist the experimental techniques for DTIs identification, which are time-consuming and expensive. However, the current computational models suffer from low accuracy and high false positive rate in the prediction of DTIs, especially for datasets with extremely unbalanced sample categories. To accurately identify the interaction between drugs and target proteins, a variety of descriptors that fully show the characteristic information of drugs and targets are extracted and applied to the integrated method random forest (RF) in this work. Here, the random projection method is adopted to reduce the feature dimension such that simplify the model calculation. In addition, to balance the number of samples in different categories, a down sampling method NearMiss (NM) which can control the number of samples is used. Based on the gold standard datasets (nuclear receptors, ion channel, GPCRs and enzymes), the proposed method achieves the auROC of 92.26%, 98.21%, 97.65%, 99.33%, respectively. The experimental results show that the proposed method yields significantly higher performance than that of state-of-the-art methods in predicting drug target interaction.

Keywords:
Random forest Computer science Sampling (signal processing) Random projection Projection (relational algebra) Feature (linguistics) Drug discovery Identification (biology) Drug target Dimension (graph theory) Algorithm Artificial intelligence Data mining Machine learning Pattern recognition (psychology) Bioinformatics Mathematics Pharmacology Medicine

Metrics

4
Cited By
20.07
FWCI (Field Weighted Citation Impact)
49
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Computational Drug Discovery Methods
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Machine Learning in Bioinformatics
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Protein Structure and Dynamics
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology

Related Documents

JOURNAL ARTICLE

Random Forest Algorithm for Enhanced Prediction of Drug Target Interactions

Shikha MehtaSparsh SharmaSagar AnandShreshth SharmaAditi Goel

Journal:   International Journal of Innovative Technology and Exploring Engineering Year: 2020 Vol: 9 (4)Pages: 2008-2012
JOURNAL ARTICLE

Random-forest model for drug–target interaction prediction via Kullback–Leibler divergence

Sang-Jin AhnSi Eun LeeMi‐Hyun Kim

Journal:   Journal of Cheminformatics Year: 2022 Vol: 14 (1)Pages: 67-67
JOURNAL ARTICLE

Prediction of Drug-Target Interaction Using Random Forest in Coronavirus 2019 Case

Aulia FadliAnnisa AnnisaWisnu Ananta Kusuma

Journal:   Bioinformatics and Biomedical Research Journal Year: 2021 Vol: 4 (1)Pages: 1-7
© 2026 ScienceGate Book Chapters — All rights reserved.