JOURNAL ARTICLE

PROTEIN SUBCELLULAR MULTI-LOCALIZATION PREDICTION USING A MIN-MAX MODULAR SUPPORT VECTOR MACHINE

Yang YangBao‐Liang Lu

Year: 2010 Journal:   International Journal of Neural Systems Vol: 20 (01)Pages: 13-28   Publisher: World Scientific

Abstract

Prediction of protein subcellular localization is an important issue in computational biology because it provides important clues for the characterization of protein functions. Currently, much research has been dedicated to developing automatic prediction tools. Most, however, focus on mono-locational proteins, i.e., they assume that proteins exist in only one location. It should be noted that many proteins bear multi-locational characteristics and carry out crucial functions in biological processes. This work aims to develop a general pattern classifier for predicting multiple subcellular locations of proteins. We use an ensemble classifier, called the min-max modular support vector machine (M 3 -SVM), to solve protein subcellular multi-localization problems; and, propose a module decomposition method based on gene ontology (GO) semantic information for M 3 -SVM. The amino acid composition with secondary structure and solvent accessibility information is adopted to represent features of protein sequences. We apply our method to two multi-locational protein data sets. The M 3 -SVMs show higher accuracy and efficiency than traditional SVMs using the same feature vectors. And the GO decomposition also helps to improve prediction accuracy. Moreover, our method has a much higher rate of accuracy than existing subcellular localization predictors in predicting protein multi-localization.

Keywords:
Support vector machine Modular design Subcellular localization Computer science Protein subcellular localization prediction Artificial intelligence Relevance vector machine Pattern recognition (psychology) Computational biology Chemistry Biology Cell biology Biochemistry

Metrics

36
Cited By
3.32
FWCI (Field Weighted Citation Impact)
70
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Machine Learning in Bioinformatics
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Genomics and Phylogenetic Studies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
RNA and protein synthesis mechanisms
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
© 2026 ScienceGate Book Chapters — All rights reserved.