Duygu GeçkinGüleser Kalaycı Demir
A wide range of biological processes, including signal transmission, immunological responses, and metabolic cycles, are impacted by protein-protein interactions. These interactions have enormous implications for figuring out the origins of diseases and creating treatments. However, experimental methods for identifying PPIs are resource-intensive, time-consuming, and have limited coverage. Thus, computational techniques are essential to help and enhance activities related to protein identification. This study aims to build a deep learning network for predicting protein-protein interactions using only sequence information. Three different encoding methods are used to encode protein sequences: Binary Encoding, Autocovariance, and Position Specific Scoring Matrix. In order to predict protein-protein interactions, a convolutional Siamese neural network is employed to find complex patterns between protein sequence pairs. This network consists of two identical subnetworks with matched parameters. When applied to the human dataset, the suggested technique shows strong prediction performance with an accuracy of 84.07%, sensitivity of 92.45%, and precision of 91.45% for the model using the PSSM protein representation approach. An ensemble approach is suggested to combine the outputs from these three encoders because it is known that different encoding techniques capture various aspects of the same protein sequence. The accuracy obtained increased to 86.27% for the ensemble approach on the test set, with a sensitivity of 93.07% and a precision of 92.15%. The outcome highlights the importance of integrating several encoding methods to benefit from their complementary features and raise the accuracy of protein-protein interaction prediction.
Duygu GeçkinGüleser Kalaycı Demir
Duygu GeçkinGüleser Kalaycı Demir
Long ZhangGuoxian YuDawen XiaJun Wang
Wenqi ChenShuang WangTao SongXue LiPeifu HanChangnan Gao
Hongli GaoCheng ChenShuangyi LiCongjing WangWeifeng ZhouBin Yu