Olga O. LebedenkoMikhail S. PolovinkinAnastasiia A. KazovskaiaNikolai R. Skrynnikov
ABSTRACT In this communication, we introduce a new structure‐based affinity predictor for protein–protein complexes. This predictor, dubbed PCANN (Protein Complex Affinity by Neural Network), uses the ESM‐2 language model to encode the information about protein binding interfaces and graph attention network (GAT) to parlay this information into predictions. In the tests employing two previously unused literature‐extracted datasets, PCANN performed better than the best of the publicly available predictors, BindPPI, with mean absolute error (MAE) of 1.3 versus 1.4 kcal/mol. Further progress in the development of predictors using deep learning models is faced with two problems: (i) the amount of experimental data available to train and test new predictors is limited and (ii) the available data are often not very accurate and lack internal consistency with respect to measurement conditions. These issues can be potentially addressed through an AI‐leveraged literature search followed by careful human curation and by introducing additional parameters to account for variations in experimental conditions.
Elizaveta Alexandrovna BogdanovaValery Novoseletsky
Xingjian XuChunmei WangGuo‐Wei WeiJiahui Chen
Xu HongXiaoxue TongJuan XiePinyu LiuXudong LiuQi SongSen LiuShiyong LiuShiyong LiuShiyong Liu
Yi‐Xin WangYibo ZhuXiumin ShiLu Wang