ProBAPred: Inferring protein-protein binding affinity by incorporating protein sequence and structural features

Bangli Lu, Chen Li, Qingfeng Chen, Jiangning Song

Research output: Contribution to journalArticleResearchpeer-review

3 Citations (Scopus)


Protein-protein binding interaction is the most prevalent biological activity that mediates a great variety of biological processes. The increasing availability of experimental data of protein-protein interaction allows a systematic construction of protein-protein interaction networks, significantly contributing to a better understanding of protein functions and their roles in cellular pathways and human diseases. Compared to well-established classification for protein-protein interactions (PPIs), limited work has been conducted for estimating protein-protein binding free energy, which can provide informative real-value regression models for characterizing the protein-protein binding affinity. In this study, we propose a novel ensemble computational framework, termed ProBAPred (Protein-protein Binding Affinity Predictor), for quantitative estimation of protein-protein binding affinity. A large number of sequence and structural features, including physical-chemical properties, binding energy and conformation annotations, were collected and calculated from currently available protein binding complex datasets and the literature. Feature selection based on the WEKA package was performed to identify and characterize the most informative and contributing feature subsets. Experiments on the independent test showed that our ensemble method achieved the lowest Mean Absolute Error (MAE; 1.657kcal/mol) and the second highest correlation coefficient (R-value=0.467), compared with the existing methods. The datasets and source codes of ProBAPred, and the supplementary materials in this study can be downloaded at for academic use. We anticipate that the developed ProBAPred regression models can facilitate computational characterization and experimental studies of protein-protein binding affinity.

Original languageEnglish
Article number1850011
Number of pages18
JournalJournal of Bioinformatics and Computational Biology
Issue number4
Publication statusPublished - 29 Jun 2018


  • feature selection
  • Protein-protein binding affinity
  • regression model
  • sequence-derived features
  • structural features

Cite this