Predicting proteolysis in complex proteomes using deep learning

Matiss Ozols, Alexander Eckersley, Christopher I. Platt, Callum Stewart-Mcguinness, Sarah A. Hibbert, Jerico Revote, Fuyi Li, Christopher E.M. Griffiths, Rachel E.B. Watson, Jiangning Song, Mike Bell, Michael J. Sherratt

Research output: Contribution to journalArticleResearchpeer-review

Abstract

Both protease-and reactive oxygen species (ROS)-mediated proteolysis are thought to be key effectors of tissue remodeling. We have previously shown that comparison of amino acid composition can predict the differential susceptibilities of proteins to photo-oxidation. However, predicting protein susceptibility to endogenous proteases remains challenging. Here, we aim to develop bioin-formatics tools to (i) predict cleavage site locations (and hence putative protein susceptibilities) and (ii) compare the predicted vulnerabilities of skin proteins to protease-and ROS-mediated proteolysis. The first goal of this study was to experimentally evaluate the ability of existing protease cleavage site prediction models (PROSPER and DeepCleave) to identify experimentally determined MMP9 cleavage sites in two purified proteins and in a complex human dermal fibroblast-derived extracellular matrix (ECM) proteome. We subsequently developed deep bidirectional recurrent neural network (BRNN) models to predict cleavage sites for 14 tissue proteases. The predictions of the new models were tested against experimental datasets and combined with amino acid composition analysis (to predict ultraviolet radiation (UVR)/ROS susceptibility) in a new web app: the Manchester proteome susceptibility calculator (MPSC). The BRNN models performed better in predicting cleavage sites in native dermal ECM proteins than existing models (DeepCleave and PROSPER), and application of MPSC to the skin proteome suggests that: compared with the elastic fiber network, fibrillar collagens may be susceptible primarily to protease-mediated proteolysis. We also identify additional putative targets of oxidative damage (dermatopontin, fibulins and defensins) and protease action (laminins and nidogen). MPSC has the potential to identify potential targets of proteolysis in disparate tissues and disease states.

Original languageEnglish
Article number3071
Number of pages20
JournalInternational Journal of Molecular Sciences
Volume22
Issue number6
DOIs
Publication statusPublished - Mar 2021

Keywords

  • Aging
  • Biomark-ers
  • Deep-learning
  • Degradomics
  • Extracellular matrix
  • Machine learning
  • Protease
  • Skin

Cite this