Mining folded proteomes in the era of accurate structure prediction

Research output: Contribution to journalArticleResearchpeer-review

8 Citations (Scopus)

Abstract

Protein structure fundamentally underpins the function and processes of numerous biological systems. Fold recognition algorithms offer a sensitive and robust tool to detect structural, and thereby functional, similarities between distantly related homologs. In the era of accurate structure prediction owing to advances in machine learning techniques and a wealth of experimentally determined structures, previously curated sequence databases have become a rich source of biological information. Here, we use bioinformatic fold recognition algorithms to scan the entire AlphaFold structure database to identify novel protein family members, infer function and group predicted protein structures. As an example of the utility of this approach, we identify novel, previously unknown members of various pore-forming protein families, including MACPFs, GSDMs and aerolysin-like proteins.

Original languageEnglish
Article numbere1009930
Number of pages10
JournalPLoS Computational Biology
Volume18
Issue number3
DOIs
Publication statusPublished - Mar 2022

Cite this