TY - JOUR
T1 - Mining folded proteomes in the era of accurate structure prediction
AU - Bayly-Jones, Charles
AU - Whisstock, James C.
N1 - Publisher Copyright:
© 2022 Bayly-Jones, Whisstock. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2022/3
Y1 - 2022/3
N2 - Protein structure fundamentally underpins the function and processes of numerous biological systems. Fold recognition algorithms offer a sensitive and robust tool to detect structural, and thereby functional, similarities between distantly related homologs. In the era of accurate structure prediction owing to advances in machine learning techniques and a wealth of experimentally determined structures, previously curated sequence databases have become a rich source of biological information. Here, we use bioinformatic fold recognition algorithms to scan the entire AlphaFold structure database to identify novel protein family members, infer function and group predicted protein structures. As an example of the utility of this approach, we identify novel, previously unknown members of various pore-forming protein families, including MACPFs, GSDMs and aerolysin-like proteins.
AB - Protein structure fundamentally underpins the function and processes of numerous biological systems. Fold recognition algorithms offer a sensitive and robust tool to detect structural, and thereby functional, similarities between distantly related homologs. In the era of accurate structure prediction owing to advances in machine learning techniques and a wealth of experimentally determined structures, previously curated sequence databases have become a rich source of biological information. Here, we use bioinformatic fold recognition algorithms to scan the entire AlphaFold structure database to identify novel protein family members, infer function and group predicted protein structures. As an example of the utility of this approach, we identify novel, previously unknown members of various pore-forming protein families, including MACPFs, GSDMs and aerolysin-like proteins.
UR - http://www.scopus.com/inward/record.url?scp=85128245696&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1009930
DO - 10.1371/journal.pcbi.1009930
M3 - Article
C2 - 35333855
AN - SCOPUS:85128245696
SN - 1553-734X
VL - 18
JO - PLoS Computational Biology
JF - PLoS Computational Biology
IS - 3
M1 - e1009930
ER -