Abstract
Glycosylation plays an important role in cell-cell adhesion, ligand-binding and subcellular recognition. Current approaches for predicting protein glycosylation are primarily based on sequence-derived features, while little work has been done to systematically assess the importance of structural features to glycosylation prediction. Here, we propose a novel bioinformatics method called GlycoMinestruct (http://glycomine.erc.monash.edu/Lab/GlycoMine-Struct/) for improved prediction of human N- and O-linked glycosylation sites by combining sequence and structural features in an integrated computational framework with a two-step feature-selection strategy. Experiments indicated that GlycoMinestruct outperformed NGlycPred, the only predictor that incorporated both sequence and structure features, achieving AUC values of 0.941 and 0.922 for N- and O-linked glycosylation, respectively, on an independent test dataset. We applied GlycoMinestruct to screen the human structural proteome and obtained high-confidence predictions for N- and O-linked glycosylation sites. GlycoMinestruct can be used as a powerful tool to expedite the discovery of glycosylation events and substrates to facilitate hypothesis-driven experimental studies.
| Original language | English |
|---|---|
| Article number | 34595 |
| Number of pages | 16 |
| Journal | Scientific Reports |
| Volume | 6 |
| DOIs | |
| Publication status | Published - 6 Oct 2016 |
Keywords
- glycosylation
- protein structure predictions
Projects
- 1 Finished
-
NHMRC Program in Cellular Microbiology
Lithgow, T. (Primary Chief Investigator (PCI)), Dougan, G. (Chief Investigator (CI)) & Strugnell, R. A. (Chief Investigator (CI))
NHMRC - National Health and Medical Research Council (Australia)
1/01/16 → 31/12/20
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver