Abstract
Anti-cancer peptides (ACPs) are known as potential therapeutics for cancer. Due to their unique ability to target cancer cells without affecting healthy cells directly, they have been extensively studied. Many peptide-based drugs are currently being evaluated in preclinical and clinical trials. Accurate identification of ACPs has received considerable attention in recent years; as such, a number of machine learning-based methods for in silico identification of ACPs have been developed. These methods promote, to some extent, research on the mechanism of ACP therapeutics against cancer. They differ widely in their training/testing datasets, machine learning algorithms, feature encoding schemes, feature selection methods and evaluation strategies. Therefore, it is desirable to summarize the advantages and disadvantages of the existing methods and to provide useful insights and suggestions for the development and improvement of novel computational tools to characterize and identify ACPs. With this in mind, we first comprehensively investigate 16 state-of-the-art predictors for ACPs in terms of their core algorithms, feature encoding schemes, performance evaluation metrics and webserver/software usability. Then, a comprehensive performance assessment is conducted to evaluate the robustness and scalability of the existing predictors using a well-prepared benchmark dataset. We provide potential strategies for improving model performance. Moreover, we propose a novel ensemble learning framework, termed ACPredStackL, for the accurate identification of ACPs. ACPredStackL is developed based on the stacking ensemble strategy combined with SVM, Naïve Bayesian, LightGBM and KNN. Empirical benchmarking experiments against the state-of-the-art methods demonstrate that ACPredStackL achieves comparable performance for predicting ACPs.
The webserver and source code of ACPredStackL are freely available at http://bigdata.biocie.cn/ACPredStackL/ and https://github.com/liangxiaoq/ACPredStackL, respectively.
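For readers unfamiliar with the stacking strategy named in the abstract, the following is a minimal sketch in scikit-learn, not the authors' implementation. Several points are assumptions made for illustration: the features are synthetic data from `make_classification` rather than encoded peptides, `GradientBoostingClassifier` stands in for LightGBM so that the block depends on scikit-learn only, and logistic regression is an assumed meta-learner, since the abstract does not state which one ACPredStackL uses.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for encoded peptide features (NOT real ACP data).
X, y = make_classification(
    n_samples=300, n_features=20, n_informative=8, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Base learners mirroring the four families named in the abstract.
# SVM and KNN are distance-based, so they are wrapped with a scaler.
base_learners = [
    ("svm", make_pipeline(StandardScaler(), SVC(probability=True, random_state=0))),
    ("nb", GaussianNB()),
    # Assumption: scikit-learn gradient boosting used in place of LightGBM.
    ("gbdt", GradientBoostingClassifier(random_state=0)),
    ("knn", make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5))),
]

# The meta-learner is trained on out-of-fold predictions of the base
# learners (cv=5), which is what distinguishes stacking from simple voting.
stack = StackingClassifier(
    estimators=base_learners,
    final_estimator=LogisticRegression(max_iter=1000),  # assumed meta-learner
    cv=5,
)
stack.fit(X_train, y_train)

test_accuracy = stack.score(X_test, y_test)
print(f"held-out accuracy on synthetic data: {test_accuracy:.3f}")
```

The printed accuracy reflects the synthetic data only and says nothing about ACP prediction performance; with real data, `X` would be replaced by whichever peptide feature encodings are chosen.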
Original language | English |
---|---|
Pages (from-to) | 1-17 |
Number of pages | 17 |
Journal | Briefings in Bioinformatics |
Volume | 22 |
Issue number | 4 |
Publication status | Published - Jul 2021 |
Keywords
- anti-cancer peptides
- bioinformatics
- ensemble learning
- performance assessment
- prediction
- sequence analysis
Projects
- ARC Centre of Excellence in Advanced Molecular Imaging
  Whisstock, J. (Primary Chief Investigator (PCI)), Abbey, B. (Chief Investigator (CI)), Nugent, K. A. (Chief Investigator (CI)), Quiney, H. M. (Chief Investigator (CI)), Godfrey, D. I. (Chief Investigator (CI)), Heath, W. (Chief Investigator (CI)), Fairlie, D. P. (Chief Investigator (CI)), Chapman, H. (Partner Investigator (PI)), Peele, A. (Partner Investigator (PI)), Davey, J. (Partner Investigator (PI)) & Wittmann, A. (Project Manager)
  30/06/14 → 31/03/21
  Project: Research
- Stochastic modelling of telomere length regulation in ageing research
  Tian, T. (Primary Chief Investigator (PCI)) & Song, J. (Chief Investigator (CI))
  Australian Research Council (ARC), Monash University
  3/01/12 → 30/10/17
  Project: Research
- Characterisation of plant cysteine proteases with therapeutic potential
  Pike, R. (Primary Chief Investigator (PCI)), Song, J. (Chief Investigator (CI)), Whisstock, J. (Chief Investigator (CI)) & Mynott, T. (Partner Investigator (PI))
  Australian Research Council (ARC), Sarantis Limited
  1/07/11 → 30/06/14
  Project: Research