ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides

Leyi Wei, Chen Zhou, Huangrong Chen, Jiangning Song, Ran Su

Research output: Contribution to journalArticleResearchpeer-review

85 Citations (Scopus)


Motivation: Anti-cancer peptides (ACPs) have recently emerged as promising therapeutic agents for cancer treatment. Due to the avalanche of protein sequence data in the post-genomic era, there is an urgent need to develop automated computational methods to enable fast and accurate identification of novel ACPs within the vast number of candidate proteins and peptides. Results: To address this, we propose a novel predictor named Anti-Cancer peptide Predictor with Feature representation Learning (ACPred-FL) for accurate prediction of ACPs based on sequence information. More specifically, we develop an effective feature representation learning model, with which we can extract and learn a set of informative features from a pool of support vector machine-based models trained using sequence-based feature descriptors. By doing so, the class label information of data samples is fully utilized. To improve the feature representation, we further employ a two-step feature selection technique, resulting in a most informative five-dimensional feature vector for the final peptide representation. Experimental results show that such five features provide the most discriminative power for identifying ACPs than currently available feature descriptors, highlighting the effectiveness of the proposed feature representation learning approach. The developed ACPred-FL method significantly outperforms state-of-the-art methods. Availability and implementation: The web-server of ACPred-FL is available at Supplementary information: Supplementary data are available at Bioinformatics online.

Original languageEnglish
Pages (from-to)4007-4016
Number of pages10
Issue number23
Publication statusPublished - 1 Dec 2018

Cite this