CAPER 3.0: A scalable cloud-based system for data-intensive analysis of chromosome-centric human proteome project data sets

Shuai Yang, Xinlei Zhang, Lihong Diao, Feifei Guo, Dan Wang, Zhongyang Liu, Honglei Li, Junjie Zheng, Jingshan Pan, Edouard C Nice, Dong Li, Fuchu He

Research output: Contribution to journal > Article > Research > peer-review

7 Citations (Scopus)

Abstract

The Chromosome-centric Human Proteome Project (C-HPP) aims to catalog genome-encoded proteins using a chromosome-by-chromosome strategy. As the C-HPP proceeds, the increasing requirement for data-intensive analysis of the MS/MS data poses a challenge to the proteomic community, especially small laboratories lacking computational infrastructure. To address this challenge, we have upgraded the previous CAPER browser to CAPER 3.0, a scalable cloud-based system for data-intensive analysis of C-HPP data sets. CAPER 3.0 uses cloud computing technology to facilitate MS/MS-based peptide identification. In particular, it can run on both public and private clouds, facilitating the analysis of C-HPP data sets. CAPER 3.0 provides a graphical user interface (GUI) to help users transfer data, configure jobs, track progress, and visualize the results comprehensively. These features enable users without programming expertise to easily conduct data-intensive analysis using CAPER 3.0. Here, we illustrate the usage of CAPER 3.0 with four specific mass spectral data-intensive problems: detecting novel peptides, identifying single amino acid variants (SAVs) derived from known missense mutations, identifying sample-specific SAVs, and identifying exon-skipping events. CAPER 3.0 is available at http://prodigy.bprc.ac.cn/caper3.
Original language: English
Pages (from-to): 3720-3728
Number of pages: 9
Journal: Journal of Proteome Research
Volume: 14
Issue number: 9
Publication status: Published - 2015
