HYSYS: Have you swapped your samples?

Jan Schröder, Vincent Corbin, Anthony T. Papenfuss

Research output: Contribution to journalArticleResearchpeer-review

10 Citations (Scopus)

Abstract

Motivation: The application of a genomics assay to samples from a cohort is a frequently applied experimental design in cancer genomics studies. The collection and analysis of cancer sequencing data in the clinical setting is an elaborate process that may involve consenting patients, obtaining possibly-multiple DNA samples, sequencing and analysis. Many of these steps are manual. At any stage mistakes can occur that cause a DNA sample to be labelled incorrectly. However, there is a paucity of methods in the literature to identify such swaps specifically in cancer studies. Results: Here, we introduce a simple method, HYSYS, to estimate the relatedness of samples and test for sample swaps and contamination. The test uses the concordance of homozygous SNPs between samples. The method is motivated by the observation that homozygous germline population variants rarely change in the disease and are not affected by loss of heterozygosity. Our tools include visualization and a testing framework to flag possible sample swaps. We demonstrate the utility of this approach on a small cohort.

Original languageEnglish
Pages (from-to)596-598
Number of pages3
JournalBioinformatics
Volume33
Issue number4
DOIs
Publication statusPublished - 15 Feb 2017
Externally publishedYes

Cite this