High-throughput chromatin information enables accurate tissue-specific prediction of transcription factor binding sites

Tom Whitington, Andrew C. Perkins, Timothy L. Bailey

Research output: Contribution to journalArticleResearchpeer-review

47 Citations (Scopus)

Abstract

In silico prediction of transcription factor binding sites (TFBSs) is central to the task of gene regulatory network elucidation. Genomic DNA sequence information provides a basis for these predictions, due to the sequence specificity of TF-binding events. However, DNA sequence alone is an impoverished source of information for the task of TFBS prediction in eukaryotes, as additional factors, such as chromatin structure regulate binding events. We show that incorporating high-throughput chromatin modification estimates can greatly improve the accuracy of in silico prediction of in vivo binding for a wide range of TFs in human and mouse. This improvement is superior to the improvement gained by equivalent use of either transcription start site proximity or phylogenetic conservation information. Importantly, predictions made with the use of chromatin structure information are tissue specific. This result supports the biological hypothesis that chromatin modulates TF binding to produce tissue-specific binding profiles in higher eukaryotes, and suggests that the use of chromatin modification information can lead to accurate tissue-specific transcriptional regulatory network elucidation.

Original languageEnglish
Pages (from-to)14-25
Number of pages12
JournalNucleic Acids Research
Volume37
Issue number1
DOIs
Publication statusPublished - 2009
Externally publishedYes

Cite this