MAResNet: Predicting transcription factor binding sites by combining multi-scale bottom-up and top-down attention and residual network

Ke Han, Long Chen Shen, Yi Heng Zhu, Jian Xu, Jiangning Song, Dong Jun Yu

Research output: Contribution to journalArticleResearchpeer-review

8 Citations (Scopus)

Abstract

Accurate identification of transcription factor binding sites is of great significance in understanding gene expression, biological development and drug design. Although a variety of methods based on deep-learning models and large-scale data have been developed to predict transcription factor binding sites in DNA sequences, there is room for further improvement in prediction performance. In addition, effective interpretation of deep-learning models is greatly desirable. Here we present MAResNet, a new deep-learning method, for predicting transcription factor binding sites on 690 ChIP-seq datasets. More specifically, MAResNet combines the bottom-up and top-down attention mechanisms and a state-of-The-Art feed-forward network (ResNet), which is constructed by stacking attention modules that generate attention-Aware features. In particular, the multi-scale attention mechanism is utilized at the first stage to extract rich and representative sequence features. We further discuss the attention-Aware features learned from different attention modules in accordance with the changes as the layers go deeper. The features learned by MAResNet are also visualized through the TMAP tool to illustrate that the method can extract the unique characteristics of transcription factor binding sites. The performance of MAResNet is extensively tested on 690 test subsets with an average AUC of 0.927, which is higher than that of the current state-of-The-Art methods. Overall, this study provides a new and useful framework for the prediction of transcription factor binding sites by combining the funnel attention modules with the residual network.

Original languageEnglish
Article numberbbab445
Number of pages12
JournalBriefings in Bioinformatics
Volume23
Issue number1
DOIs
Publication statusPublished - Jan 2022

Keywords

  • deep learning
  • multi-scale bottom-up and top-down attention
  • residual network
  • sequence analysis
  • transcription factor binding site

Cite this