{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T08:14:48Z","timestamp":1769847288068,"version":"3.49.0"},"reference-count":51,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2021,10,19]],"date-time":"2021-10-19T00:00:00Z","timestamp":1634601600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01 AI111965"],"award-info":[{"award-number":["R01 AI111965"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"publisher","award":["DP120104460"],"award-info":[{"award-number":["DP120104460"]}],"id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"publisher","award":["LP110200333"],"award-info":[{"award-number":["LP110200333"]}],"id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000925","name":"National Health and Medical Research Council","doi-asserted-by":"publisher","award":["1127948"],"award-info":[{"award-number":["1127948"]}],"id":[{"id":"10.13039\/501100000925","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000925","name":"National Health and Medical Research Council","doi-asserted-by":"publisher","award":["1144652"],"award-info":[{"award-number":["1144652"]}],"id":[{"id":"10.13039\/501100000925","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100011291","name":"National Key Laboratory of Science and Technology on Communications","doi-asserted-by":"publisher","award":["JZX7Y202001SY000901"],"award-info":[{"award-number":["JZX7Y202001SY000901"]}],"id":[{"id":"10.13039\/501100011291","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["BK20201304"],"award-info":[{"award-number":["BK20201304"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["61872186"],"award-info":[{"award-number":["61872186"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["61772273"],"award-info":[{"award-number":["61772273"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["62072243"],"award-info":[{"award-number":["62072243"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,1,17]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Accurate identification of transcription factor binding sites is of great significance in understanding gene expression, biological development and drug design. Although a variety of methods based on deep-learning models and large-scale data have been developed to predict transcription factor binding sites in DNA sequences, there is room for further improvement in prediction performance. In addition, effective interpretation of deep-learning models is greatly desirable. Here we present MAResNet, a new deep-learning method, for predicting transcription factor binding sites on 690 ChIP-seq datasets. More specifically, MAResNet combines the bottom-up and top-down attention mechanisms and a state-of-the-art feed-forward network (ResNet), which is constructed by stacking attention modules that generate attention-aware features. In particular, the multi-scale attention mechanism is utilized at the first stage to extract rich and representative sequence features. We further discuss the attention-aware features learned from different attention modules in accordance with the changes as the layers go deeper. The features learned by MAResNet are also visualized through the TMAP tool to illustrate that the method can extract the unique characteristics of transcription factor binding sites. The performance of MAResNet is extensively tested on 690 test subsets with an average AUC of 0.927, which is higher than that of the current state-of-the-art methods. Overall, this study provides a new and useful framework for the prediction of transcription factor binding sites by combining the funnel attention modules with the residual network.<\/jats:p>","DOI":"10.1093\/bib\/bbab445","type":"journal-article","created":{"date-parts":[[2021,9,30]],"date-time":"2021-09-30T13:33:48Z","timestamp":1633008828000},"source":"Crossref","is-referenced-by-count":14,"title":["MAResNet: predicting transcription factor binding sites by combining multi-scale bottom-up and top-down attention and residual network"],"prefix":"10.1093","volume":"23","author":[{"given":"Ke","family":"Han","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Nanjing University of Science and Technology , 200 Xiaolingwei, Nanjing, 210094, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0045-4745","authenticated-orcid":false,"given":"Long-Chen","family":"Shen","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Nanjing University of Science and Technology , 200 Xiaolingwei, Nanjing, 210094, China"}]},{"given":"Yi-Heng","family":"Zhu","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Nanjing University of Science and Technology , 200 Xiaolingwei, Nanjing, 210094, China"}]},{"given":"Jian","family":"Xu","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Nanjing University of Science and Technology , 200 Xiaolingwei, Nanjing, 210094, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8031-9086","authenticated-orcid":false,"given":"Jiangning","family":"Song","sequence":"additional","affiliation":[{"name":"Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology , Monash University, Melbourne, VIC 3800, Australia"},{"name":"Monash Centre for Data Science, Faculty of Information Technology , Monash University, Melbourne, VIC 3800, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6786-8053","authenticated-orcid":false,"given":"Dong-Jun","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Nanjing University of Science and Technology , 200 Xiaolingwei, Nanjing, 210094, China"}]}],"member":"286","published-online":{"date-parts":[[2021,10,19]]},"reference":[{"key":"2022101900065573000_ref1","doi-asserted-by":"crossref","first-page":"1305","DOI":"10.1016\/S1357-2725(97)00085-X","article-title":"Transcription factors: an overview","volume":"29","author":"Latchman","year":"1997","journal-title":"Int J Biochem Cell Biol"},{"key":"2022101900065573000_ref2","first-page":"126","article-title":"Too many transcription factors: positive and negative interactions","volume":"2","author":"Karin","year":"1990","journal-title":"New Biol"},{"key":"2022101900065573000_ref3","doi-asserted-by":"crossref","first-page":"1790","DOI":"10.1093\/nar\/gkp1084","article-title":"DNA dynamics play a role as a basal transcription factor in the positioning and regulation of gene transcription initiation","volume":"38","author":"Alexandrov","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2022101900065573000_ref4","doi-asserted-by":"crossref","first-page":"8400","DOI":"10.1128\/JVI.75.18.8400-8406.2001","article-title":"Differential regulation of hepatitis B virus gene expression by the Sp1 transcription factor","volume":"75","author":"Li","year":"2001","journal-title":"J Virol"},{"key":"2022101900065573000_ref5","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1016\/j.cels.2017.07.004","article-title":"Mammalian transcription factor networks: recent advances in interrogating biological complexity","volume":"5","author":"Wilkinson","year":"2017","journal-title":"Cell systems"},{"key":"2022101900065573000_ref6","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1038\/nbt1053","article-title":"Assessing computational tools for the discovery of transcription factor binding sites","volume":"23","author":"Tompa","year":"2005","journal-title":"Nat Biotechnol"},{"key":"2022101900065573000_ref7","doi-asserted-by":"crossref","first-page":"1555","DOI":"10.1093\/bioinformatics\/btw024","article-title":"TFBSTools: an R\/bioconductor package for transcription factor binding site analysis","volume":"32","author":"Tan","year":"2016","journal-title":"Bioinformatics"},{"key":"2022101900065573000_ref8","doi-asserted-by":"crossref","first-page":"246","DOI":"10.2174\/1574893614666181212102030","article-title":"A review of DNA-binding proteins prediction methods","volume":"14","author":"Qu","year":"2019","journal-title":"Current Bioinformatics"},{"key":"2022101900065573000_ref9","doi-asserted-by":"crossref","first-page":"650","DOI":"10.1016\/j.cell.2018.01.029","article-title":"The human transcription factors","volume":"172","author":"Lambert","year":"2018","journal-title":"Cell"},{"key":"2022101900065573000_ref10","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1016\/j.csbj.2018.10.007","article-title":"iGHBP: computational identification of growth hormone binding proteins from sequences using extremely randomised tree","volume":"16","author":"Basith","year":"2018","journal-title":"Comput Struct Biotechnol J"},{"key":"2022101900065573000_ref11","doi-asserted-by":"crossref","first-page":"946","DOI":"10.1093\/bib\/bbz045","article-title":"Transcription factors\u2013DNA interactions in rice: identification and verification","volume":"21","author":"Shen","year":"2020","journal-title":"Brief Bioinform"},{"key":"2022101900065573000_ref12","doi-asserted-by":"crossref","first-page":"D108","DOI":"10.1093\/nar\/gkj143","article-title":"TRANSFAC\u00ae and its module TRANSCompel\u00ae: transcriptional gene regulation in eukaryotes","volume":"34","author":"Matys","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2022101900065573000_ref13","first-page":"D87","article-title":"JASPAR 2020: update of the open-access database of transcription factor binding profiles","volume":"48","author":"Fornes","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2022101900065573000_ref14","doi-asserted-by":"crossref","first-page":"i121","DOI":"10.1093\/bioinformatics\/btw255","article-title":"Convolutional neural network architectures for predicting DNA\u2013protein binding","volume":"32","author":"Zeng","year":"2016","journal-title":"Bioinformatics"},{"key":"2022101900065573000_ref15","doi-asserted-by":"crossref","first-page":"1405","DOI":"10.1093\/bioinformatics\/btz768","article-title":"Expectation pooling: an effective and interpretable pooling method for predicting DNA\u2013protein binding","volume":"36","author":"Luo","year":"2020","journal-title":"Bioinformatics"},{"key":"2022101900065573000_ref16","doi-asserted-by":"crossref","first-page":"831","DOI":"10.1038\/nbt.3300","article-title":"Predicting the sequence specificities of DNA-and RNA-binding proteins by deep learning","volume":"33","author":"Alipanahi","year":"2015","journal-title":"Nat Biotechnol"},{"key":"2022101900065573000_ref17","doi-asserted-by":"crossref","first-page":"e107","DOI":"10.1093\/nar\/gkw226","article-title":"DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences","volume":"44","author":"Quang","year":"2016","journal-title":"Nucleic Acids Res"},{"issue":"5","key":"2022101900065573000_ref18","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbab101","article-title":"SAResNet: self-attention residual network for predicting DNA-protein binding","volume":"22","author":"Shen","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022101900065573000_ref19","article-title":"High-resolution transcription factor binding sites prediction improved performance and interpretability by deep learning method","author":"Zhang","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022101900065573000_ref20","doi-asserted-by":"crossref","first-page":"e153","DOI":"10.1093\/nar\/gkt574","article-title":"DNA motif elucidation using belief propagation","volume":"41","author":"Wong","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2022101900065573000_ref21","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1003711","article-title":"Enhanced regulatory sequence prediction using gapped k-mer features","volume":"10","author":"Ghandi","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"2022101900065573000_ref22","first-page":"630","volume-title":"European conference on computer vision","author":"He","year":"2016"},{"key":"2022101900065573000_ref23","first-page":"770","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"He","year":"2016"},{"key":"2022101900065573000_ref24","volume-title":"arXiv preprint arXiv:1810.04805","author":"Devlin","year":"2018"},{"key":"2022101900065573000_ref25","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkab383","article-title":"PlantDeepSEA, a deep learning-based web service to predict the regulatory effects of genomic variants in plants","volume":"49","author":"Zhao","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2022101900065573000_ref26","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0251865","article-title":"Protein transfer learning improves identification of heat shock protein families","volume":"16","author":"Min","year":"2021","journal-title":"Plos one"},{"key":"2022101900065573000_ref27","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbab001","article-title":"Why can deep convolutional neural networks improve protein fold recognition? A visual explanation by interpretation","volume":"22","author":"Liu","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022101900065573000_ref28","doi-asserted-by":"crossref","first-page":"15270","DOI":"10.1038\/s41598-018-33321-1","article-title":"Recurrent neural network for predicting transcription factor binding sites","volume":"8","author":"Shen","year":"2018","journal-title":"Sci Rep"},{"key":"2022101900065573000_ref29","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1007\/s13042-019-00990-x","article-title":"DeepSite: bidirectional LSTM and CNN models for predicting DNA\u2013protein binding","volume":"11","author":"Zhang","year":"2020","journal-title":"International Journal of Machine Learning and Cybernetics"},{"key":"2022101900065573000_ref30","first-page":"126","volume-title":"International conference on intelligent science and big data engineering","author":"Bao","year":"2019"},{"key":"2022101900065573000_ref31","doi-asserted-by":"crossref","first-page":"680","DOI":"10.1093\/bioinformatics\/btq003","article-title":"CD-HIT suite: a web server for clustering and comparing biological sequences","volume":"26","author":"Huang","year":"2010","journal-title":"Bioinformatics"},{"key":"2022101900065573000_ref32","doi-asserted-by":"crossref","first-page":"i269","DOI":"10.1093\/bioinformatics\/btz339","article-title":"Comprehensive evaluation of deep learning architectures for prediction of DNA\/RNA sequence binding specificities","volume":"35","author":"Trabelsi","year":"2019","journal-title":"Bioinformatics"},{"key":"2022101900065573000_ref33","volume-title":"2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Fe","year":"2017"},{"key":"2022101900065573000_ref34","first-page":"7132","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"Hu","year":"2018"},{"key":"2022101900065573000_ref35","first-page":"5998","volume-title":"Advances in neural information processing systems","author":"Vaswani","year":"2017"},{"key":"2022101900065573000_ref36","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"Segnet: a deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2022101900065573000_ref37","first-page":"1520","volume-title":"Proceedings of the IEEE international conference on computer vision","author":"Noh","year":"2015"},{"key":"2022101900065573000_ref38","doi-asserted-by":"crossref","first-page":"7794","DOI":"10.1109\/CVPR.2018.00813","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR 2018)","author":"Wang","year":"2018"},{"key":"2022101900065573000_ref39","first-page":"1929","article-title":"Dropout: a simple way to prevent neural networks from overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J Machine Learn Res"},{"key":"2022101900065573000_ref40","first-page":"8026","article-title":"Pytorch: an imperative style, high-performance deep learning library","volume":"32","author":"Paszke","year":"2019","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2022101900065573000_ref41","first-page":"177","volume-title":"Large-scale machine learning with stochastic gradient descent","author":"Bottou","year":"2010"},{"key":"2022101900065573000_ref42","doi-asserted-by":"crossref","first-page":"1184","DOI":"10.1109\/TCBB.2018.2819660","article-title":"High-order convolutional neural network architecture for predicting DNA-protein binding sites","volume":"16","author":"Zhang","year":"2018","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2022101900065573000_ref43","doi-asserted-by":"crossref","first-page":"2205","DOI":"10.1093\/bioinformatics\/btw203","article-title":"gkmSVM: an R package for gapped-kmer SVM","volume":"32","author":"Ghandi","year":"2016","journal-title":"Bioinformatics"},{"key":"2022101900065573000_ref44","first-page":"1","article-title":"Visualization of very large high-dimensional data sets as minimum spanning trees","volume":"12","author":"Probst","year":"2020","journal-title":"J Chem"},{"key":"2022101900065573000_ref45","doi-asserted-by":"crossref","first-page":"bbaa171","DOI":"10.1093\/bib\/bbaa171","article-title":"An in silico approach to identification, categorization and prediction of nucleic acid binding proteins","volume":"22","author":"Xu","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022101900065573000_ref46","doi-asserted-by":"crossref","first-page":"1332","DOI":"10.3390\/cells8111332","article-title":"4mCpred-EL: an ensemble learning framework for identification of DNA N4-methylcytosine sites in the mouse genome","volume":"8","author":"Manavalan","year":"2019","journal-title":"Cell"},{"key":"2022101900065573000_ref47","doi-asserted-by":"crossref","first-page":"2796","DOI":"10.1093\/bioinformatics\/btz015","article-title":"i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome","volume":"35","author":"Chen","year":"2019","journal-title":"Bioinformatics"},{"key":"2022101900065573000_ref48","first-page":"1","volume-title":"BMC systems biology","author":"Xu","year":"2015"},{"key":"2022101900065573000_ref49","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1021\/acs.jcim.7b00397","article-title":"ATPbind: accurate protein\u2013ATP binding site prediction by combining sequence-profiling and structure-based comparisons","volume":"58","author":"Hu","year":"2018","journal-title":"J Chem Inf Model"},{"key":"2022101900065573000_ref50","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-021-24070-3","article-title":"Machine learning differentiates enzymatic and non-enzymatic metals in proteins","volume":"12","author":"Feehan","year":"2021","journal-title":"Nat Commun"},{"key":"2022101900065573000_ref51","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1016\/j.jtbi.2018.01.023","article-title":"PREvaIL, an integrative approach for inferring catalytic residues using sequence, structural, and network features in a machine-learning framework","volume":"443","author":"Song","year":"2018","journal-title":"J Theor Biol"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/1\/bbab445\/42230805\/bbab445.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/1\/bbab445\/42230805\/bbab445.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,9]],"date-time":"2023-11-09T19:09:32Z","timestamp":1699556972000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbab445\/6399874"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,19]]},"references-count":51,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,1,17]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbab445","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,1]]},"published":{"date-parts":[[2021,10,19]]},"article-number":"bbab445"}}