{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T17:33:20Z","timestamp":1777570400687,"version":"3.51.4"},"reference-count":46,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,9,18]],"date-time":"2023-09-18T00:00:00Z","timestamp":1694995200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,9,18]],"date-time":"2023-09-18T00:00:00Z","timestamp":1694995200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Understanding gene expression processes necessitates the accurate classification and identification of transcription factors, which is supported by high-throughput sequencing technologies. However, these techniques suffer from inherent limitations such as time consumption and high costs. To address these challenges, the field of bioinformatics has increasingly turned to deep learning technologies for analyzing gene sequences. Nevertheless, the pursuit of improved experimental results has led to the inclusion of numerous complex analysis function modules, resulting in models with a growing number of parameters. To overcome these limitations, it is proposed a novel approach for analyzing DNA transcription factor sequences, which is\u00a0named as\u00a0DeepCAC. This method leverages deep convolutional neural networks with a multi-head self-attention mechanism. By employing convolutional neural networks, it can effectively capture local hidden features in the sequences. Simultaneously, the multi-head self-attention mechanism enhances the identification of hidden features with long-distant dependencies. This approach reduces the overall number of parameters in the model while harnessing the computational power of sequence data from multi-head self-attention. Through training with labeled data, experiments demonstrate that this approach significantly improves performance while requiring fewer parameters compared to existing methods. Additionally, the effectiveness of our approach \u00a0is validated\u00a0in accurately predicting DNA transcription factor sequences.<\/jats:p>","DOI":"10.1186\/s12859-023-05469-9","type":"journal-article","created":{"date-parts":[[2023,9,18]],"date-time":"2023-09-18T07:02:30Z","timestamp":1695020550000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["DeepCAC: a deep learning approach on DNA transcription factors classification based on multi-head self-attention and concatenate convolutional neural network"],"prefix":"10.1186","volume":"24","author":[{"given":"Jidong","family":"Zhang","sequence":"first","affiliation":[]},{"given":"Bo","family":"Liu","sequence":"additional","affiliation":[]},{"given":"Jiahui","family":"Wu","sequence":"additional","affiliation":[]},{"given":"Zhihan","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Jianqiang","family":"Li","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,9,18]]},"reference":[{"key":"5469_CR1","first-page":"1","volume":"30","author":"R Singh","year":"2017","unstructured":"Singh R, et al. Attend and predict: understanding gene regulation by selective attention on chromatin. Adv Neural Inf Process Syst. 2017;30:1.","journal-title":"Adv Neural Inf Process Syst"},{"issue":"1","key":"5469_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-018-03149-4","volume":"9","author":"SJ Clark","year":"2018","unstructured":"Clark SJ, et al. scNMT-seq enables joint profiling of chromatin accessibility DNA methylation and transcription in single cells. Nat Commun. 2018;9(1):1\u20139.","journal-title":"Nat Commun"},{"issue":"7457","key":"5469_CR3","doi-asserted-by":"publisher","first-page":"172","DOI":"10.1038\/nature12311","volume":"499","author":"D Ray","year":"2013","unstructured":"Ray D, et al. A compendium of RNA-binding motifs for decoding gene regulation. Nature. 2013;499(7457):172\u20137.","journal-title":"Nature"},{"issue":"1","key":"5469_CR4","doi-asserted-by":"publisher","first-page":"1053","DOI":"10.1146\/annurev.bi.61.070192.005201","volume":"61","author":"CO Pabo","year":"1992","unstructured":"Pabo CO, Sauer RT. Transcription factors: structural families and principles of DNA recognition. Annu Rev Biochem. 1992;61(1):1053\u201395.","journal-title":"Annu Rev Biochem"},{"issue":"3","key":"5469_CR5","doi-asserted-by":"publisher","first-page":"377","DOI":"10.1016\/j.cgh.2013.12.007","volume":"12","author":"JM Carethers","year":"2014","unstructured":"Carethers JM. DNA testing and molecular screening for colon cancer. Clin Gastroenterol Hepatol. 2014;12(3):377\u201381.","journal-title":"Clin Gastroenterol Hepatol"},{"issue":"7","key":"5469_CR6","doi-asserted-by":"publisher","first-page":"554","DOI":"10.1038\/nrg2364","volume":"9","author":"M Boutros","year":"2008","unstructured":"Boutros M, Ahringer J. The art and design of genetic screens: RNA interference. Nat Rev Genet. 2008;9(7):554\u201366.","journal-title":"Nat Rev Genet"},{"key":"5469_CR7","doi-asserted-by":"publisher","DOI":"10.1016\/j.artmed.2022.102349","volume":"131","author":"S Akbar","year":"2022","unstructured":"Akbar S, et al. cACP-DeepGram: classification of anticancer peptides via deep neural network and skip-gram-based word embedding model. Artif Intell Med. 2022;131: 102349.","journal-title":"Artif Intell Med"},{"issue":"12","key":"5469_CR8","doi-asserted-by":"publisher","first-page":"1505","DOI":"10.1101\/gr.5560806","volume":"16","author":"SM Johnson","year":"2006","unstructured":"Johnson SM, et al. Flexibility and constraint in the nucleosome core landscape of Caenorhabditis elegans chromatin. Genome Res. 2006;16(12):1505\u201316.","journal-title":"Genome Res"},{"issue":"5","key":"5469_CR9","doi-asserted-by":"publisher","first-page":"887","DOI":"10.1016\/j.cell.2008.02.022","volume":"132","author":"DE Schones","year":"2008","unstructured":"Schones DE, et al. Dynamic regulation of nucleosome positioning in the human genome. Cell. 2008;132(5):887\u201398.","journal-title":"Cell"},{"issue":"12","key":"5469_CR10","doi-asserted-by":"publisher","first-page":"1213","DOI":"10.1038\/nmeth.2688","volume":"10","author":"JD Buenrostro","year":"2013","unstructured":"Buenrostro JD, et al. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods. 2013;10(12):1213\u20138.","journal-title":"Nat Methods"},{"issue":"2","key":"5469_CR11","doi-asserted-by":"publisher","first-page":"W202","DOI":"10.1093\/nar\/gkp335","volume":"37","author":"TL Bailey","year":"2009","unstructured":"Bailey TL, et al. MEME SUITE: tools for motif discovery and searching. Nucl Acids Res. 2009;37(2):W202\u20138.","journal-title":"Nucl Acids Res"},{"issue":"3","key":"5469_CR12","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1007\/BF00994018","volume":"20","author":"C Cortes","year":"1995","unstructured":"Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273\u201397.","journal-title":"Mach Learn"},{"issue":"7","key":"5469_CR13","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1003711","volume":"10","author":"M Ghandi","year":"2014","unstructured":"Ghandi M, et al. Enhanced regulatory sequence prediction using gapped k-mer features. PLoS Comput Biol. 2014;10(7): e1003711.","journal-title":"PLoS Comput Biol"},{"key":"5469_CR14","first-page":"1","volume":"2","author":"Y LeCun","year":"1989","unstructured":"LeCun Y, et al. Handwritten digit recognition with a back-propagation network. Adv Neural Inf Process Syst. 1989;2:1.","journal-title":"Adv Neural Inf Process Syst"},{"issue":"8","key":"5469_CR15","doi-asserted-by":"publisher","first-page":"831","DOI":"10.1038\/nbt.3300","volume":"33","author":"B Alipanahi","year":"2015","unstructured":"Alipanahi B, et al. Predicting the sequence specificities of DNA-and RNA-binding proteins by deep learning. Nat Biotechnol. 2015;33(8):831\u20138.","journal-title":"Nat Biotechnol"},{"issue":"12","key":"5469_CR16","doi-asserted-by":"publisher","first-page":"i121","DOI":"10.1093\/bioinformatics\/btw255","volume":"32","author":"H Zeng","year":"2016","unstructured":"Zeng H, et al. Convolutional neural network architectures for predicting DNA\u2013protein binding. Bioinformatics. 2016;32(12):i121\u20137.","journal-title":"Bioinformatics"},{"issue":"10","key":"5469_CR17","doi-asserted-by":"publisher","first-page":"931","DOI":"10.1038\/nmeth.3547","volume":"12","author":"J Zhou","year":"2015","unstructured":"Zhou J, Troyanskaya OG. Predicting effects of noncoding variants with deep learning\u2013based sequence model. Nat Methods. 2015;12(10):931\u20134.","journal-title":"Nat Methods"},{"key":"5469_CR18","doi-asserted-by":"crossref","unstructured":"Salekin S, Zhang JM, Huang Y. A deep learning model for predicting transcription factor binding location at single nucleotide resolution. In: 2017 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI). IEEE; 2017.","DOI":"10.1109\/BHI.2017.7897204"},{"key":"5469_CR19","doi-asserted-by":"crossref","unstructured":"Gupta A, Rush AM. Dilated convolutions for modeling long-distance genomic dependencies. arXiv preprint arXiv:1710.01278, 2017.","DOI":"10.1101\/200857"},{"issue":"2","key":"5469_CR20","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1207\/s15516709cog1402_1","volume":"14","author":"JL Elman","year":"1990","unstructured":"Elman JL. Finding structure in time. Cogn Sci. 1990;14(2):179\u2013211.","journal-title":"Cogn Sci"},{"issue":"11","key":"5469_CR21","doi-asserted-by":"publisher","first-page":"e107","DOI":"10.1093\/nar\/gkw226","volume":"44","author":"D Quang","year":"2016","unstructured":"Quang D, Xie X. DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences. Nucl Acids Res. 2016;44(11):e107\u2013e107.","journal-title":"Nucl Acids Res"},{"issue":"8","key":"5469_CR22","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9(8):1735\u201380.","journal-title":"Neural Comput"},{"key":"5469_CR23","unstructured":"Yu D-J, Zhu Y-H, Bao X-R. DeepTF: Accurate prediction of transcription factor binding sites by combining multi-scale convolution and long short-term memory neural network. In: International Conference on Intelligent Science and Big Data Engineering; 2019."},{"key":"5469_CR24","doi-asserted-by":"crossref","unstructured":"He K, et al. Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR); 2016.","DOI":"10.1109\/CVPR.2016.90"},{"issue":"5","key":"5469_CR25","doi-asserted-by":"publisher","first-page":"bbab101","DOI":"10.1093\/bib\/bbab101","volume":"22","author":"L-C Shen","year":"2021","unstructured":"Shen L-C, et al. SAResNet: self-attention residual network for predicting DNA-protein binding. Briefings Bioinf. 2021;22(5):bbab101.","journal-title":"Briefings Bioinf"},{"issue":"1","key":"5469_CR26","doi-asserted-by":"publisher","first-page":"bbab525","DOI":"10.1093\/bib\/bbab525","volume":"23","author":"Y Zhang","year":"2022","unstructured":"Zhang Y, et al. A novel convolution attention model for predicting transcription factor binding sites by combination of sequence and shape. Briefings Bioinf. 2022;23(1):bbab525.","journal-title":"Briefings Bioinf"},{"issue":"4","key":"5469_CR27","doi-asserted-by":"publisher","first-page":"841","DOI":"10.1007\/s13042-019-00990-x","volume":"11","author":"YQ Zhang","year":"2020","unstructured":"Zhang YQ, et al. DeepSite: bidirectional LSTM and CNN models for predicting DNA-protein binding. Int J Mach Learn Cybern. 2020;11(4):841\u201351.","journal-title":"Int J Mach Learn Cybern"},{"key":"5469_CR28","first-page":"1","volume":"26","author":"M Denil","year":"2013","unstructured":"Denil M, et al. Predicting parameters in deep learning. Adv Neural Inf Process Syst. 2013;26:1.","journal-title":"Adv Neural Inf Process Syst"},{"key":"5469_CR29","doi-asserted-by":"crossref","unstructured":"Raza A, Tahir M, Alam W. iPro-TCN: Prediction of DNA Promoters Recognition and their Strength Using Temporal Convolutional Network. IEEE Access;2023.","DOI":"10.1109\/ACCESS.2023.3285197"},{"key":"5469_CR30","doi-asserted-by":"crossref","unstructured":"Lin T-Y, et al. Feature pyramid networks for object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2017.","DOI":"10.1109\/CVPR.2017.106"},{"issue":"4","key":"5469_CR31","doi-asserted-by":"publisher","first-page":"bbaa229","DOI":"10.1093\/bib\/bbaa229","volume":"22","author":"Y He","year":"2021","unstructured":"He Y, et al. A survey on deep learning in DNA\/RNA motif mining. Briefings Bioinf. 2021;22(4):bbaa229.","journal-title":"Briefings Bioinf"},{"key":"5469_CR32","unstructured":"Bahdanau D, Cho K, Bengio Y. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473; 2014."},{"key":"5469_CR33","first-page":"1","volume":"30","author":"A Vaswani","year":"2017","unstructured":"Vaswani A, et al. Attention is all you need. Adv Neural Inf Process Syst. 2017;30:1.","journal-title":"Adv Neural Inf Process Syst"},{"key":"5469_CR34","unstructured":"So D, Le Q, Liang C. The evolved transformer. In: International Conference on Machine Learning. PMLR; 2019"},{"key":"5469_CR35","unstructured":"Park J, et al. Bam: Bottleneck attention module. arXiv preprint arXiv:1807.06514; 2018."},{"key":"5469_CR36","doi-asserted-by":"crossref","unstructured":"Woo S, et al. Cbam: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV). 2018.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"5469_CR37","doi-asserted-by":"crossref","unstructured":"Bello I, et al. Attention augmented convolutional networks. In: Proceedings of the IEEE\/CVF international conference on computer vision. 2019.","DOI":"10.1109\/ICCV.2019.00338"},{"issue":"12","key":"5469_CR38","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pcbi.1007560","volume":"15","author":"PK Koo","year":"2019","unstructured":"Koo PK, Eddy SR. Representation learning of genomic sequence motifs with convolutional neural networks. Plos Comput Biol. 2019;15(12):1.","journal-title":"Plos Comput Biol"},{"issue":"2","key":"5469_CR39","doi-asserted-by":"publisher","DOI":"10.1088\/1742-5468\/ab633c","volume":"2020","author":"M Geiger","year":"2020","unstructured":"Geiger M, et al. Scaling description of generalization with number of parameters in deep learning. J Stat Mech: Theory Exp. 2020;2020(2): 023401.","journal-title":"J Stat Mech: Theory Exp"},{"key":"5469_CR40","doi-asserted-by":"crossref","unstructured":"Huang G, et al. Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2017.","DOI":"10.1109\/CVPR.2017.243"},{"issue":"15","key":"5469_CR41","doi-asserted-by":"publisher","first-page":"2112","DOI":"10.1093\/bioinformatics\/btab083","volume":"37","author":"YR Ji","year":"2021","unstructured":"Ji YR, et al. DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome. Bioinformatics. 2021;37(15):2112\u201320.","journal-title":"Bioinformatics"},{"key":"5469_CR42","doi-asserted-by":"publisher","DOI":"10.1016\/j.chemolab.2020.104103","volume":"204","author":"S Akbar","year":"2020","unstructured":"Akbar S, et al. iHBP-DeepPSSM: Identifying hormone binding proteins using PsePSSM based evolutionary features and deep learning approach. Chemom Intell Lab Syst. 2020;204: 104103.","journal-title":"Chemom Intell Lab Syst"},{"key":"5469_CR43","doi-asserted-by":"publisher","DOI":"10.1016\/j.compbiomed.2021.104778","volume":"137","author":"S Akbar","year":"2021","unstructured":"Akbar S, et al. iAtbP-Hyb-EnC: Prediction of antitubercular peptides via heterogeneous feature representation and genetic algorithm based ensemble learning model. Comput Biol Med. 2021;137: 104778.","journal-title":"Comput Biol Med"},{"issue":"1","key":"5469_CR44","doi-asserted-by":"publisher","first-page":"bbac562","DOI":"10.1093\/bib\/bbac562","volume":"24","author":"Y He","year":"2023","unstructured":"He Y, et al. Incorporating higher order network structures to improve miRNA\u2013disease association prediction based on functional modularity. Briefings Bioinf. 2023;24(1):bbac562.","journal-title":"Briefings Bioinf"},{"key":"5469_CR45","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1007\/s11704-018-8094-9","volume":"14","author":"S Akbar","year":"2020","unstructured":"Akbar S, et al. iRNA-PseTNC: identification of RNA 5-methylcytosine sites using hybrid vector space of pseudo nucleotide composition. Front Comput Sci. 2020;14:451\u201360.","journal-title":"Front Comput Sci"},{"key":"5469_CR46","doi-asserted-by":"crossref","unstructured":"Zhao B-W, et al. Fusing higher and lower-order biological information for drug repositioning via graph representation learning. IEEE Trans Emerg Top Comput 2023.","DOI":"10.1109\/TETC.2023.3239949"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05469-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-023-05469-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05469-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,21]],"date-time":"2024-05-21T14:01:52Z","timestamp":1716300112000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-023-05469-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,18]]},"references-count":46,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["5469"],"URL":"https:\/\/doi.org\/10.1186\/s12859-023-05469-9","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,18]]},"assertion":[{"value":"20 July 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 September 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 September 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"345"}}