{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T14:56:28Z","timestamp":1776092188967,"version":"3.50.1"},"reference-count":55,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2024,7,16]],"date-time":"2024-07-16T00:00:00Z","timestamp":1721088000000},"content-version":"vor","delay-in-days":54,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"National Network Project, S2S","award":["BT\/PR40177\/BTIS\/137\/49\/2022"],"award-info":[{"award-number":["BT\/PR40177\/BTIS\/137\/49\/2022"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,5,23]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Unlike animals, variability in transcription factors (TFs) and their binding regions (TFBRs) across the plants species is a major problem that most of the existing TFBR finding software fail to tackle, rendering them hardly of any use. This limitation has resulted into underdevelopment of plant regulatory research and rampant use of Arabidopsis-like model species, generating misleading results. Here, we report a revolutionary transformers-based deep-learning approach, PTFSpot, which learns from TF structures and their binding regions\u2019 co-variability to bring a universal TF-DNA interaction model to detect TFBR with complete freedom from TF and species-specific models\u2019 limitations. During a series of extensive benchmarking studies over multiple experimentally validated data, it not only outperformed the existing software by &gt;30% lead but also delivered consistently &gt;90% accuracy even for those species and TF families that were never encountered during the model-building process. 
PTFSpot makes it possible now to accurately annotate TFBRs across any plant genome even in the total lack of any TF information, completely free from the bottlenecks of species and TF-specific models.<\/jats:p>","DOI":"10.1093\/bib\/bbae324","type":"journal-article","created":{"date-parts":[[2024,6,19]],"date-time":"2024-06-19T09:37:54Z","timestamp":1718789874000},"source":"Crossref","is-referenced-by-count":19,"title":["PTFSpot: deep co-learning on transcription factors and their binding regions attains impeccable universality in plants"],"prefix":"10.1093","volume":"25","author":[{"given":"Sagar","family":"Gupta","sequence":"first","affiliation":[{"name":"CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT) Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India), Biotechnology Division, , Palampur, Himachal Pradesh 176061, India"},{"name":"Academy of Scientific and Innovative Research (AcSIR) , Ghaziabad, Uttar Pradesh 201002, India"}]},{"given":"Veerbhan","family":"Kesarwani","sequence":"additional","affiliation":[{"name":"CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT) Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India), Biotechnology Division, , Palampur, Himachal Pradesh 176061, India"},{"name":"Academy of Scientific and Innovative Research (AcSIR) , Ghaziabad, Uttar Pradesh 201002, India"}]},{"given":"Umesh","family":"Bhati","sequence":"additional","affiliation":[{"name":"CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT) Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India), Biotechnology Division, , Palampur, Himachal Pradesh 176061, India"},{"name":"Academy of Scientific and Innovative Research (AcSIR) , Ghaziabad, 
Uttar Pradesh 201002, India"}]},{"family":"Jyoti","sequence":"additional","affiliation":[{"name":"CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT) Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India), Biotechnology Division, , Palampur, Himachal Pradesh 176061, India"},{"name":"Academy of Scientific and Innovative Research (AcSIR) , Ghaziabad, Uttar Pradesh 201002, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4004-8047","authenticated-orcid":false,"given":"Ravi","family":"Shankar","sequence":"additional","affiliation":[{"name":"CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT) Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India), Biotechnology Division, , Palampur, Himachal Pradesh 176061, India"},{"name":"Academy of Scientific and Innovative Research (AcSIR) , Ghaziabad, Uttar Pradesh 201002, India"}]}],"member":"286","published-online":{"date-parts":[[2024,7,16]]},"reference":[{"key":"2024072412264361100_ref1","first-page":"245","article-title":"Protein binding microarrays (PBMs) for the rapid, high-throughput characterization of the sequence specificities of DNA binding proteins","volume":"338","author":"Berger","year":"2006","journal-title":"Methods Mol Biol"},{"key":"2024072412264361100_ref2","doi-asserted-by":"crossref","first-page":"1497","DOI":"10.1126\/science.1141319","article-title":"Genome-wide mapping of in vivo protein-DNA interactions","volume":"316","author":"Johnson","year":"2007","journal-title":"Science"},{"key":"2024072412264361100_ref3","doi-asserted-by":"crossref","first-page":"1408","DOI":"10.1016\/j.cell.2011.11.013","article-title":"Comprehensive genome-wide protein-DNA interactions detected at single nucleotide 
resolution","volume":"147","author":"Rhee","year":"2011","journal-title":"Cell"},{"key":"2024072412264361100_ref4","doi-asserted-by":"crossref","first-page":"1659","DOI":"10.1038\/nprot.2017.055","article-title":"Mapping genome-wide transcription factor binding sites using DAP-seq","volume":"12","author":"Bartlett","year":"2017","journal-title":"Nat Protoc"},{"key":"2024072412264361100_ref5","first-page":"279","article-title":"Recent advances in ChIP-seq analysis: from quality management to whole-genome annotation","volume":"18","author":"Nakato","year":"2017","journal-title":"Brief Bioinform"},{"key":"2024072412264361100_ref6","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1003711","article-title":"Enhanced regulatory sequence prediction using gapped k-mer features","volume":"10","author":"Ghandi","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"2024072412264361100_ref7","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1007\/s12038-014-9437-9","article-title":"MiRNAting control of DNA methylation","volume":"39","author":"Jha","year":"2014","journal-title":"J Biosci"},{"key":"2024072412264361100_ref8","doi-asserted-by":"crossref","first-page":"831","DOI":"10.1038\/nbt.3300","article-title":"Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning","volume":"33","author":"Alipanahi","year":"2015","journal-title":"Nat Biotechnol"},{"key":"2024072412264361100_ref9","doi-asserted-by":"crossref","first-page":"15270","DOI":"10.1038\/s41598-018-33321-1","article-title":"Recurrent neural network for predicting transcription factor binding sites","volume":"8","author":"Shen","year":"2018","journal-title":"Sci Rep"},{"key":"2024072412264361100_ref10","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1186\/s12870-019-1693-2","article-title":"A k-mer grammar analysis to uncover maize regulatory architecture","volume":"19","author":"Mej\u00eda-Guerra","year":"2019","journal-title":"BMC Plant 
Biol"},{"key":"2024072412264361100_ref11","doi-asserted-by":"crossref","first-page":"7809","DOI":"10.1093\/nar\/gkz672","article-title":"Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework","volume":"47","author":"Yang","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024072412264361100_ref12","doi-asserted-by":"crossref","first-page":"i269","DOI":"10.1093\/bioinformatics\/btz339","article-title":"Comprehensive evaluation of deep learning architectures for prediction of DNA\/RNA sequence binding specificities","volume":"35","author":"Trabelsi","year":"2019","journal-title":"Bioinformatics"},{"key":"2024072412264361100_ref13","doi-asserted-by":"crossref","first-page":"8484","DOI":"10.1038\/s41598-019-44966-x","article-title":"Modeling in-vivo protein-DNA binding by combining multiple-instance learning with a hybrid deep neural network","volume":"9","author":"Zhang","year":"2019","journal-title":"Sci Rep"},{"key":"2024072412264361100_ref14","doi-asserted-by":"crossref","first-page":"bbab101","DOI":"10.1093\/bib\/bbab101","article-title":"SAResNet: self-attention residual network for predicting DNA-protein binding","volume":"22","author":"Shen","year":"2021","journal-title":"Brief Bioinform"},{"key":"2024072412264361100_ref15","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1038\/s42256-020-00282-y","article-title":"Deep neural networks identify sequence context features predictive of transcription factor binding","volume":"3","author":"Zheng","year":"2021","journal-title":"Nat Mach Intell"},{"key":"2024072412264361100_ref16","doi-asserted-by":"crossref","first-page":"1301","DOI":"10.1016\/j.tplants.2021.06.016","article-title":"Deep learning-based prediction of TFBSs in plants","volume":"26","author":"Shen","year":"2021","journal-title":"Trends Plant 
Sci"},{"key":"2024072412264361100_ref17","doi-asserted-by":"crossref","first-page":"260","DOI":"10.1093\/bioinformatics\/btaa1100","article-title":"TSPTFBS: a Docker image for trans-species prediction of transcription factor binding sites in plants","volume":"37","author":"Liu","year":"2021","journal-title":"Bioinformatics"},{"key":"2024072412264361100_ref18","doi-asserted-by":"crossref","first-page":"2112","DOI":"10.1093\/bioinformatics\/btab083","article-title":"DNABERT: pre-trained bidirectional encoder representations from transformers model for DNA-language in genome","volume":"37","author":"Ji","year":"2021","journal-title":"Bioinformatics"},{"key":"2024072412264361100_ref19","doi-asserted-by":"crossref","first-page":"1457","DOI":"10.1093\/pcp\/pcac095","article-title":"Exploiting genomic features to improve the prediction of transcription factor-binding sites in plants","volume":"63","author":"Rivi\u00e8re","year":"2022","journal-title":"Plant Cell Physiol"},{"key":"2024072412264361100_ref20","doi-asserted-by":"crossref","first-page":"bbac425","DOI":"10.1093\/bib\/bbac425","article-title":"PlantBind: an attention-based multi-label neural network for predicting plant transcription factor binding sites","volume":"23","author":"Yan","year":"2022","journal-title":"Brief Bioinform"},{"key":"2024072412264361100_ref21","first-page":"1175837","article-title":"TSPTFBS 2.0: trans-species prediction of transcription factor binding sites and identification of their core motifs in plants","volume":"14","author":"Cheng","year":"2023","journal-title":"Front Plant Sci"},
{"key":"2024072412264361100_ref22","doi-asserted-by":"crossref","first-page":"W56","DOI":"10.1093\/nar\/gkt437","article-title":"DNAshape: a method for the high-throughput prediction of DNA structural features on a genomic scale","volume":"41","author":"Zhou","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2024072412264361100_ref23","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1101\/gr.226530.117","article-title":"Local sequence features that influence AP-1 cis-regulatory activity","volume":"28","author":"Chaudhari","year":"2018","journal-title":"Genome Res"},{"key":"2024072412264361100_ref24","doi-asserted-by":"crossref","first-page":"6549","DOI":"10.1038\/s41467-021-26819-2","article-title":"Local DNA shape is a general principle of transcription factor binding specificity in Arabidopsis thaliana","volume":"12","author":"Sielemann","year":"2021","journal-title":"Nat Commun"},{"key":"2024072412264361100_ref25","doi-asserted-by":"crossref","first-page":"540","DOI":"10.1038\/s41467-019-14217-8","article-title":"Eukaryotic transcription factors can track and control their target genes using DNA antennas","volume":"11","author":"Castellanos","year":"2020","journal-title":"Nat Commun"},{"key":"2024072412264361100_ref26","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1016\/j.tcb.2020.03.003","article-title":"Transcription factors and DNA play Hide and Seek","volume":"30","author":"Suter","year":"2020","journal-title":"Trends Cell Biol"},{"key":"2024072412264361100_ref27","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1186\/s12862-019-1398-z","article-title":"Expression and regulatory asymmetry of retained Arabidopsis thaliana transcription factor genes derived from whole genome duplication","volume":"19","author":"Panchy","year":"2019","journal-title":"BMC Evol 
Biol"},{"key":"2024072412264361100_ref28","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1093\/aob\/mci008","article-title":"Mechanisms of recent genome size variation in flowering plants","volume":"95","author":"Bennetzen","year":"2005","journal-title":"Ann Bot"},{"key":"2024072412264361100_ref29","doi-asserted-by":"crossref","first-page":"5399","DOI":"10.1038\/s41467-019-13386-w","article-title":"Unraveling cis and trans regulatory evolution during cotton domestication","volume":"10","author":"Bao","year":"2019","journal-title":"Nat Commun"},{"key":"2024072412264361100_ref30","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1104\/pp.105.065110","article-title":"Transcription factor families have much higher expansion rates in plants than in animals","volume":"139","author":"Shiu","year":"2005","journal-title":"Plant Physiol"},{"key":"2024072412264361100_ref31","doi-asserted-by":"crossref","first-page":"981","DOI":"10.1038\/s41588-019-0411-1","article-title":"Similarity regression predicts evolution of transcription factor sequence specificity","volume":"51","author":"Lambert","year":"2019","journal-title":"Nat Genet"},{"key":"2024072412264361100_ref32","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.bbagrm.2016.08.005","article-title":"Diversity, expansion, and evolutionary novelty of plant DNA-binding transcription factor families","volume":"1860","author":"Lehti-Shiu","year":"2017","journal-title":"Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms"},{"key":"2024072412264361100_ref33","doi-asserted-by":"crossref","first-page":"D1155","DOI":"10.1093\/nar\/gky1081","article-title":"PlantPAN3.0: a new and updated resource for reconstructing transcriptional regulatory networks from ChIP-seq experiments in plants","volume":"47","author":"Chow","year":"2019","journal-title":"Nucleic Acids 
Res"},{"key":"2024072412264361100_ref34","doi-asserted-by":"crossref","first-page":"1280","DOI":"10.1016\/j.cell.2016.04.038","article-title":"Cistrome and Epicistrome features shape the regulatory DNA landscape","volume":"165","author":"O\u2019Malley","year":"2016","journal-title":"Cell"},{"key":"2024072412264361100_ref35","first-page":"D87","article-title":"JASPAR 2020: update of the open-access database of transcription factor binding profiles","volume":"48","author":"Fornes","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2024072412264361100_ref36","doi-asserted-by":"crossref","first-page":"D1040","DOI":"10.1093\/nar\/gkw982","article-title":"PlantTFDB 4.0: toward a central hub for transcription factors and regulatory interactions in plants","volume":"45","author":"Jin","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2024072412264361100_ref37","doi-asserted-by":"crossref","DOI":"10.1016\/j.isci.2021.103381","article-title":"RBPSpot: learning on appropriate contextual information for RBP binding sites discovery","volume":"24","author":"Sharma","year":"2021","journal-title":"iScience"},{"key":"2024072412264361100_ref38","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1007\/s12038-010-0013-7","article-title":"Flanking region sequence information to refine microRNA target predictions","volume":"35","author":"Heikham","year":"2010","journal-title":"J Biosci"},{"key":"2024072412264361100_ref39","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2024072412264361100_ref40","doi-asserted-by":"crossref","first-page":"3413","DOI":"10.1038\/s41467-022-30770-1","article-title":"ChIP-hub provides an integrative platform for exploring plant regulome","volume":"13","author":"Fu","year":"2022","journal-title":"Nat 
Commun"},{"key":"2024072412264361100_ref41","doi-asserted-by":"crossref","first-page":"6367","DOI":"10.1093\/nar\/gkaa383","article-title":"A unified dinucleotide alphabet describing both RNA and DNA structures","volume":"48","author":"\u010cern\u00fd","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2024072412264361100_ref42","article-title":"Attention is all you need","volume":"30","author":"Vaswani","year":"2017","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2024072412264361100_ref43","volume-title":"Adam: A Method for Stochastic Optimization","author":"Kingma"},{"key":"2024072412264361100_ref44","doi-asserted-by":"crossref","first-page":"1114","DOI":"10.1002\/prot.22002","article-title":"Structure-based prediction of transcription factor binding sites using a protein-DNA docking approach","volume":"72","author":"Liu","year":"2008","journal-title":"Proteins"},{"key":"2024072412264361100_ref45","doi-asserted-by":"crossref","DOI":"10.1109\/CVPR.2017.243","article-title":"Densely connected convolutional networks","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Huang"},{"key":"2024072412264361100_ref46","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1186\/s12864-019-6413-7","article-title":"The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation","volume":"21","author":"Chicco","year":"2020","journal-title":"BMC Genomics"},{"key":"2024072412264361100_ref47","doi-asserted-by":"crossref","first-page":"12621","DOI":"10.1038\/ncomms12621","article-title":"Sequences flanking the core-binding site modulate glucocorticoid receptor structure and activity","volume":"7","author":"Sch\u00f6ne","year":"2016","journal-title":"Nat Commun"},{"key":"2024072412264361100_ref48","doi-asserted-by":"crossref","first-page":"11883","DOI":"10.1093\/nar\/gky1057","article-title":"Flexibility and structure of 
flanking DNA impact transcription factor affinity for its core motif","volume":"46","author":"Yella","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024072412264361100_ref49","doi-asserted-by":"crossref","first-page":"W535","DOI":"10.1093\/nar\/gkt448","article-title":"PscanChIP: finding over-represented transcription factor-binding site motifs and their correlations in sequences from ChIP-Seq experiments","volume":"41","author":"Zambelli","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2024072412264361100_ref50","doi-asserted-by":"crossref","first-page":"E1291","DOI":"10.1073\/pnas.1621150114","article-title":"Systematic dissection of genomic features determining transcription factor binding and enhancer function","volume":"114","author":"Grossman","year":"2017","journal-title":"Proc Natl Acad Sci"},{"key":"2024072412264361100_ref51","doi-asserted-by":"crossref","first-page":"1470","DOI":"10.1110\/ps.690101","article-title":"A normalized root-mean-square distance for comparing protein three-dimensional structures","volume":"10","author":"Carugo","year":"2001","journal-title":"Protein Sci"},{"key":"2024072412264361100_ref52","article-title":"Comprehensive evaluation of plant transcription factors binding sites discovery tools","volume-title":"bioRxiv","author":"Jyoti","year":"2023"},{"key":"2024072412264361100_ref53","doi-asserted-by":"crossref","first-page":"2276","DOI":"10.1101\/gr.275658.121","article-title":"Evolutionary rewiring of the wheat transcriptional regulatory network by lineage-specific transposable elements","volume":"31","author":"Zhang","year":"2021","journal-title":"Genome Res"},{"key":"2024072412264361100_ref54","doi-asserted-by":"crossref","first-page":"8822","DOI":"10.1038\/ncomms9822","article-title":"Transcriptional regulation of PIN genes by FOUR LIPS and MYB88 during Arabidopsis root gravitropism","volume":"6","author":"Wang","year":"2015","journal-title":"Nat 
Commun"},{"key":"2024072412264361100_ref55","doi-asserted-by":"crossref","first-page":"787","DOI":"10.1093\/mp\/ssr103","article-title":"The role of PIN auxin efflux carriers in polar auxin transport and accumulation and their effect on shaping maize development","volume":"5","author":"Forestan","year":"2012","journal-title":"Mol Plant"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/4\/bbae324\/58627309\/bbae324.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/4\/bbae324\/58627309\/bbae324.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,24]],"date-time":"2024-07-24T22:46:13Z","timestamp":1721861173000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbae324\/7714599"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,23]]},"references-count":55,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,5,23]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbae324","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.11.16.567355","asserted-by":"object"}]},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,7]]},"published":{"date-parts":[[2024,5,23]]},"article-number":"bbae324"}}