{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T02:03:18Z","timestamp":1767924198320,"version":"3.49.0"},"reference-count":45,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2022,6,27]],"date-time":"2022-06-27T00:00:00Z","timestamp":1656288000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61972422"],"award-info":[{"award-number":["61972422"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Fundamental Research Funds for the Central Universities of Central South University","award":["1053320211941"],"award-info":[{"award-number":["1053320211941"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,7,18]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The interplay between protein and nucleic acid participates in diverse biological activities. Accurately identifying the interaction between protein and nucleic acid can strengthen the understanding of protein function. However, conventional methods are too time-consuming, and computational methods are type-agnostic predictions. We proposed an ensemble predictor termed TSNAPred and first used it to identify residues that bind to A-DNA, B-DNA, ssDNA, mRNA, tRNA and rRNA. TSNAPred combines LightGBM and capsule network, both learned on the feature derived from protein sequence. TSNAPred utilizes the sliding window technique to extract long-distance dependencies between residues and a weighted ensemble strategy to enhance the prediction performance. The results show that TSNAPred can effectively identify type-specific nucleic acid binding residues in our test set. What is more, it also can discriminate DNA-binding and RNA-binding residues, which has improved 5% to 10% on the AUC value compared with other state-of-the-art methods. The dataset and code of TSNAPred are available at: https:\/\/github.com\/niewenjuan-csu\/TSNAPred.<\/jats:p>","DOI":"10.1093\/bib\/bbac244","type":"journal-article","created":{"date-parts":[[2022,6,26]],"date-time":"2022-06-26T23:43:51Z","timestamp":1656287031000},"source":"Crossref","is-referenced-by-count":7,"title":["TSNAPred: predicting type-specific nucleic acid binding residues via an ensemble approach"],"prefix":"10.1093","volume":"23","author":[{"given":"Wenjuan","family":"Nie","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Central South University , 410075, Changsha , China"}]},{"given":"Lei","family":"Deng","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Central South University , 410075, Changsha , China"}]}],"member":"286","published-online":{"date-parts":[[2022,6,27]]},"reference":[{"issue":"1","key":"2022071906183828400_ref1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/gb-2000-1-1-reviews001","article-title":"An overview of the structures of protein-dna complexes","volume":"1","author":"Luscombe","year":"2000","journal-title":"Genome Biol"},{"issue":"21","key":"2022071906183828400_ref2","doi-asserted-by":"crossref","first-page":"7364","DOI":"10.1093\/nar\/gkq617","article-title":"Genomic repertoires of dna-binding transcription factors across the tree of life","volume":"38","author":"Charoensawan","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2022071906183828400_ref3","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1007\/978-1-62703-709-9_23","article-title":"RNA-protein interactions: an overview","volume":"1097","author":"Re","year":"2014","journal-title":"Methods Mol Biol"},{"issue":"9","key":"2022071906183828400_ref4","doi-asserted-by":"crossref","first-page":"787","DOI":"10.1016\/j.chembiol.2003.09.002","article-title":"The process of structure-based drug design","volume":"10","author":"Anderson","year":"2003","journal-title":"Chem Biol"},{"issue":"15","key":"2022071906183828400_ref5","doi-asserted-by":"crossref","first-page":"5858","DOI":"10.1021\/jm100574m","article-title":"Understanding and predicting druggability. a high-throughput method for detection of drug binding sites","volume":"53","author":"Schmidtke","year":"2010","journal-title":"J Med Chem"},{"issue":"7","key":"2022071906183828400_ref6","doi-asserted-by":"crossref","first-page":"1043","DOI":"10.1261\/rna.410107","article-title":"X-ray crystallographic and nmr studies of protein\u2013protein and protein\u2013nucleic acid interactions involving the kh domains from human poly (c)-binding protein-2","volume":"13","author":"Zhihua","year":"2007","journal-title":"RNA"},{"issue":"8","key":"2022071906183828400_ref7","doi-asserted-by":"crossref","first-page":"1849","DOI":"10.1038\/nprot.2007.249","article-title":"Electrophoretic mobility shift assay (EMSA) for detecting protein-nucleic acid interactions","volume":"2","author":"Hellman","year":"2007","journal-title":"Nat Protoc"},{"key":"2022071906183828400_ref8","first-page":"289","article-title":"NMR studies of protein-nucleic acid interactions","volume":"278","author":"Varani","year":"2004","journal-title":"Methods Mol Biol"},{"issue":"1","key":"2022071906183828400_ref9","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2022071906183828400_ref10","doi-asserted-by":"crossref","first-page":"D506","DOI":"10.1093\/nar\/gky1049","article-title":"Uniprot: a worldwide hub of protein knowledge","volume":"47","author":"UniProt Consortium","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2022071906183828400_ref11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1752-0509-4-S2-S1","article-title":"Bindn+ for accurate prediction of dna and rna-binding residues from protein sequence features","volume":"4","author":"Wang","year":"2010","journal-title":"BMC Syst Biol"},{"issue":"10","key":"2022071906183828400_ref12","first-page":"e84","article-title":"Drnapred, fast sequence-based method that accurately predicts and discriminates dna-and rna-binding residues","volume":"45","author":"Yan","year":"2017","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"2022071906183828400_ref13","doi-asserted-by":"crossref","first-page":"1250","DOI":"10.1093\/bib\/bbx168","article-title":"Comprehensive review and empirical analysis of hallmarks of dna-, rna-and protein-binding residues in protein chains","volume":"20","author":"Zhang","year":"2019","journal-title":"Brief Bioinform"},{"issue":"14","key":"2022071906183828400_ref14","doi-asserted-by":"crossref","first-page":"i343","DOI":"10.1093\/bioinformatics\/btz324","article-title":"Scriber: accurate and partner type-specific prediction of protein-binding residues from proteins sequences","volume":"35","author":"Zhang","year":"2019","journal-title":"Bioinformatics"},{"issue":"7","key":"2022071906183828400_ref15","doi-asserted-by":"crossref","first-page":"2428","DOI":"10.1016\/j.jmb.2020.02.026","article-title":"Prona2020 predicts protein\u2013dna, protein\u2013rna, and protein\u2013protein binding proteins and residues from sequence","volume":"432","author":"Qiu","year":"2020","journal-title":"J Mol Biol"},{"issue":"12","key":"2022071906183828400_ref16","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-15-S12-S1","article-title":"Identification of single-stranded and double-stranded dna binding proteins based on protein structure","volume":"15","author":"Wang","year":"2014","journal-title":"BMC bioinformatics"},{"issue":"5","key":"2022071906183828400_ref17","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1038\/nrm.2017.130","article-title":"A brave new world of rna-binding proteins","volume":"19","author":"Hentze","year":"2018","journal-title":"Nat Rev Mol Cell Biol"},{"issue":"14","key":"2022071906183828400_ref18","doi-asserted-by":"crossref","first-page":"1977","DOI":"10.1016\/j.febslet.2008.03.004","article-title":"Rna-binding proteins and post-transcriptional gene regulation","volume":"582","author":"Glisovic","year":"2008","journal-title":"FEBS Lett"},{"issue":"4","key":"2022071906183828400_ref19","doi-asserted-by":"crossref","first-page":"943","DOI":"10.1093\/nar\/29.4.943","article-title":"Protein\u2013rna interactions: a structural analysis","volume":"29","author":"Jones","year":"2001","journal-title":"Nucleic Acids Res"},{"issue":"6","key":"2022071906183828400_ref20","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1093\/bib\/bbab336","article-title":"DNAgenie: accurate prediction of DNA-type-specific binding residues in protein sequences","volume":"22","author":"Zhang","year":"2021","journal-title":"Brief Bioinform"},{"issue":"4","key":"2022071906183828400_ref21","doi-asserted-by":"crossref","first-page":"1451","DOI":"10.1109\/TCBB.2019.2952338","article-title":"DeepDRBP-2L: A New Genome Annotation Predictor for Identifying DNA-Binding Proteins and RNA-Binding Proteins Using Convolutional Neural Network and Long Short-Term Memory","volume":"18","author":"Zhang","year":"2021","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"1","key":"2022071906183828400_ref22","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat Genet"},{"issue":"D1","key":"2022071906183828400_ref23","doi-asserted-by":"crossref","first-page":"D1096","DOI":"10.1093\/nar\/gks966","article-title":"Biolip: a semi-manually curated database for biologically relevant ligand\u2013protein interactions","volume":"41","author":"Yang","year":"2012","journal-title":"Nucleic Acids Res"},{"issue":"13","key":"2022071906183828400_ref24","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"issue":"1","key":"2022071906183828400_ref25","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1093\/bib\/bbv023","article-title":"A comprehensive comparative review of sequence-based predictors of dna-and rna-binding residues","volume":"17","author":"Yan","year":"2016","journal-title":"Brief Bioinform"},{"issue":"18","key":"2022071906183828400_ref26","doi-asserted-by":"crossref","first-page":"6879","DOI":"10.3390\/ijms21186879","article-title":"Comprehensive survey and comparative assessment of rna-binding residue predictions with analysis by rna type","volume":"21","author":"Wang","year":"2020","journal-title":"Int J Mol Sci"},{"issue":"11","key":"2022071906183828400_ref27","doi-asserted-by":"crossref","first-page":"3170","DOI":"10.1002\/prot.24682","article-title":"Accurate single-sequence prediction of solvent accessible surface area using local and global features","volume":"82","author":"Faraggi","year":"2014","journal-title":"Proteins: Structure, Function, and Bioinformatics"},{"issue":"2","key":"2022071906183828400_ref28","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1038\/nmeth.1818","article-title":"Hhblits: lightning-fast iterative protein sequence searching by hmm-hmm alignment","volume":"9","author":"Remmert","year":"2012","journal-title":"Nat Methods"},{"issue":"1","key":"2022071906183828400_ref29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-8-211","article-title":"Composition profiler: a tool for discovery and visualization of amino acid composition differences","volume":"8","author":"Vacic","year":"2007","journal-title":"BMC bioinformatics"},{"issue":"19","key":"2022071906183828400_ref30","first-page":"135","article-title":"Pdrlgb: precise dna-binding residue prediction using a light gradient boosting machine","volume":"19","author":"Deng","year":"2018","journal-title":"BMC bioinformatics"},{"issue":"W1","key":"2022071906183828400_ref31","doi-asserted-by":"crossref","first-page":"W329","DOI":"10.1093\/nar\/gky384","article-title":"Iupred2a: context-dependent prediction of protein disorder as a function of redox state and protein binding","volume":"46","author":"M\u00e9sz\u00e1ros","year":"2018","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2022071906183828400_ref32","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1093\/nar\/28.1.374","article-title":"Aaindex: amino acid index database","volume":"28","author":"Kawashima","year":"2000","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"2022071906183828400_ref33","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1093\/bioinformatics\/16.4.404","article-title":"The psipred protein structure prediction server","volume":"16","author":"McGuffin","year":"2000","journal-title":"Bioinformatics"},{"issue":"6","key":"2022071906183828400_ref34","doi-asserted-by":"crossref","first-page":"2189","DOI":"10.1109\/TCBB.2019.2932416","article-title":"Prediction of FMN Binding Sites in Electron Transport Chains Based on 2-D CNN and PSSM Profiles","volume":"18","author":"Le","year":"2021","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"23","key":"2022071906183828400_ref35","first-page":"1","article-title":"iprodna-capsnet: identifying protein-dna binding residues using capsule neural networks","volume":"20","author":"Nguyen","year":"2019","journal-title":"BMC bioinformatics"},{"issue":"17","key":"2022071906183828400_ref36","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped blast and psi-blast: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"issue":"Suppl","key":"2022071906183828400_ref37","doi-asserted-by":"crossref","first-page":"2247","DOI":"10.1093\/nar\/19.suppl.2247","article-title":"The swiss-prot protein sequence data bank","volume":"19","author":"Bairoch","year":"1991","journal-title":"Nucleic Acids Res"},{"issue":"22","key":"2022071906183828400_ref38","doi-asserted-by":"crossref","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"Henikoff","year":"1992","journal-title":"Proc Natl Acad Sci"},{"issue":"11","key":"2022071906183828400_ref39","doi-asserted-by":"crossref","first-page":"4337","DOI":"10.1073\/pnas.0607879104","article-title":"Predicting protein\u2013protein interactions based only on sequences information","volume":"104","author":"Shen","year":"2007","journal-title":"Proc Natl Acad Sci"},{"issue":"4","key":"2022071906183828400_ref40","first-page":"1","article-title":"Xgboost: extreme gradient boosting","volume":"1","author":"Chen","year":"2015","journal-title":"R package version 04-2"},{"key":"2022071906183828400_ref41","first-page":"3146","article-title":"Lightgbm: A highly efficient gradient boosting decision tree","volume":"30","author":"Ke","year":"2017","journal-title":"Advances in neural information processing systems"},{"key":"2022071906183828400_ref42","article-title":"Dynamic routing between capsules","volume-title":"Adavances in neural information processing systems","author":"Sabour"},{"issue":"1","key":"2022071906183828400_ref43","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-017-1792-8","article-title":"El_pssm-rt: Dna-binding residue prediction by integrating ensemble learning with pssm relation transformation","volume":"18","author":"Zhou","year":"2017","journal-title":"BMC bioinformatics"},{"issue":"Database issue","key":"2022071906183828400_ref44","doi-asserted-by":"crossref","first-page":"D364","DOI":"10.1093\/nar\/gku1028","article-title":"A series of PDB-related databanks for everyday needs","volume":"43","author":"Touw","year":"2015","journal-title":"Nucleic Acids Res"},{"issue":"7873","key":"2022071906183828400_ref45","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with alphafold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/4\/bbac244\/45016414\/bbac244.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/4\/bbac244\/45016414\/bbac244.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,19]],"date-time":"2022-07-19T06:19:34Z","timestamp":1658211574000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbac244\/6618235"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,27]]},"references-count":45,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2022,7,18]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbac244","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,7,18]]},"published":{"date-parts":[[2022,6,27]]},"article-number":"bbac244"}}