{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T18:48:58Z","timestamp":1774896538087,"version":"3.50.1"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Nucleotides are multifunctional molecules that are essential for numerous biological processes. They serve as sources for chemical energy, participate in the cellular signaling and they are involved in the enzymatic reactions. The knowledge of the nucleotide\u2013protein interactions helps with annotation of protein functions and finds applications in drug design.<\/jats:p>\n               <jats:p>Results: We propose a novel ensemble of accurate high-throughput predictors of binding residues from the protein sequence for ATP, ADP, AMP, GTP and GDP. Empirical tests show that our NsitePred method significantly outperforms existing predictors and approaches based on sequence alignment and residue conservation scoring. The NsitePred accurately finds more binding residues and binding sites and it performs particularly well for the sites with residues that are clustered close together in the sequence. The high predictive quality stems from the usage of novel, comprehensive and custom-designed inputs that utilize information extracted from the sequence, evolutionary profiles, several sequence-predicted structural descriptors and sequence alignment. Analysis of the predictive model reveals several sequence-derived hallmarks of nucleotide-binding residues; they are usually conserved and flanked by less conserved residues, and they are associated with certain arrangements of secondary structures and amino acid pairs in the specific neighboring positions in the sequence.<\/jats:p>\n               <jats:p>Availability: \u00a0http:\/\/biomine.ece.ualberta.ca\/nSITEpred\/<\/jats:p>\n               <jats:p>Contact: \u00a0lkurgan@ece.ualberta.ca<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr657","type":"journal-article","created":{"date-parts":[[2011,12,1]],"date-time":"2011-12-01T05:29:54Z","timestamp":1322717394000},"page":"331-341","source":"Crossref","is-referenced-by-count":122,"title":["Prediction and analysis of nucleotide-binding residues using sequence and sequence-derived structural descriptors"],"prefix":"10.1093","volume":"28","author":[{"given":"Ke","family":"Chen","sequence":"first","affiliation":[{"name":"1 School of Computer Science and Software Engineering, Tianjin Polytechnic University, No. 63 Chenglin Road, Hedong District, Tianjin 300160, P. R. of China and 2Department of Electrical and Computer Engineering, 2nd floor, ECERF (9107 116 Street), University of Alberta, Edmonton, AB, Canada T6G 2V4"},{"name":"1 School of Computer Science and Software Engineering, Tianjin Polytechnic University, No. 63 Chenglin Road, Hedong District, Tianjin 300160, P. R. of China and 2Department of Electrical and Computer Engineering, 2nd floor, ECERF (9107 116 Street), University of Alberta, Edmonton, AB, Canada T6G 2V4"}]},{"given":"Marcin J.","family":"Mizianty","sequence":"additional","affiliation":[{"name":"1 School of Computer Science and Software Engineering, Tianjin Polytechnic University, No. 63 Chenglin Road, Hedong District, Tianjin 300160, P. R. of China and 2Department of Electrical and Computer Engineering, 2nd floor, ECERF (9107 116 Street), University of Alberta, Edmonton, AB, Canada T6G 2V4"}]},{"given":"Lukasz","family":"Kurgan","sequence":"additional","affiliation":[{"name":"1 School of Computer Science and Software Engineering, Tianjin Polytechnic University, No. 63 Chenglin Road, Hedong District, Tianjin 300160, P. R. of China and 2Department of Electrical and Computer Engineering, 2nd floor, ECERF (9107 116 Street), University of Alberta, Edmonton, AB, Canada T6G 2V4"}]}],"member":"286","published-online":{"date-parts":[[2011,11,29]]},"reference":[{"key":"2023012512144827600_B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012512144827600_B2","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The Gene Ontology Consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2023012512144827600_B3","doi-asserted-by":"crossref","first-page":"W529","DOI":"10.1093\/nar\/gkq399","article-title":"ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids","volume":"38","author":"Ashkenazy","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012512144827600_B4","doi-asserted-by":"crossref","first-page":"1875","DOI":"10.1093\/bioinformatics\/btm270","article-title":"Predicting functionally important residues from sequence conservation","volume":"23","author":"Capra","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012512144827600_B5","doi-asserted-by":"crossref","first-page":"434","DOI":"10.1186\/1471-2105-10-434","article-title":"Identification of ATP binding residues of a protein from its primary sequence","volume":"10","author":"Chauhan","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012512144827600_B6","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1186\/1471-2105-11-301","article-title":"Prediction of GTP interacting residues, dipeptides and tripeptides in a protein from its evolutionary information","volume":"11","author":"Chauhan","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023012512144827600_B7","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1186\/1472-6807-7-25","article-title":"Prediction of flexible\/rigid regions from protein sequences using k-spaced amino acid pairs","volume":"7","author":"Chen","year":"2007","journal-title":"BMC Struct Biol."},{"key":"2023012512144827600_B8","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1002\/jcc.21053","article-title":"Prediction of integral membrane protein type by collocated hydrophobic amino acid pairs","volume":"30","author":"Chen","year":"2009","journal-title":"J. Comput. Chem."},{"key":"2023012512144827600_B9","doi-asserted-by":"crossref","first-page":"e4473","DOI":"10.1371\/journal.pone.0004473","article-title":"Investigation of atomic level patterns in protein-small ligand interactions","volume":"4","author":"Chen","year":"2009","journal-title":"PLoS ONE"},{"key":"2023012512144827600_B10","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1002\/(SICI)1097-0134(20000215)38:3<310::AID-PROT7>3.0.CO;2-T","article-title":"When fold is not important: a common structural framework for adenine and AMP binding in 12 unrelated protein families","volume":"38","author":"Denessiouk","year":"2000","journal-title":"Proteins."},{"key":"2023012512144827600_B11","doi-asserted-by":"crossref","first-page":"D667","DOI":"10.1093\/nar\/gkm839","article-title":"LigASite\u2014a database of biologically relevant binding sites in proteins with known apo-structures","volume":"36","author":"Dessailly","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012512144827600_B12","first-page":"1889","article-title":"Working set selection using second order information for training SVM","volume":"6","author":"Fan","year":"2005","journal-title":"J. Mach. Learn Res."},{"key":"2023012512144827600_B13","first-page":"1871","article-title":"LIBLINEAR: a library for large linear classification","volume":"9","author":"Fan","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"2023012512144827600_B14","doi-asserted-by":"crossref","first-page":"847","DOI":"10.1002\/prot.22193","article-title":"Improving the prediction accuracy of residue solvent accessibility and real-value backbone torsion angles of proteins by guided-learning through a two-layer neural network","volume":"74","author":"Faraggi","year":"2009","journal-title":"Proteins."},{"key":"2023012512144827600_B15","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1186\/1471-2091-12-20","article-title":"Residue propensities, discrimination and binding site prediction of adenine and guanine phosphates","volume":"12","author":"Firoz","year":"2011","journal-title":"BMC Biochem."},{"key":"2023012512144827600_B16","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1214\/aoms\/1177728730","article-title":"Correlation between a discrete and a continuous variable. Point-biserial correlation","volume":"25","author":"Tate","year":"1954","journal-title":"Annals of Mathematical Statistics"},{"key":"2023012512144827600_B17","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1093\/nar\/30.1.402","article-title":"LIGAND: database of chemical compounds and reactions in biological pathways","volume":"30","author":"Goto","year":"2002","journal-title":"Nucleic Acids Res."},{"key":"2023012512144827600_B18","doi-asserted-by":"crossref","first-page":"2947","DOI":"10.1093\/bioinformatics\/btm404","article-title":"Clustal W and Clustal X version 2.0","volume":"23","author":"Larkin","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012512144827600_B19","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012512144827600_B20","doi-asserted-by":"crossref","first-page":"2860","DOI":"10.1093\/nar\/29.13.2860","article-title":"Amino acid-base interactions: a three-dimensional analysis of protein-DNA interactions at an atomic level","volume":"29","author":"Luscombe","year":"2001","journal-title":"Nucleic Acids Res."},{"key":"2023012512144827600_B21","doi-asserted-by":"crossref","first-page":"787","DOI":"10.1016\/j.jmb.2003.12.056","article-title":"Molecular determinants for ATP-binding in proteins: a data mining and quantum chemical analysis","volume":"336","author":"Mao","year":"2004","journal-title":"J. Mol. Biol."},{"key":"2023012512144827600_B22","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1093\/bioinformatics\/16.4.404","article-title":"The PSIPRED protein structure prediction server","volume":"16","author":"McGuffin","year":"2000","journal-title":"Bioinformatics"},{"key":"2023012512144827600_B23","doi-asserted-by":"crossref","first-page":"486","DOI":"10.1006\/jmbi.1996.0591","article-title":"Protein recognition of adenylate: an example of a fuzzy recognition template","volume":"263","author":"Moodie","year":"1996","journal-title":"J. Mol. Biol."},{"key":"2023012512144827600_B24","doi-asserted-by":"crossref","first-page":"4294","DOI":"10.1093\/nar\/29.21.4294","article-title":"On the molecular discrimination between adenine and guanine by proteins","volume":"29","author":"Nobeli","year":"2001","journal-title":"Nucleic Acids Res."},{"issue":"Suppl. 1","key":"2023012512144827600_B25","doi-asserted-by":"crossref","first-page":"S71","DOI":"10.1093\/bioinformatics\/18.suppl_1.S71","article-title":"Rate4Site: an algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues","volume":"18","author":"Pupko","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012512144827600_B26","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1093\/protein\/gzj002","article-title":"An empirical approach for detecting nucleotide-binding sites on proteins","volume":"19","author":"Saito","year":"2006","journal-title":"Protein Eng. Des. Sel."},{"key":"2023012512144827600_B27","doi-asserted-by":"crossref","first-page":"430","DOI":"10.1016\/0968-0004(90)90281-F","article-title":"The P-loop - A common motif in ATP-binding and GTP-binding proteins","volume":"15","author":"Saraste","year":"1990","journal-title":"Trends Biochem Sci."},{"key":"2023012512144827600_B28","doi-asserted-by":"crossref","first-page":"921","DOI":"10.1006\/jmbi.1999.3488","article-title":"Statistical analysis of amino acid patterns in transmembrane helices: the GxxxG motif occurs frequently and in association with beta-branched residues at neighboring positions","volume":"296","author":"Senes","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023012512144827600_B29","doi-asserted-by":"crossref","first-page":"945","DOI":"10.1002\/j.1460-2075.1982.tb01276.x","article-title":"Distantly related sequences in the alpha- and beta-subunits of ATP synthase, myosin, kinases and other ATP-requiring enzymes and a common nucleotide-binding fold","volume":"1","author":"Walker","year":"1982","journal-title":"EMBO J."},{"key":"2023012512144827600_B30","doi-asserted-by":"crossref","first-page":"1589","DOI":"10.1093\/bioinformatics\/btg224","article-title":"PISCES: a protein sequence culling server","volume":"19","author":"Wang","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012512144827600_B31","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1186\/1471-2105-7-385","article-title":"Incorporating background frequency improves entropy-based residue conservation measures","volume":"7","author":"Wang","year":"2006","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/3\/331\/48876680\/bioinformatics_28_3_331.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/3\/331\/48876680\/bioinformatics_28_3_331.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T14:31:00Z","timestamp":1674657060000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/3\/331\/188242"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,11,29]]},"references-count":31,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2012,2,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr657","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,2,1]]},"published":{"date-parts":[[2011,11,29]]}}}