{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T07:51:23Z","timestamp":1769759483542,"version":"3.49.0"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2018,5,8]],"date-time":"2018-05-08T00:00:00Z","timestamp":1525737600000},"content-version":"vor","delay-in-days":6,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Board of Regents Support Fund","award":["(2016\u20132019)-RD-B-07"],"award-info":[{"award-number":["(2016\u20132019)-RD-B-07"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Machine learning plays a substantial role in bioscience owing to the explosive growth in sequence data and the challenging application of computational methods. Peptide-recognition domains (PRDs) are critical as they promote coupled-binding with short peptide-motifs of functional importance through transient interactions. It is challenging to build a reliable predictor of peptide-binding residue in proteins with diverse types of PRDs from protein sequence alone. On the other hand, it is vital to cope up with the sequencing speed and to broaden the scope of study.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this paper, we propose a machine-learning-based tool, named PBRpredict, to predict residues in peptide-binding domains from protein sequence alone. To develop a generic predictor, we train the models on peptide-binding residues of diverse types of domains. As inputs to the models, we use a high-dimensional feature set of chemical, structural and evolutionary information extracted from protein sequence. We carefully investigate six different state-of-the-art classification algorithms for this application. Finally, we use the stacked generalization approach to non-linearly combine a set of complementary base-level learners using a meta-level learner which outperformed the winner-takes-all approach. The proposed predictor is found competitive based on statistical evaluation.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>PBRpredict-Suite software: http:\/\/cs.uno.edu\/~tamjid\/Software\/PBRpredict\/pbrpredict-suite.zip.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty352","type":"journal-article","created":{"date-parts":[[2018,4,30]],"date-time":"2018-04-30T07:17:16Z","timestamp":1525072636000},"page":"3289-3299","source":"Crossref","is-referenced-by-count":43,"title":["PBRpredict-Suite: a suite of models to predict peptide-recognition domain residues from protein sequence"],"prefix":"10.1093","volume":"34","author":[{"given":"Sumaiya","family":"Iqbal","sequence":"first","affiliation":[{"name":"Computer Science, University of New Orleans, New Orleans, LA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0110-2194","authenticated-orcid":false,"given":"Md Tamjidul","family":"Hoque","sequence":"additional","affiliation":[{"name":"Computer Science, University of New Orleans, New Orleans, LA, USA"}]}],"member":"286","published-online":{"date-parts":[[2018,5,2]]},"reference":[{"key":"2023012712493571400_bty352-B1","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol"},{"key":"2023012712493571400_bty352-B2","doi-asserted-by":"crossref","first-page":"235.","DOI":"10.1093\/nar\/28.1.235","article-title":"Protein Data Bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023012712493571400_bty352-B3","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1016\/j.semcdb.2009.09.010","article-title":"MBT domain proteins in development and disease","volume":"21","author":"Bonasio","year":"2010","journal-title":"Semin. Cell Dev. Biol"},{"key":"2023012712493571400_bty352-B4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1961189.1961199","article-title":"LIBSVM: a library for support vector machines","volume":"2","author":"Chang","year":"2011","journal-title":"ACM Trans. Intell. Syst. Technol"},{"key":"2023012712493571400_bty352-B5","doi-asserted-by":"crossref","first-page":"e1004248.","DOI":"10.1371\/journal.pcbi.1004248","article-title":"Predicting peptide-mediated interactions on a genome-wide scale","volume":"11","author":"Chen","year":"2015","journal-title":"PLoS Comput. Biol"},{"key":"2023012712493571400_bty352-B6","doi-asserted-by":"crossref","first-page":"eaal3655.","DOI":"10.1126\/science.aal3655","article-title":"An N-end rule pathway that recognizes proline and destroys gluconeogenic enzymes","volume":"355","author":"Chen","year":"2017","journal-title":"Science"},{"key":"2023012712493571400_bty352-B7","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/j.gpb.2013.03.002","article-title":"PepBind: a comprehensive database and computational tool for analysis of protein-peptide interactions","volume":"11","author":"Das","year":"2013","journal-title":"Genomics Proteomics Bioinformatics"},{"key":"2023012712493571400_bty352-B8","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1002\/jcc.21968","article-title":"SPINE X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles","volume":"33","author":"Faraggi","year":"2012","journal-title":"J. Comput. Chem"},{"key":"2023012712493571400_bty352-B9","doi-asserted-by":"crossref","first-page":"D808","DOI":"10.1093\/nar\/gks1094","article-title":"STRING v9.1: protein-protein interaction networks, with increased coverage and integration","volume":"41","author":"Franceschini","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023012712493571400_bty352-B10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s00251-008-0341-z","article-title":"NetMHCpan, a method for MHC class I binding prediction beyond humans","volume":"61","author":"Hoof","year":"2009","journal-title":"Immunogenetics"},{"key":"2023012712493571400_bty352-B11","doi-asserted-by":"crossref","first-page":"1939","DOI":"10.1038\/emboj.2011.108","article-title":"The CW domain, a new histone recognition module in chromatin proteins","volume":"30","author":"Hoppmann","year":"2011","journal-title":"EMBO J"},{"key":"2023012712493571400_bty352-B12","doi-asserted-by":"crossref","first-page":"380","DOI":"10.1016\/j.jtbi.2015.06.012","article-title":"Improved prediction of accessible surface area results in efficient energy function application","volume":"380","author":"Iqbal","year":"2015","journal-title":"J. Theor. Biol"},{"key":"2023012712493571400_bty352-B13","doi-asserted-by":"crossref","first-page":"e0141551.","DOI":"10.1371\/journal.pone.0141551","article-title":"DisPredict: a predictor of disordered protein using optimized RBF kernel","volume":"10","author":"Iqbal","year":"2015","journal-title":"PLoS One"},{"key":"2023012712493571400_bty352-B14","doi-asserted-by":"crossref","first-page":"e0161452.","DOI":"10.1371\/journal.pone.0161452","article-title":"Estimation of position specific energy as a feature of protein residues from sequence alone for structural classification","volume":"11","author":"Iqbal","year":"2016","journal-title":"PLoS One"},{"key":"2023012712493571400_bty352-B15","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1016\/j.jtbi.2015.10.015","article-title":"A balanced secondary structure predictor","volume":"389","author":"Islam","year":"2016","journal-title":"J. Theor. Biol"},{"key":"2023012712493571400_bty352-B16","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1093\/bioinformatics\/btu744","article-title":"DISOPRED3: precise disordered region predictions with annotated protein-binding activity","volume":"31","author":"Jones","year":"2015","journal-title":"Bioinformatics"},{"key":"2023012712493571400_bty352-B17","doi-asserted-by":"crossref","first-page":"2096","DOI":"10.1002\/prot.24422","article-title":"Detection of peptide-binding sites on protein surfaces: the first step toward the modeling and targeting of peptide-mediated interactions","volume":"81","author":"Lavi","year":"2013","journal-title":"Proteins"},{"key":"2023012712493571400_bty352-B19","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1016\/S0014-5793(01)03287-2","article-title":"VHS domaina longshoreman of vesicle lines","volume":"513","author":"Lohi","year":"2002","journal-title":"FEBS Lett"},{"key":"2023012712493571400_bty352-B20","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1016\/j.str.2009.11.012","article-title":"The structural basis of peptide-protein binding strategies","volume":"18","author":"London","year":"2010","journal-title":"Structure"},{"key":"2023012712493571400_bty352-B21","doi-asserted-by":"crossref","first-page":"1738","DOI":"10.1093\/bioinformatics\/btv060","article-title":"Computational identification of MoRFs in protein sequences","volume":"31","author":"Malhis","year":"2015","journal-title":"Bioinformatics"},{"key":"2023012712493571400_bty352-B22","doi-asserted-by":"crossref","first-page":"360","DOI":"10.1007\/s008940100038","article-title":"Generation and evaluation of dimension-reduced amino acid parameter representations by artificial neural networks","volume":"7","author":"Meiler","year":"2001","journal-title":"J. Mol. Model"},{"key":"2023012712493571400_bty352-B23","doi-asserted-by":"crossref","first-page":"534","DOI":"10.1021\/ma00145a039","article-title":"Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation","volume":"18","author":"Miyazawa","year":"1985","journal-title":"Macromolecules"},{"key":"2023012712493571400_bty352-B24","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1007\/s13721-013-0034-x","article-title":"Classification of microarray cancer data using ensemble approach","volume":"2","author":"Nagi","year":"2013","journal-title":"Netw. Model. Anal. Health Inform. Bioinform"},{"key":"2023012712493571400_bty352-B25","doi-asserted-by":"crossref","first-page":"33.","DOI":"10.1186\/s13073-016-0288-x","article-title":"NetMHCpan-3.0; improved prediction of binding to MHC class I molecules integrating information from multiple receptor and peptide length datasets","volume":"8","author":"Nielsen","year":"2016","journal-title":"Genome Med"},{"key":"2023012712493571400_bty352-B26","first-page":"2825","article-title":"Scikit-learn: machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J. Mach. Learn. Res"},{"key":"2023012712493571400_bty352-B27","doi-asserted-by":"crossref","first-page":"e1000335.","DOI":"10.1371\/journal.pcbi.1000335","article-title":"Accurate prediction of peptide binding sites on protein surfaces","volume":"5","author":"Petsalaki","year":"2009","journal-title":"PLoS Comput. Biol"},{"key":"2023012712493571400_bty352-B28","doi-asserted-by":"crossref","first-page":"D61","DOI":"10.1093\/nar\/gkl842","article-title":"NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins","volume":"35","author":"Pruitt","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023012712493571400_bty352-B29","author":"Schr\u00f6dinger","year":"2015"},{"key":"2023012712493571400_bty352-B30","doi-asserted-by":"crossref","first-page":"1220","DOI":"10.1126\/science.1175668","article-title":"Cell signaling in space and time: where proteins come together and when they\u2019re apart","volume":"326","author":"Scott","year":"2009","journal-title":"Science"},{"key":"2023012712493571400_bty352-B31","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1016\/j.jtbi.2012.12.008","article-title":"A feature extraction technique using bi-gram probabilities of position specific scoring matrix for protein fold recognition","volume":"320","author":"Sharma","year":"2013","journal-title":"J. Theor. Biol"},{"key":"2023012712493571400_bty352-B32","doi-asserted-by":"crossref","first-page":"1223","DOI":"10.1002\/jcc.24314","article-title":"Sequencebased prediction of protein-peptide binding sites using support vector machine","volume":"37","author":"Taherzadeh","year":"2016","journal-title":"J. Comput. Chem"},{"key":"2023012712493571400_bty352-B33","doi-asserted-by":"crossref","first-page":"1543","DOI":"10.1021\/jm010468s","article-title":"Inhibition of protein-protein association by small molecules: approaches and progress","volume":"45","author":"Toogood","year":"2002","journal-title":"J. Med. Chem"},{"key":"2023012712493571400_bty352-B34","doi-asserted-by":"crossref","first-page":"2626","DOI":"10.1039\/C4MB00290C","article-title":"Proteome-wide analysis of human disease mutations in short linear motifs: neglected players in cancer?","volume":"10","author":"Uyar","year":"2014","journal-title":"Mol. Biosyst"},{"key":"2023012712493571400_bty352-B35","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1093\/bioinformatics\/bts072","article-title":"The identification of short linear motif-mediated interfaces within the human interactome","volume":"28","author":"Weatheritt","year":"2012","journal-title":"Bioinformatics"},{"key":"2023012712493571400_bty352-B36","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/S0893-6080(05)80023-1","article-title":"Stacked generalization","volume":"5","author":"Wolpert","year":"1992","journal-title":"Neural Netw"},{"key":"2023012712493571400_bty352-B37","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1109\/4235.585893","article-title":"No free lunch theorems for optimization","volume":"1","author":"Wolpert","year":"1997","journal-title":"IEEE Trans. Evol. Comput"},{"key":"2023012712493571400_bty352-B39","doi-asserted-by":"crossref","first-page":"3353","DOI":"10.1002\/prot.22842","article-title":"Fluctuations of backbone torsion angles obtained from NMR-determined structures and their prediction","volume":"78","author":"Zhang","year":"2010","journal-title":"Proteins"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/19\/3289\/48919703\/bioinformatics_34_19_3289.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/19\/3289\/48919703\/bioinformatics_34_19_3289.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T13:43:32Z","timestamp":1674827012000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/19\/3289\/4991983"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,5,2]]},"references-count":37,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2018,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty352","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,10,1]]},"published":{"date-parts":[[2018,5,2]]}}}