{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T23:04:18Z","timestamp":1770764658844,"version":"3.50.0"},"reference-count":33,"publisher":"Oxford University Press (OUP)","issue":"13","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Protein\u2013RNA interactions play a key role in a number of biological processes, such as protein synthesis, mRNA processing, mRNA assembly, ribosome function and eukaryotic spliceosomes. As a result, a reliable identification of RNA binding site of a protein is important for functional annotation and site-directed mutagenesis. Accumulated data of experimental protein\u2013RNA interactions reveal that a RNA binding residue with different neighbor amino acids often exhibits different preferences for its RNA partners, which in turn can be assessed by the interacting interdependence of the amino acid fragment and RNA nucleotide.<\/jats:p>\n               <jats:p>Results: In this work, we propose a novel classification method to identify the RNA binding sites in proteins by combining a new interacting feature (interaction propensity) with other sequence- and structure-based features. Specifically, the interaction propensity represents a binding specificity of a protein residue to the interacting RNA nucleotide by considering its two-side neighborhood in a protein residue triplet. The sequence as well as the structure-based features of the residues are combined together to discriminate the interaction propensity of amino acids with RNA. We predict RNA interacting residues in proteins by implementing a well-built random forest classifier. The experiments show that our method is able to detect the annotated protein\u2013RNA interaction sites in a high accuracy. Our method achieves an accuracy of 84.5%, F-measure of 0.85 and AUC of 0.92 prediction of the RNA binding residues for a dataset containing 205 non-homologous RNA binding proteins, and also outperforms several existing RNA binding residue predictors, such as RNABindR, BindN, RNAProB and PPRint, and some alternative machine learning methods, such as support vector machine, naive Bayes and neural network in the comparison study. Furthermore, we provide some biological insights into the roles of sequences and structures in protein\u2013RNA interactions by both evaluating the importance of features for their contributions in predictive accuracy and analyzing the binding patterns of interacting residues.<\/jats:p>\n               <jats:p>Availability: All the source data and code are available at http:\/\/www.aporc.org\/doc\/wiki\/PRNA or http:\/\/www.sysbio.ac.cn\/datatools.asp<\/jats:p>\n               <jats:p>Contact: \u00a0lnchen@sibs.ac.cn<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq253","type":"journal-article","created":{"date-parts":[[2010,5,19]],"date-time":"2010-05-19T00:47:01Z","timestamp":1274230021000},"page":"1616-1622","source":"Crossref","is-referenced-by-count":135,"title":["Prediction of protein\u2013RNA binding sites by a random forest method with combined features"],"prefix":"10.1093","volume":"26","author":[{"given":"Zhi-Ping","family":"Liu","sequence":"first","affiliation":[{"name":"1 Key Laboratory of Systems Biology, SIBS-Novo Nordisk Translational Research Centre for PreDiabetes, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 and 2 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ling-Yun","family":"Wu","sequence":"additional","affiliation":[{"name":"1 Key Laboratory of Systems Biology, SIBS-Novo Nordisk Translational Research Centre for PreDiabetes, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 and 2 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yong","family":"Wang","sequence":"additional","affiliation":[{"name":"1 Key Laboratory of Systems Biology, SIBS-Novo Nordisk Translational Research Centre for PreDiabetes, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 and 2 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiang-Sun","family":"Zhang","sequence":"additional","affiliation":[{"name":"1 Key Laboratory of Systems Biology, SIBS-Novo Nordisk Translational Research Centre for PreDiabetes, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 and 2 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Luonan","family":"Chen","sequence":"additional","affiliation":[{"name":"1 Key Laboratory of Systems Biology, SIBS-Novo Nordisk Translational Research Centre for PreDiabetes, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 and 2 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2010,5,18]]},"reference":[{"key":"2023012507554196600_B1","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1006\/jmbi.2001.4857","article-title":"Structure-based analysis of protein-RNA interactions using the program ENTANGLE","volume":"311","author":"Allers","year":"2001","journal-title":"J. Mol. Biol."},{"key":"2023012507554196600_B2","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012507554196600_B3","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The Protein Data Bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012507554196600_B4","doi-asserted-by":"crossref","first-page":"751","DOI":"10.1016\/S0006-3495(92)81649-1","article-title":"The nucleic acid database. A comprehensive relational database of three-dimensional structures of nucleic acids","volume":"63","author":"Berman","year":"1992","journal-title":"Biophys. J."},{"key":"2023012507554196600_B5","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random Forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"2023012507554196600_B6","doi-asserted-by":"crossref","first-page":"e29","DOI":"10.1093\/nar\/gkn008","article-title":"Predicting RNA-binding sites from the protein structure based on electrostatics, evolution and geometry","volume":"36","author":"Chen","year":"2008","journal-title":"Nucleic Acids Res."},{"issue":"Suppl. 12","key":"2023012507554196600_B7","doi-asserted-by":"crossref","first-page":"S6","DOI":"10.1186\/1471-2105-9-S12-S6","article-title":"Predicting RNA-binding sites of proteins using support vector machines and evolutionary information","volume":"9","author":"Cheng","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012507554196600_B8","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1038\/86221","article-title":"A universal mode of helix packing in RNA","volume":"8","author":"Doherty","year":"2001","journal-title":"Nat. Struct. Biol."},{"key":"2023012507554196600_B9","doi-asserted-by":"crossref","first-page":"903","DOI":"10.1002\/prot.21211","article-title":"Protein-RNA interactions: structural analysis and functional classes","volume":"66","author":"Ellis","year":"2007","journal-title":"Proteins"},{"key":"2023012507554196600_B10","doi-asserted-by":"crossref","first-page":"1977","DOI":"10.1016\/j.febslet.2008.03.004","article-title":"RNA-binding proteins and post-transcriptional gene regulation","volume":"582","author":"Glisovic","year":"2008","journal-title":"FEBS Lett."},{"key":"2023012507554196600_B11","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1016\/S0959-440X(02)00323-8","article-title":"RNA-protein interactions","volume":"12","author":"Hall","year":"2002","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023012507554196600_B12","first-page":"105","article-title":"A neural network method for identification of RNA-interacting residues in protein","volume":"15","author":"Jeong","year":"2004","journal-title":"Genome Inform."},{"key":"2023012507554196600_B13","doi-asserted-by":"crossref","first-page":"943","DOI":"10.1093\/nar\/29.4.943","article-title":"Protein-RNA interactions: a structural analysis","volume":"29","author":"Jones","year":"2001","journal-title":"Nucleic Acids Res."},{"key":"2023012507554196600_B14","doi-asserted-by":"crossref","first-page":"2577","DOI":"10.1002\/bip.360221211","article-title":"Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features","volume":"22","author":"Kabsch","year":"1983","journal-title":"Biopolymers"},{"key":"2023012507554196600_B15","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/S0014-5793(03)00930-X","article-title":"Computational analysis of hydrogen bonds in protein-RNA complexes for interaction patterns","volume":"552","author":"Kim","year":"2003","journal-title":"FEBS Lett."},{"key":"2023012507554196600_B16","doi-asserted-by":"crossref","first-page":"6450","DOI":"10.1093\/nar\/gkl819","article-title":"Amino acid residue doublet propensity in the protein-RNA interface and its application to RNA interface prediction","volume":"34","author":"Kim","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012507554196600_B17","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1002\/prot.21677","article-title":"Prediction of RNA binding sites in a protein using SVM and PSSM profile","volume":"71","author":"Kumar","year":"2008","journal-title":"Proteins"},{"key":"2023012507554196600_B18","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1186\/1471-2105-9-553","article-title":"Prediction of protein-protein binding site by using core interface residue and support vector machine","volume":"9","author":"Li","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012507554196600_B19","first-page":"18","article-title":"Classification and regression by random forest","volume":"2","author":"Liaw","year":"2002","journal-title":"R News"},{"key":"2023012507554196600_B20","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1007\/s00726-008-0088-8","article-title":"Bridging protein local structures and protein functions","volume":"35","author":"Liu","year":"2008","journal-title":"Amino Acids"},{"key":"2023012507554196600_B21","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1038\/nrm2178","article-title":"RNA-binding proteins: modular design for efficient function","volume":"8","author":"Lunde","year":"2007","journal-title":"Nat. Rev. Mol. Cell Biol."},{"key":"2023012507554196600_B22","doi-asserted-by":"crossref","first-page":"2746","DOI":"10.1093\/bioinformatics\/btl470","article-title":"Protein-RNA interactions: exploring binding patterns with a three-dimensional superposition analysis of high resolution structures","volume":"22","author":"Morozova","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012507554196600_B23","first-page":"75","article-title":"Amino acids, peptides, and proteins","volume-title":"Lehninger Principles of Biochemistry","author":"Nelson","year":"2004","edition":"4th"},{"key":"2023012507554196600_B24","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1002\/prot.340200303","article-title":"Conservation and prediction of solvent accessibility in protein families","volume":"20","author":"Rost","year":"1994","journal-title":"Proteins"},{"key":"2023012507554196600_B25","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1016\/j.jmb.2008.03.043","article-title":"Prediction of interacting single-stranded RNA bases by protein-binding patterns","volume":"379","author":"Shulman-Peleg","year":"2008","journal-title":"J. Mol. Biol."},{"key":"2023012507554196600_B26","doi-asserted-by":"crossref","first-page":"1492","DOI":"10.1093\/bioinformatics\/btp257","article-title":"Protein function annotation from sequence: prediction of residues interacting with RNA","volume":"25","author":"Spriggs","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012507554196600_B27","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1016\/0022-2836(83)90041-4","article-title":"Correlation of sequence hydrophobicities measures similarity in three dimensional protein structure","volume":"171","author":"Sweet","year":"1983","journal-title":"J. Mol. Biol."},{"key":"2023012507554196600_B28","doi-asserted-by":"crossref","first-page":"1450","DOI":"10.1261\/rna.2197306","article-title":"Prediction of RNA binding sites in proteins from amino acid sequence","volume":"12","author":"Terribilini","year":"2006","journal-title":"RNA"},{"key":"2023012507554196600_B29","doi-asserted-by":"crossref","first-page":"W578","DOI":"10.1093\/nar\/gkm294","article-title":"RNABindR: a server for analyzing and predicting RNA-binding sites in proteins","volume":"35","author":"Terribilini","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012507554196600_B30","doi-asserted-by":"crossref","first-page":"D190","DOI":"10.1093\/nar\/gkm895","article-title":"The Universal Protein Resource (Uniprot)","volume":"36","author":"The UniProt Consortium.","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012507554196600_B31","doi-asserted-by":"crossref","first-page":"W243","DOI":"10.1093\/nar\/gkl298","article-title":"BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences","volume":"34","author":"Wang","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012507554196600_B32","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1007\/s10858-007-9193-3","article-title":"Nearest-neighbor effects on backbone alpha and beta carbon chemical shifts in proteins","volume":"39","author":"Wang","year":"2007","journal-title":"J. Biomol. NMR"},{"key":"2023012507554196600_B33","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1073\/pnas.0805923106","article-title":"Identification of direct residue contacts in protein-protein interaction by message passing","volume":"106","author":"Weigt","year":"2009","journal-title":"Proc. Natl Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/13\/1616\/48851907\/bioinformatics_26_13_1616.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/13\/1616\/48851907\/bioinformatics_26_13_1616.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T07:56:13Z","timestamp":1674633373000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/13\/1616\/201634"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,5,18]]},"references-count":33,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2010,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq253","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,7,1]]},"published":{"date-parts":[[2010,5,18]]}}}