{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T21:08:52Z","timestamp":1775077732876,"version":"3.50.1"},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2020,6,23]],"date-time":"2020-06-23T00:00:00Z","timestamp":1592870400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"Genome Canada, and Natural Sciences and Engineering Research Council (NSERC) of Canada"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The interaction between proteins and nucleic acids plays a crucial role in gene regulation and cell function. Determining the binding preferences of nucleic acid-binding proteins (NBPs), namely RNA-binding proteins (RBPs) and transcription factors (TFs), is the key to decipher the protein\u2013nucleic acids interaction code. Today, available NBP binding data from in vivo or in vitro experiments are still limited, which leaves a large portion of NBPs uncovered. Unfortunately, existing computational methods that model the NBP binding preferences are mostly protein specific: they need the experimental data for a specific protein in interest, and thus only focus on experimentally characterized NBPs. The binding preferences of experimentally unexplored NBPs remain largely unknown.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Here, we introduce ProbeRating, a nucleic acid recommender system that utilizes techniques from deep learning and word embeddings of natural language processing. ProbeRating is developed to predict binding profiles for unexplored or poorly studied NBPs by exploiting their homologs NBPs which currently have available binding data. Requiring only sequence information as input, ProbeRating adapts FastText from Facebook AI Research to extract biological features. It then builds a neural network-based recommender system. We evaluate the performance of ProbeRating on two different tasks: one for RBP and one for TF. As a result, ProbeRating outperforms previous methods on both tasks. The results show that ProbeRating can be a useful tool to study the binding mechanism for the many NBPs that lack direct experimental evidence. and implementation<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The source code is freely available at &amp;lt;https:\/\/github.com\/syang11\/ProbeRating&amp;gt;.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa580","type":"journal-article","created":{"date-parts":[[2020,6,14]],"date-time":"2020-06-14T11:07:27Z","timestamp":1592132847000},"page":"4797-4804","source":"Crossref","is-referenced-by-count":14,"title":["ProbeRating: a recommender system to infer binding profiles for nucleic acid-binding proteins"],"prefix":"10.1093","volume":"36","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8507-7191","authenticated-orcid":false,"given":"Shu","family":"Yang","sequence":"first","affiliation":[{"name":"University of British Columbia Department of Computer Science, , Vancouver, BC V6T1Z4, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaoxi","family":"Liu","sequence":"additional","affiliation":[{"name":"RIKEN Center for Integrative Medical Sciences (IMS) , Yokohama 230-0045, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Raymond T","family":"Ng","sequence":"additional","affiliation":[{"name":"University of British Columbia Department of Computer Science, , Vancouver, BC V6T1Z4, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2020,6,23]]},"reference":[{"key":"2023062213574555100_btaa580-B1","doi-asserted-by":"crossref","first-page":"831","DOI":"10.1038\/nbt.3300","volume":"33","author":"Alipanahi","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023062213574555100_btaa580-B2","doi-asserted-by":"crossref","first-page":"e0141287","DOI":"10.1371\/journal.pone.0141287","volume":"10","author":"Asgari","year":"2015","journal-title":"PLoS One"},{"key":"2023062213574555100_btaa580-B3","doi-asserted-by":"crossref","first-page":"W202","DOI":"10.1093\/nar\/gkp335","volume":"37","author":"Bailey","year":"2009","journal-title":"Nucleic Acids Res"},{"key":"2023062213574555100_btaa580-B4","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1002\/jcb.22077","volume":"107","author":"Barski","year":"2009","journal-title":"J. Cell. Biochem"},{"key":"2023062213574555100_btaa580-B5","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/nmeth.1611","volume":"8","author":"Bellucci","year":"2011","journal-title":"Nat. Methods"},{"key":"2023062213574555100_btaa580-B6","doi-asserted-by":"crossref","first-page":"1429","DOI":"10.1038\/nbt1246","volume":"24","author":"Berger","year":"2006","journal-title":"Nat. Biotechnol"},{"key":"2023062213574555100_btaa580-B7","doi-asserted-by":"crossref","first-page":"1266","DOI":"10.1016\/j.cell.2008.05.024","volume":"133","author":"Berger","year":"2008","journal-title":"Cell"},{"key":"2023062213574555100_btaa580-B8","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023062213574555100_btaa580-B9","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1162\/tacl_a_00051","volume":"5","author":"Bojanowski","year":"2017","journal-title":"Trans. Assoc. Comput. Linguist"},{"key":"2023062213574555100_btaa580-B10","doi-asserted-by":"crossref","first-page":"3627","DOI":"10.1093\/bioinformatics\/btw517","volume":"32","author":"Corrado","year":"2016","journal-title":"Bioinformatics"},{"key":"2023062213574555100_btaa580-B11","doi-asserted-by":"crossref","first-page":"1489","DOI":"10.18632\/aging.101485","volume":"10","author":"Dong","year":"2018","journal-title":"Aging"},{"key":"2023062213574555100_btaa580-B12","author":"Gandhi","year":"2019"},{"key":"2023062213574555100_btaa580-B13","first-page":"214","volume":"30","author":"Ghanbari","year":"2020"},{"key":"2023062213574555100_btaa580-B14","doi-asserted-by":"crossref","first-page":"e1003711","DOI":"10.1371\/journal.pcbi.1003711","volume":"10","author":"Ghandi","year":"2014","journal-title":"PLoS Comput. Biol"},{"key":"2023062213574555100_btaa580-B15","doi-asserted-by":"crossref","first-page":"e117","DOI":"10.1093\/nar\/gkl544","volume":"34","author":"Hiller","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023062213574555100_btaa580-B16","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1002\/prot.25639","volume":"87","author":"Jung","year":"2018","journal-title":"Proteins"},{"key":"2023062213574555100_btaa580-B17","doi-asserted-by":"crossref","first-page":"e1000832","DOI":"10.1371\/journal.pcbi.1000832","volume":"6","author":"Kazan","year":"2010","journal-title":"PLoS Comput. Biol"},{"key":"2023062213574555100_btaa580-B18","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1038\/nrg3141","volume":"13","author":"Konig","year":"2012","journal-title":"Nat. Rev. Genet"},{"key":"2023062213574555100_btaa580-B19","author":"Koo","year":"2018"},{"key":"2023062213574555100_btaa580-B20","doi-asserted-by":"crossref","first-page":"650","DOI":"10.1016\/j.cell.2018.01.029","volume":"172","author":"Lambert","year":"2018","journal-title":"Cell"},{"key":"2023062213574555100_btaa580-B21","first-page":"1188","volume":"32","author":"Le","year":"2014"},{"key":"2023062213574555100_btaa580-B22","doi-asserted-by":"crossref","first-page":"e129","DOI":"10.1093\/nar\/gkx492","volume":"45","author":"Li","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023062213574555100_btaa580-B23","doi-asserted-by":"crossref","first-page":"19675","DOI":"10.1038\/srep19675","volume":"6","author":"Liu","year":"2016","journal-title":"Sci. Rep"},{"key":"2023062213574555100_btaa580-B24","doi-asserted-by":"crossref","first-page":"2118","DOI":"10.1111\/j.1742-4658.2005.04653.x","volume":"272","author":"Maris","year":"2005","journal-title":"FEBS J"},{"key":"2023062213574555100_btaa580-B25","doi-asserted-by":"crossref","first-page":"R17","DOI":"10.1186\/gb-2014-15-1-r17","volume":"15","author":"Maticzka","year":"2014","journal-title":"Genome Biol"},{"key":"2023062213574555100_btaa580-B26","first-page":"3111","author":"Mikolov","year":"2013"},{"key":"2023062213574555100_btaa580-B27","doi-asserted-by":"crossref","first-page":"i351","DOI":"10.1093\/bioinformatics\/btw259","volume":"32","author":"Orenstein","year":"2016","journal-title":"Bioinformatics"},{"key":"2023062213574555100_btaa580-B28","volume":"8, 14249","author":"Osmanbeyoglu","year":"2017","journal-title":"Nat. Commun"},{"key":"2023062213574555100_btaa580-B29","doi-asserted-by":"crossref","first-page":"1345","DOI":"10.1109\/TKDE.2009.191","volume":"22","author":"Pan","year":"2010","journal-title":"IEEE Trans. Knowl. Data Eng"},{"key":"2023062213574555100_btaa580-B30","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/j.neucom.2018.04.036","volume":"305","author":"Pan","year":"2018","journal-title":"Neurocomputing"},{"key":"2023062213574555100_btaa580-B31","doi-asserted-by":"crossref","first-page":"3427","DOI":"10.1093\/bioinformatics\/bty364","volume":"34","author":"Pan","year":"2018","journal-title":"Bioinformatics"},{"key":"2023062213574555100_btaa580-B32","volume":"10, e1544","author":"Pan","year":"2019","journal-title":"Wiley Interdiscip. Rev RNA"},{"key":"2023062213574555100_btaa580-B33","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1038\/nrg2641","volume":"10","author":"Park","year":"2009","journal-title":"Nat. Rev. Genet"},{"key":"2023062213574555100_btaa580-B34","doi-asserted-by":"crossref","first-page":"1242","DOI":"10.1038\/nbt.3343","volume":"33","author":"Pelossof","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023062213574555100_btaa580-B35","doi-asserted-by":"crossref","first-page":"e121","DOI":"10.1093\/nar\/gkv585","volume":"43","author":"Peng","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023062213574555100_btaa580-B36","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1038\/nrg.2015.10","volume":"17","author":"Quinn","year":"2016","journal-title":"Nat. Rev. Genet"},{"key":"2023062213574555100_btaa580-B37","doi-asserted-by":"crossref","first-page":"667","DOI":"10.1038\/nbt.1550","volume":"27","author":"Ray","year":"2009","journal-title":"Nat. Biotechnol"},{"key":"2023062213574555100_btaa580-B38","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1038\/nature12311","volume":"499","author":"Ray","year":"2013","journal-title":"Nature"},{"key":"2023062213574555100_btaa580-B39","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-85820-3","volume-title":"Recommender Systems Handbook","author":"Ricci","year":"2011"},{"key":"2023062213574555100_btaa580-B40","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1093\/bioinformatics\/16.1.16","volume":"16","author":"Stormo","year":"2000","journal-title":"Bioinformatics"},{"key":"2023062213574555100_btaa580-B41","doi-asserted-by":"crossref","first-page":"1370","DOI":"10.1093\/nar\/gkv020","volume":"43","author":"Suresh","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023062213574555100_btaa580-B42","doi-asserted-by":"crossref","first-page":"D322","DOI":"10.1093\/nar\/gky1112","volume":"47","author":"Tak Leung","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023062213574555100_btaa580-B43","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1007\/978-1-4939-6406-2_15","volume":"1484","author":"Walia","year":"2017","journal-title":"Methods Mol. Biol."},{"key":"2023062213574555100_btaa580-B44","doi-asserted-by":"crossref","first-page":"5263","DOI":"10.1093\/nar\/gkv439","volume":"43","author":"Wang","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023062213574555100_btaa580-B45","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1146\/annurev-biochem-060815-014607","volume":"85","author":"Wang","year":"2016","journal-title":"Annu. Rev. Biochem"},{"key":"2023062213574555100_btaa580-B46","doi-asserted-by":"crossref","first-page":"1431","DOI":"10.1016\/j.cell.2014.08.009","volume":"158","author":"Weirauch","year":"2014","journal-title":"Cell"},{"key":"2023062213574555100_btaa580-B47","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1093\/bib\/bbv023","volume":"17","author":"Yan","year":"2016","journal-title":"Brief. Bioinf"},{"key":"2023062213574555100_btaa580-B48","volume":"19, 96","author":"Yang","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2023062213574555100_btaa580-B49","doi-asserted-by":"crossref","first-page":"2972","DOI":"10.1093\/bioinformatics\/btr503","volume":"27","author":"Yang","year":"2011","journal-title":"Bioinformatics"},{"key":"2023062213574555100_btaa580-B50","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1016\/j.omtn.2018.03.001","volume":"11","author":"Yi","year":"2018","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023062213574555100_btaa580-B51","doi-asserted-by":"crossref","first-page":"i121","DOI":"10.1093\/bioinformatics\/btw255","volume":"32","author":"Zeng","year":"2016","journal-title":"Bioinformatics"},{"key":"2023062213574555100_btaa580-B52","doi-asserted-by":"crossref","first-page":"1250","DOI":"10.1093\/bib\/bbx168","volume":"20","author":"Zhang","year":"2019","journal-title":"Brief. Bioinf"},{"key":"2023062213574555100_btaa580-B53","doi-asserted-by":"crossref","first-page":"D203","DOI":"10.1093\/nar\/gkv1252","volume":"44","author":"Zhao","year":"2016","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa580\/33773945\/btaa580.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/18\/4797\/50677949\/btaa580.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/18\/4797\/50677949\/btaa580.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,23]],"date-time":"2023-06-23T22:09:26Z","timestamp":1687558166000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/18\/4797\/5861534"}},"subtitle":[],"editor":[{"given":"Yann","family":"Ponty","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2020,6,23]]},"references-count":53,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2020,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa580","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,9,15]]},"published":{"date-parts":[[2020,6,23]]}}}