{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T02:53:20Z","timestamp":1761706400113},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"24","license":[{"start":{"date-parts":[[2019,5,11]],"date-time":"2019-05-11T00:00:00Z","timestamp":1557532800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Self Regional Healthcare Foundation"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,12,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Circular RNAs (circRNAs) are a new class of endogenous RNAs in animals and plants. During pre-RNA splicing, the 5\u2032 and 3\u2032 termini of exon(s) can be covalently ligated to form circRNAs through back-splicing (head-to-tail splicing). CircRNAs can be conserved across species, show tissue- and developmental stage-specific expression patterns, and may be associated with human disease. However, the mechanism of circRNA formation is still unclear although some sequence features have been shown to affect back-splicing.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this study, by applying the state-of-art machine learning techniques, we have developed the first deep learning model, DeepCirCode, to predict back-splicing for human circRNA formation. DeepCirCode utilizes a convolutional neural network (CNN) with nucleotide sequence as the input, and shows superior performance over conventional machine learning algorithms such as support vector machine and random forest. Relevant features learnt by DeepCirCode are represented as sequence motifs, some of which match human known motifs involved in RNA splicing, transcription or translation. Analysis of these motifs shows that their distribution in RNA sequences can be important for back-splicing. Moreover, some of the human motifs appear to be conserved in mouse and fruit fly. The findings provide new insight into the back-splicing code for circRNA formation.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>All the datasets and source code for model construction are available at https:\/\/github.com\/BioDataLearning\/DeepCirCode.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz382","type":"journal-article","created":{"date-parts":[[2019,5,1]],"date-time":"2019-05-01T19:13:32Z","timestamp":1556738012000},"page":"5235-5242","source":"Crossref","is-referenced-by-count":45,"title":["Deep learning of the back-splicing code for circular RNA formation"],"prefix":"10.1093","volume":"35","author":[{"given":"Jun","family":"Wang","sequence":"first","affiliation":[{"name":"Department of Genetics and Biochemistry, Clemson University , Clemson, SC, USA"}]},{"given":"Liangjiang","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Genetics and Biochemistry, Clemson University , Clemson, SC, USA"}]}],"member":"286","published-online":{"date-parts":[[2019,5,11]]},"reference":[{"key":"2023013108375454800_btz382-B1","doi-asserted-by":"crossref","first-page":"831","DOI":"10.1038\/nbt.3300","article-title":"Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning","volume":"33","author":"Alipanahi","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023013108375454800_btz382-B2","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1016\/j.molcel.2014.08.019","article-title":"circRNA biogenesis competes with pre-mRNA splicing","volume":"56","author":"Ashwal-Fluss","year":"2014","journal-title":"Mol. Cell"},{"key":"2023013108375454800_btz382-B3","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1016\/S0031-3203(96)00142-2","article-title":"The use of the area under the ROC curve in the evaluation of machine learning algorithms","volume":"30","author":"Bradley","year":"1997","journal-title":"Pattern Recogn"},{"key":"2023013108375454800_btz382-B4","doi-asserted-by":"crossref","first-page":"34985.","DOI":"10.1038\/srep34985","article-title":"circRNADb: a comprehensive database for human circular RNAs with protein-coding annotations","volume":"6","author":"Chen","year":"2016","journal-title":"Sci. Rep"},{"key":"2023013108375454800_btz382-B5","doi-asserted-by":"crossref","first-page":"17053.","DOI":"10.1038\/nplants.2017.53","article-title":"A circRNA from SEPALLATA3 regulates splicing of its cognate mRNA through R-loop formation","volume":"3","author":"Conn","year":"2017","journal-title":"Nat. Plants"},{"key":"2023013108375454800_btz382-B6","doi-asserted-by":"crossref","first-page":"1125","DOI":"10.1016\/j.cell.2015.02.014","article-title":"The RNA binding protein quaking regulates formation of circRNAs","volume":"160","author":"Conn","year":"2015","journal-title":"Cell"},{"key":"2023013108375454800_btz382-B7","doi-asserted-by":"crossref","first-page":"2846","DOI":"10.1093\/nar\/gkw027","article-title":"Foxo3 circular RNA retards cell cycle progression via forming ternary complexes with p21 and CDK2","volume":"44","author":"Du","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023013108375454800_btz382-B8","doi-asserted-by":"crossref","first-page":"1245","DOI":"10.1016\/j.bbagrm.2016.07.009","article-title":"The complexity of the translation ability of circRNAs","volume":"1859","author":"Granados-Riveron","year":"2016","journal-title":"Biochim. Biophys. Acta"},{"key":"2023013108375454800_btz382-B9","doi-asserted-by":"crossref","first-page":"R24.","DOI":"10.1186\/gb-2007-8-2-r24","article-title":"Quantifying similarity between motifs","volume":"8","author":"Gupta","year":"2007","journal-title":"Genome Biol"},{"key":"2023013108375454800_btz382-B10","doi-asserted-by":"crossref","first-page":"409.","DOI":"10.1186\/s13059-014-0409-z","article-title":"Expanded identification and characterization of mammalian circular RNAs","volume":"15","author":"Guo","year":"2014","journal-title":"Genome Biol"},{"key":"2023013108375454800_btz382-B11","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1261\/rna.035667.112","article-title":"Circular RNAs are abundant, conserved, and associated with ALU repeats","volume":"19","author":"Jeck","year":"2013","journal-title":"RNA"},{"key":"2023013108375454800_btz382-B12","doi-asserted-by":"crossref","first-page":"350","DOI":"10.1016\/j.cell.2018.05.022","article-title":"A network of noncoding regulatory RNAs acts in the mammalian brain","volume":"174","author":"Kleaveland","year":"2018","journal-title":"Cell"},{"key":"2023013108375454800_btz382-B13","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1016\/j.molcel.2017.02.017","article-title":"Circ-ZNF609 is a circular RNA that can be translated and functions in myogenesis","volume":"66","author":"Legnini","year":"2017","journal-title":"Mol. Cell"},{"key":"2023013108375454800_btz382-B14","doi-asserted-by":"crossref","first-page":"8168","DOI":"10.1093\/nar\/gky721","article-title":"Circular RNA expression in human hematopoietic cells is widespread and cell-type specific","volume":"46","author":"Nicolet","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023013108375454800_btz382-B15","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1016\/j.molcel.2017.02.021","article-title":"Translation of CircRNAs","volume":"66","author":"Pamudurti","year":"2017","journal-title":"Mol. Cell"},{"key":"2023013108375454800_btz382-B16","doi-asserted-by":"crossref","first-page":"eaam8526","DOI":"10.1126\/science.aam8526","article-title":"Loss of a mammalian circular RNA locus causes miRNA deregulation and affects brain function","volume":"357","author":"Piwecka","year":"2017","journal-title":"Science"},{"key":"2023013108375454800_btz382-B17","doi-asserted-by":"crossref","first-page":"e107.","DOI":"10.1093\/nar\/gkw226","article-title":"DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences","volume":"44","author":"Quang","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023013108375454800_btz382-B18","doi-asserted-by":"crossref","first-page":"870","DOI":"10.1016\/j.molcel.2015.03.027","article-title":"Circular RNAs in the mammalian brain are highly abundant, conserved, and dynamically expressed","volume":"58","author":"Rybak-Wolf","year":"2015","journal-title":"Mol. Cell"},{"key":"2023013108375454800_btz382-B19","doi-asserted-by":"crossref","first-page":"1363.","DOI":"10.1126\/science.355.6332.1363","article-title":"Circular RNAs hint at new realm of genetics","volume":"355","author":"Servick","year":"2017","journal-title":"Science"},{"key":"2023013108375454800_btz382-B20","doi-asserted-by":"crossref","first-page":"3940","DOI":"10.1093\/bioinformatics\/bti623","article-title":"ROCR: visualizing classifier performance in R","volume":"21","author":"Sing","year":"2005","journal-title":"Bioinformatics"},{"key":"2023013108375454800_btz382-B21","doi-asserted-by":"crossref","first-page":"D158","DOI":"10.1093\/nar\/gkw1099","article-title":"UniProt: the universal protein knowledgebase","volume":"45","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023013108375454800_btz382-B22","author":"Wang","year":"2017"},{"key":"2023013108375454800_btz382-B23","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1016\/j.yjmcc.2016.07.007","article-title":"Characterization of circular RNAs in human, mouse and rat hearts","volume":"98","author":"Werfel","year":"2016","journal-title":"J. Mol. Cell Cardiol"},{"key":"2023013108375454800_btz382-B24","doi-asserted-by":"crossref","first-page":"1966","DOI":"10.1016\/j.celrep.2014.10.062","article-title":"Genome-wide analysis of drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation","volume":"9","author":"Westholm","year":"2014","journal-title":"Cell Rep"},{"key":"2023013108375454800_btz382-B25","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1038\/nn.3975","article-title":"Neural circular RNAs are derived from synaptic genes and regulated by development and plasticity","volume":"18","author":"You","year":"2015","journal-title":"Nat. Neurosci"},{"key":"2023013108375454800_btz382-B26","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1016\/j.tcm.2013.03.003","article-title":"Fragile hearts: new insights into translational control in cardiac muscle","volume":"23","author":"Zarnescu","year":"2013","journal-title":"Trends Cardiovasc. Med"},{"key":"2023013108375454800_btz382-B27","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1016\/j.cell.2014.09.001","article-title":"Complementary sequence-mediated exon circularization","volume":"159","author":"Zhang","year":"2014","journal-title":"Cell"},{"key":"2023013108375454800_btz382-B28","doi-asserted-by":"crossref","first-page":"931","DOI":"10.1038\/nmeth.3547","article-title":"Predicting effects of noncoding variants with deep learning-based sequence model","volume":"12","author":"Zhou","year":"2015","journal-title":"Nat. Methods"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz382\/28705208\/btz382.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/24\/5235\/48977868\/bioinformatics_35_24_5235.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/24\/5235\/48977868\/bioinformatics_35_24_5235.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T17:55:05Z","timestamp":1675187705000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/24\/5235\/5488122"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,5,11]]},"references-count":28,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2019,12,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz382","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,12,15]]},"published":{"date-parts":[[2019,5,11]]}}}