{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,23]],"date-time":"2025-04-23T20:26:58Z","timestamp":1745440018002,"version":"3.37.3"},"reference-count":45,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2017,1,15]],"date-time":"2017-01-15T00:00:00Z","timestamp":1484438400000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"DOI":"10.13039\/501100002790","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100002790","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000196","name":"Canada Foundation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100000196","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Comparing ribonucleic acid (RNA) secondary structures of arbitrary size uncovers structural patterns that can provide a better understanding of RNA functions. However, performing fast and accurate secondary structure comparisons is challenging when we take into account the RNA configuration (i.e. linear or circular), the presence of pseudoknot and G-quadruplex (G4) motifs and the increasing number of secondary structures generated by high-throughput probing techniques. To address this challenge, we propose the super-n-motifs model based on a latent analysis of enhanced motifs comprising not only basic motifs but also adjacency relations. The super-n-motifs model computes a vector representation of secondary structures as linear combinations of these motifs.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We demonstrate the accuracy of our model for comparison of secondary structures from linear and circular RNA while also considering pseudoknot and G4 motifs. We show that the super-n-motifs representation effectively captures the most important structural features of secondary structures, as compared to other representations such as ordered tree, arc-annotated and string representations. Finally, we demonstrate the time efficiency of our model, which is alignment free and capable of performing large-scale comparisons of 10\u00a0000 secondary structures with an efficiency up to 4 orders of magnitude faster than existing approaches.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and Implementation<\/jats:title>\n                  <jats:p>The super-n-motifs model was implemented in C\u2009++. Source code and Linux binary are freely available at http:\/\/jpsglouzon.github.io\/supernmotifs\/.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btw773","type":"journal-article","created":{"date-parts":[[2016,12,6]],"date-time":"2016-12-06T12:05:41Z","timestamp":1481025941000},"page":"1169-1178","source":"Crossref","is-referenced-by-count":8,"title":["The super-n-motifs model: a novel alignment-free approach for representing and comparing RNA secondary structures"],"prefix":"10.1093","volume":"33","author":[{"given":"Jean-Pierre S\u00e9hi","family":"Glouzon","sequence":"first","affiliation":[{"name":"Department of Computer Science, Faculty of Science, Universit\u00e9 de Sherbrooke, Sherbrooke, QC, Canada"},{"name":"RNA Group, Department of Biochemistry, Faculty of Medicine and Health Sciences, Applied Cancer Research Pavilion, Universit\u00e9 de Sherbrooke, Sherbrooke, QC, Canada"}]},{"given":"Jean-Pierre","family":"Perreault","sequence":"additional","affiliation":[{"name":"RNA Group, Department of Biochemistry, Faculty of Medicine and Health Sciences, Applied Cancer Research Pavilion, Universit\u00e9 de Sherbrooke, Sherbrooke, QC, Canada"}]},{"given":"Shengrui","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Faculty of Science, Universit\u00e9 de Sherbrooke, Sherbrooke, QC, Canada"}]}],"member":"286","published-online":{"date-parts":[[2017,1,14]]},"reference":[{"key":"2023020205011174600_btw773-B1","doi-asserted-by":"crossref","first-page":"775","DOI":"10.1002\/spe.846","article-title":"A multiple layer model to compare RNA secondary structures","volume":"38","author":"Allali","year":"2008","journal-title":"Softw. Pract. Exp"},{"key":"2023020205011174600_btw773-B2","doi-asserted-by":"crossref","first-page":"340","DOI":"10.1186\/1471-2105-9-340","article-title":"RNA STRAND: the RNA secondary structure and statistical analysis database","volume":"9","author":"Andronescu","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020205011174600_btw773-B3","doi-asserted-by":"crossref","first-page":"W471","DOI":"10.1093\/nar\/gkt290","article-title":"RNAstructure: web servers for RNA secondary structure prediction and analysis","volume":"41","author":"Bellaousov","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023020205011174600_btw773-B4","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1109\/TCBB.2008.28","article-title":"Alignments of RNA structures","volume":"7","author":"Blin","year":"2010","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinformatics"},{"key":"2023020205011174600_btw773-B5","doi-asserted-by":"crossref","first-page":"890","DOI":"10.1093\/bib\/bbt052","article-title":"Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis","volume":"15","author":"Bonham-Carter","year":"2013","journal-title":"Brief. Bioinformatics"},{"key":"2023020205011174600_btw773-B6","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1146\/annurev.biophys.26.1.113","article-title":"Hierarchy and dynamics of RNA folding","volume":"26","author":"Brion","year":"1997","journal-title":"Annu. Rev. Biophys. Biomol. Struct"},{"key":"2023020205011174600_btw773-B7","doi-asserted-by":"crossref","first-page":"909","DOI":"10.1038\/nbt0704-909","article-title":"What is dynamic programming?","volume":"22","author":"Eddy","year":"2004","journal-title":"Nat. Biotechnol"},{"key":"2023020205011174600_btw773-B8","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1016\/j.tibs.2006.04.007","article-title":"RNase P: interface of the RNA and protein worlds","volume":"31","author":"Evans","year":"2006","journal-title":"Trends Biochem. Sci"},{"key":"2023020205011174600_btw773-B9","doi-asserted-by":"crossref","first-page":"230","DOI":"10.1186\/1471-2105-10-230","article-title":"CSA: an efficient algorithm to improve circular DNA multiple alignment","volume":"10","author":"Fernandes","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023020205011174600_btw773-B10","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1055\/s-0032-1323624","article-title":"Viroids and hepatitis delta virus","volume":"32","author":"Flores","year":"2012","journal-title":"Semin. Liver Dis"},{"key":"2023020205011174600_btw773-B11","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1007\/978-1-4419-9473-8_2","volume-title":"An Introduction to Heavy-Tailed and Subexponential Distributions SE - 2","author":"Foss","year":"2011"},{"key":"2023020205011174600_btw773-B12","doi-asserted-by":"crossref","DOI":"10.1093\/database\/bav059","article-title":"G4RNA: an RNA G-quadruplex database","author":"Garant","year":"2015","journal-title":"Database"},{"key":"2023020205011174600_btw773-B13","doi-asserted-by":"crossref","first-page":"e98655","DOI":"10.1371\/journal.pone.0098655","article-title":"Comprehensive secondary structure elucidation of four genera of the family Pospiviroidae","volume":"9","author":"Gigu\u00e8re","year":"2014","journal-title":"PLoS One"},{"key":"2023020205011174600_btw773-B14","first-page":"48","article-title":"Matrix computations","volume":"10","author":"Golub","year":"1996","journal-title":"Phys. Today"},{"key":"2023020205011174600_btw773-B15","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1007\/BF02163027","article-title":"Singular value decomposition and least squares solutions","volume":"14","author":"Golub","year":"1970","journal-title":"Numer. Math"},{"key":"2023020205011174600_btw773-B16","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1007\/11575832_38","volume-title":"String Processing and Information Retrieval SE 38","author":"Guignon","year":"2005"},{"key":"2023020205011174600_btw773-B17","doi-asserted-by":"crossref","first-page":"871","DOI":"10.1261\/rna.031401.111","article-title":"The ubiquitous hammerhead ribozyme","volume":"18","author":"Hammann","year":"2012","journal-title":"RNA"},{"key":"2023020205011174600_btw773-B18","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1093\/bib\/bbt083","article-title":"Alignment-free phylogenetics and population genetics","volume":"15","author":"Haubold","year":"2014","journal-title":"Brief. Bioinformatics"},{"key":"2023020205011174600_btw773-B19","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1017\/S0033583506004215","article-title":"RNA structural motifs: building blocks of a modular biomolecule","volume":"38","author":"Hendrix","year":"2005","journal-title":"Q. Rev. Biophys"},{"key":"2023020205011174600_btw773-B20","doi-asserted-by":"crossref","first-page":"6260","DOI":"10.1093\/nar\/gkn511","article-title":"G-quadruplexes: the beginning and end of UTRs","volume":"36","author":"Huppert","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2023020205011174600_btw773-B21","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1261\/rna.035667.112","article-title":"Circular RNAs are abundant, conserved, and associated with ALU repeats","volume":"19","author":"Jeck","year":"2013","journal-title":"RNA"},{"key":"2023020205011174600_btw773-B22","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1080\/13658816.2011.575074","article-title":"Scaling of geographic space from the perspective of city and field blocks and using volunteered geographic information","volume":"26","author":"Jiang","year":"2011","journal-title":"Int. J. Geogr. Inf. Sci"},{"key":"2023020205011174600_btw773-B23","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1038\/nature11956","article-title":"Circles reshape the RNA world","volume":"495","author":"Kosik","year":"2013","journal-title":"Nature"},{"key":"2023020205011174600_btw773-B24","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1186\/1748-7188-6-26","article-title":"ViennaRNA Package 2.0","volume":"6","author":"Lorenz","year":"2011","journal-title":"Algorithms Mol. Biol"},{"key":"2023020205011174600_btw773-B25","doi-asserted-by":"crossref","first-page":"e165","DOI":"10.1093\/nar\/gku909","article-title":"SHAPE-Seq 2.0: systematic optimization and extension of high-throughput chemical probing of RNA secondary structure with next generation sequencing","volume":"42","author":"Loughrey","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023020205011174600_btw773-B26","first-page":"496","article-title":"Introduction to Information Retrieval","volume":"1","author":"Manning","year":"2008","journal-title":"J. Am. Soc. Inf. Sci. Technol"},{"key":"2023020205011174600_btw773-B27","doi-asserted-by":"crossref","first-page":"6146","DOI":"10.1093\/nar\/gku283","article-title":"A novel approach to represent and compare RNA secondary structures","volume":"42","author":"Mattei","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023020205011174600_btw773-B28","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1002\/wrna.1113","article-title":"G-quadruplexes in RNA biology","volume":"3","author":"Millevoi","year":"2012","journal-title":"Wiley Interdiscip. Rev. RNA"},{"key":"2023020205011174600_btw773-B29","first-page":"93","volume-title":"Lecture Notes in Informatics, German Conference on Bioinformatics,","author":"Mosig","year":"2006"},{"key":"2023020205011174600_btw773-B30","doi-asserted-by":"crossref","first-page":"D130","DOI":"10.1093\/nar\/gku1063","article-title":"Rfam 12.0: updates to the RNA families database","volume":"43","author":"Nawrocki","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020205011174600_btw773-B31","doi-asserted-by":"crossref","first-page":"419","DOI":"10.1093\/bib\/bbt078","article-title":"Applications of alignment-free methods in epigenomics","volume":"15","author":"Pinello","year":"2014","journal-title":"Brief. Bioinformatics"},{"key":"2023020205011174600_btw773-B32","doi-asserted-by":"crossref","first-page":"508","DOI":"10.4161\/rna.6.5.9753","article-title":"Kinship in the SRP RNA family","volume":"6","author":"Rosenblad","year":"2009","journal-title":"RNA Biol"},{"key":"2023020205011174600_btw773-B33","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1007\/978-1-62703-709-9_12","volume-title":"RNA Sequence, Structure, and Function: Computational and Bioinformatic Methods SE 12","author":"Schirmer","year":"2014"},{"key":"2023020205011174600_btw773-B34","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/j.tcs.2012.07.040","article-title":"Forest alignment with affine gaps and anchors, applied in RNA structure comparison","author":"Schirmer","year":"2013","journal-title":"Theoretical Computer Science"},{"key":"2023020205011174600_btw773-B35","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1093\/bib\/bbt067","article-title":"New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing","volume":"15","author":"Song","year":"2014","journal-title":"Brief. Bioinformatics"},{"key":"2023020205011174600_btw773-B36","doi-asserted-by":"crossref","first-page":"0956","DOI":"10.1371\/journal.pbio.0030213","article-title":"Pseudoknots: RNA structures with diverse functions","volume":"3","author":"Staple","year":"2005","journal-title":"PLoS Biol"},{"key":"2023020205011174600_btw773-B37","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1006\/jmbi.1999.3001","article-title":"How RNA folds","volume":"293","author":"Tinoco","year":"1999","journal-title":"J. Mol. Biol"},{"key":"2023020205011174600_btw773-B38","doi-asserted-by":"crossref","first-page":"995","DOI":"10.1038\/nmeth.1529","article-title":"FragSeq: transcriptome-wide RNA structure probing using high-throughput sequencing","volume":"7","author":"Underwood","year":"2010","journal-title":"Nat. Methods"},{"key":"2023020205011174600_btw773-B39","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1093\/bib\/bbu005","article-title":"Editorial: alignment-free methods in computational biology","volume":"15","author":"Vinga","year":"2014","journal-title":"Briefings Bioinformatics"},{"key":"2023020205011174600_btw773-B40","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1093\/bioinformatics\/btg005","article-title":"Alignment-free sequence comparison-a review","volume":"19","author":"Vinga","year":"2003","journal-title":"Bioinformatics"},{"key":"2023020205011174600_btw773-B41","doi-asserted-by":"crossref","first-page":"641","DOI":"10.1038\/nrg3049","article-title":"Understanding the transcriptome through RNA structure","volume":"12","author":"Wan","year":"2011","journal-title":"Nat. Rev. Genet"},{"key":"2023020205011174600_btw773-B42","doi-asserted-by":"crossref","first-page":"680","DOI":"10.1371\/journal.pcbi.0030065","article-title":"Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering","volume":"3","author":"Will","year":"2007","journal-title":"PLoS Comput. Biol"},{"key":"2023020205011174600_btw773-B43","doi-asserted-by":"crossref","first-page":"2489","DOI":"10.1093\/bioinformatics\/btv185","article-title":"SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics","volume":"31","author":"Will","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020205011174600_btw773-B44","first-page":"135","volume-title":"Proceedings of 14th International Workshop, WABI 2014","author":"Will","year":"2014"},{"key":"2023020205011174600_btw773-B45","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1186\/1471-2105-14-269","article-title":"Efficient alignment of RNA secondary structures using sparse dynamic programming","volume":"14","author":"Zhong","year":"2013","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/8\/1169\/49038580\/bioinformatics_33_8_1169.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/8\/1169\/49038580\/bioinformatics_33_8_1169.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T05:03:58Z","timestamp":1675314238000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/8\/1169\/2907822"}},"subtitle":[],"editor":[{"given":"Ivo","family":"Hofacker","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,1,14]]},"references-count":45,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2017,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw773","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2017,4,15]]},"published":{"date-parts":[[2017,1,14]]}}}