{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,6]],"date-time":"2025-11-06T19:57:03Z","timestamp":1762459023214},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"23","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":1453,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/3.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The computational search for novel microRNA (miRNA) precursors often involves some sort of structural analysis with the aim of identifying which type of structures are prone to being recognized and processed by the cellular miRNA-maturation machinery. A natural way to tackle this problem is to perform clustering over the candidate structures along with known miRNA precursor structures. Mixed clusters allow then the identification of candidates that are similar to known precursors. Given the large number of pre-miRNA candidates that can be identified in single-genome approaches, even after applying several filters for precursor robustness and stability, a conventional structural clustering approach is unfeasible.<\/jats:p>\n               <jats:p>Results: We propose a method to represent candidate structures in a feature space, which summarizes key sequence\/structure characteristics of each candidate. We demonstrate that proximity in this feature space is related to sequence\/structure similarity, and we select candidates that have a high similarity to known precursors. Additional filtering steps are then applied to further reduce the number of candidates to those with greater transcriptional potential. Our method is compared with another single-genome method (TripletSVM) in two datasets, showing better performance in one and comparable performance in the other, for larger training sets. Additionally, we show that our approach allows for a better interpretation of the results.<\/jats:p>\n               <jats:p>Availability and Implementation: The MinDist method is implemented using Perl scripts and is freely available at http:\/\/www.cravela.org\/?mindist=1.<\/jats:p>\n               <jats:p>Contact: \u00a0backofen@informatik.uni-freiburg.de<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts574","type":"journal-article","created":{"date-parts":[[2012,10,12]],"date-time":"2012-10-12T00:24:35Z","timestamp":1350001475000},"page":"3034-3041","source":"Crossref","is-referenced-by-count":6,"title":["Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches"],"prefix":"10.1093","volume":"28","author":[{"given":"Nuno D.","family":"Mendes","sequence":"first","affiliation":[]},{"given":"Steffen","family":"Heyne","sequence":"additional","affiliation":[]},{"given":"Ana T.","family":"Freitas","sequence":"additional","affiliation":[]},{"given":"Marie-France","family":"Sagot","sequence":"additional","affiliation":[]},{"given":"Rolf","family":"Backofen","sequence":"additional","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2012,10,10]]},"reference":[{"key":"2023062411492213500_bts574-B1","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1016\/j.cell.2009.01.002","article-title":"MicroRNAs: target recognition and regulatory functions","volume":"136","author":"Bartel","year":"2009","journal-title":"Cell"},{"key":"2023062411492213500_bts574-B2","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1261\/rna.7240905","article-title":"Microarray profiling of microRNAs reveals frequent coexpression with neighboring miRNAs and host genes","volume":"11","author":"Baskerville","year":"2005","journal-title":"RNA"},{"key":"2023062411492213500_bts574-B3","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1007\/s00285-007-0107-5","article-title":"Variations on RNA folding and alignment: lessons from Benasque","volume":"56","author":"Bompfunewerer","year":"2008","journal-title":"J. Math. Biol."},{"key":"2023062411492213500_bts574-B4","doi-asserted-by":"crossref","first-page":"2677","DOI":"10.1093\/bioinformatics\/btn495","article-title":"Specific alignment of structured RNA: stochastic grammars and sequence annealing","volume":"24","author":"Bradley","year":"2008","journal-title":"Bioinformatics"},{"key":"2023062411492213500_bts574-B5","volume-title":"Pattern Classification","author":"Duda","year":"2001"},{"key":"2023062411492213500_bts574-B6","doi-asserted-by":"crossref","first-page":"3724","DOI":"10.1093\/nar\/25.18.3724","article-title":"Finding the most significant common sequence and structure motifs in a set of RNA sequences","volume":"25","author":"Gorodkin","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023062411492213500_bts574-B7","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1093\/nar\/gkg006","article-title":"Rfam: an RNA family database","volume":"31","author":"Griffiths-Jones","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023062411492213500_bts574-B8","doi-asserted-by":"crossref","first-page":"1815","DOI":"10.1093\/bioinformatics\/bti279","article-title":"Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%","volume":"21","author":"Havgaard","year":"2005","journal-title":"Bioinformatics"},{"key":"2023062411492213500_bts574-B9","doi-asserted-by":"crossref","first-page":"2095","DOI":"10.1093\/bioinformatics\/btp065","article-title":"Lightweight comparison of RNAs based on exact sequence-structure matches","volume":"25","author":"Heyne","year":"2009","journal-title":"Bioinformatics"},{"key":"2023062411492213500_bts574-B10","first-page":"159","article-title":"Local similarity in RNA secondary structures","volume-title":"Proceedings of Computational Systems Bioinformatics (CSB 2003)","author":"H\u00f6chsmann","year":"2003"},{"key":"2023062411492213500_bts574-B11","doi-asserted-by":"crossref","first-page":"2222","DOI":"10.1093\/bioinformatics\/bth229","article-title":"Alignment of RNA base pairing probability matrices","volume":"20","author":"Hofacker","year":"2004","journal-title":"Bioinformatics"},{"key":"2023062411492213500_bts574-B12","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1007\/BF00818163","article-title":"Fast folding and comparison of RNA secondary structures","volume":"125","author":"Hofacker","year":"1994","journal-title":"Monatshefte Chemie"},{"key":"2023062411492213500_bts574-B13","doi-asserted-by":"crossref","first-page":"1059","DOI":"10.1016\/S0022-2836(02)00308-X","article-title":"Secondary structure prediction for aligned RNA sequences","volume":"319","author":"Hofacker","year":"2002","journal-title":"J. Mol. Biol."},{"key":"2023062411492213500_bts574-B14","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1093\/bioinformatics\/btn628","article-title":"Structural profiles of human miRNA families from pairwise clustering","volume":"25","author":"Kaczkowski","year":"2009","journal-title":"Bioinformatics"},{"key":"2023062411492213500_bts574-B15","doi-asserted-by":"crossref","first-page":"843","DOI":"10.1016\/0092-8674(93)90529-Y","article-title":"The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14","volume":"75","author":"Lee","year":"1993","journal-title":"Cell"},{"key":"2023062411492213500_bts574-B16","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1089\/dna.2006.0551","article-title":"Principles and limitations of computational microRNA gene and target finding","volume":"26","author":"Lindow","year":"2007","journal-title":"DNA Cell Biol."},{"key":"2023062411492213500_bts574-B17","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1006\/jmbi.2001.5351","article-title":"Dynalign: an algorithm for finding the secondary structure common to two RNA sequences","volume":"317","author":"Mathews","year":"2002","journal-title":"J. Mol. Biol."},{"key":"2023062411492213500_bts574-B18","doi-asserted-by":"crossref","first-page":"2419","DOI":"10.1093\/nar\/gkp145","article-title":"Current tools for the identification of miRNA genes and their targets","volume":"37","author":"Mendes","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023062411492213500_bts574-B19","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1186\/1471-2164-11-529","article-title":"Combination of measures distinguishes pre-miRNAs from other stem-loops in the genome of the newly sequenced Anopheles darlingi","volume":"11","author":"Mendes","year":"2010","journal-title":"BMC Genomics"},{"key":"2023062411492213500_bts574-B20","doi-asserted-by":"crossref","first-page":"810","DOI":"10.1137\/0145048","article-title":"Simultaneous solution of the RNA folding, alignment and protosequence problems","volume":"45","author":"Sankoff","year":"1985","journal-title":"SIAM J. Appl. Math."},{"key":"2023062411492213500_bts574-B21","doi-asserted-by":"crossref","first-page":"3352","DOI":"10.1093\/bioinformatics\/bti550","article-title":"MARNA: multiple alignment and consensus structure prediction of RNAs based on sequence structure comparisons","volume":"21","author":"Siebert","year":"2005","journal-title":"Bioinformatics"},{"key":"2023062411492213500_bts574-B22","doi-asserted-by":"crossref","first-page":"322","DOI":"10.1016\/j.tig.2005.04.008","article-title":"Mammalian microRNAs derived from genomic repeats","volume":"21","author":"Smalheiser","year":"2005","journal-title":"Trends Genetics"},{"key":"2023062411492213500_bts574-B23","doi-asserted-by":"crossref","first-page":"2454","DOI":"10.1073\/pnas.0409169102","article-title":"Fast and reliable prediction of noncoding RNAs","volume":"102","author":"Washietl","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA."},{"key":"2023062411492213500_bts574-B24","doi-asserted-by":"crossref","first-page":"e65","DOI":"10.1371\/journal.pcbi.0030065","article-title":"Inferring non-coding RNA families and classes by means of genome-scale structure-based clustering","volume":"3","author":"Will","year":"2007","journal-title":"PLOS Comput. Biol."},{"key":"2023062411492213500_bts574-B25","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1186\/1471-2105-6-310","article-title":"Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine","volume":"6","author":"Xue","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023062411492213500_bts574-B26","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1002\/1097-0142(1950)3:1<32::AID-CNCR2820030106>3.0.CO;2-3","article-title":"Index for rating diagnostic tests","volume":"3","author":"Youden","year":"1950","journal-title":"Cancer"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/23\/3034\/50695182\/bioinformatics_28_23_3034.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/23\/3034\/50695182\/bioinformatics_28_23_3034.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,25]],"date-time":"2023-06-25T00:26:05Z","timestamp":1687652765000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/23\/3034\/192879"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,10,10]]},"references-count":26,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2012,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts574","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2012,12]]},"published":{"date-parts":[[2012,10,10]]}}}