{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,10]],"date-time":"2025-11-10T20:53:57Z","timestamp":1762808037917},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2220,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Small non-coding RNAs (ncRNAs) play important roles in various cellular functions in all clades of life. With next-generation sequencing techniques, it has become possible to study ncRNAs in a high-throughput manner and by using specialized algorithms ncRNA classes such as miRNAs can be detected in deep sequencing data. Typically, such methods are targeted to a certain class of ncRNA. Many methods rely on RNA secondary structure prediction, which is not always accurate and not all ncRNA classes are characterized by a common secondary structure. Unbiased classification methods for ncRNAs could be important to improve accuracy and to detect new ncRNA classes in sequencing data.<\/jats:p>\n               <jats:p>Results: Here, we present a scoring system called ALPS (alignment of pattern matrices score) that only uses primary information from a deep sequencing experiment, i.e. the relative positions and lengths of reads, to classify ncRNAs. ALPS makes no further assumptions, e.g. about common structural properties in the ncRNA class and is nevertheless able to identify ncRNA classes with high accuracy. Since ALPS is not designed to recognize a certain class of ncRNA, it can be used to detect novel ncRNA classes, as long as these unknown ncRNAs have a characteristic pattern of deep sequencing read lengths and positions. We evaluate our scoring system on publicly available deep sequencing data and show that it is able to classify known ncRNAs with high sensitivity and specificity.<\/jats:p>\n               <jats:p>Availability: Calculated pattern matrices of the datasets hESC and EB are available at the project web site http:\/\/www.bio.ifi.lmu.de\/ALPS. An implementation of the described method is available upon request from the authors.<\/jats:p>\n               <jats:p>Contact: \u00a0florian.erhard@bio.ifi.lmu.de<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq363","type":"journal-article","created":{"date-parts":[[2010,9,7]],"date-time":"2010-09-07T17:41:46Z","timestamp":1283881306000},"page":"i426-i432","source":"Crossref","is-referenced-by-count":21,"title":["Classification of ncRNAs using position and size information in deep sequencing data"],"prefix":"10.1093","volume":"26","author":[{"given":"Florian","family":"Erhard","sequence":"first","affiliation":[{"name":"Institut f\u00fcr Informatik, Ludwig-Maximilians-Universit\u00e4t M\u00fcnchen, Amalienstra\u00dfe 17, 80333 M\u00fcnchen, Germany"}]},{"given":"Ralf","family":"Zimmer","sequence":"additional","affiliation":[{"name":"Institut f\u00fcr Informatik, Ludwig-Maximilians-Universit\u00e4t M\u00fcnchen, Amalienstra\u00dfe 17, 80333 M\u00fcnchen, Germany"}]}],"member":"286","published-online":{"date-parts":[[2010,9,4]]},"reference":[{"key":"2023012508222008600_B1","doi-asserted-by":"crossref","first-page":"1017","DOI":"10.1016\/S0960-9822(01)00299-8","article-title":"Double-stranded RNA-mediated silencing of genomic tandem repeats and transposable elements in the D. melanogaster germline","volume":"11","author":"Aravin","year":"2001","journal-title":"Curr. Biol."},{"key":"2023012508222008600_B2","doi-asserted-by":"crossref","first-page":"2773","DOI":"10.1101\/gad.1705308","article-title":"Mouse ES cells express endogenous shRNAs, siRNAs, and other microprocessor-independent, dicer-dependent small RNAs","volume":"22","author":"Babiarz","year":"2008","journal-title":"Genes Dev."},{"key":"2023012508222008600_B3","doi-asserted-by":"crossref","first-page":"775","DOI":"10.1016\/S0300-9084(02)01402-5","article-title":"The expanding snoRNA world","volume":"84","author":"Bachellerie","year":"2002","journal-title":"Biochimie"},{"key":"2023012508222008600_B4","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1016\/S0092-8674(04)00045-5","article-title":"MicroRNAs: genomics, biogenesis, mechanism, and function","volume":"116","author":"Bartel","year":"2004","journal-title":"Cell"},{"key":"2023012508222008600_B5","doi-asserted-by":"crossref","first-page":"5904","DOI":"10.1016\/j.febslet.2005.09.040","article-title":"Prediction and validation of microRNAs and their targets","volume":"579","author":"Bentwich","year":"2005","journal-title":"FEBS Lett."},{"key":"2023012508222008600_B6","doi-asserted-by":"crossref","first-page":"D93","DOI":"10.1093\/nar\/gkn787","article-title":"GtRNAdb: a database of transfer RNA genes detected in genomic sequence","volume":"37","author":"Chan","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012508222008600_B7","doi-asserted-by":"crossref","first-page":"798","DOI":"10.1038\/nature07007","article-title":"An endogenous small interfering RNA pathway in Drosophila","volume":"453","author":"Czech","year":"2008","journal-title":"Nature"},{"key":"2023012508222008600_B8","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1186\/1471-2105-5-105","article-title":"Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction","volume":"5","author":"Doshi","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023012508222008600_B9","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1186\/1471-2105-5-71","article-title":"Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction","volume":"5","author":"Dowell","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023012508222008600_B10","volume-title":"Statistical Methods for Research Workers","author":"Fisher","year":"1970","edition":"14th"},{"key":"2023012508222008600_B11","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1038\/nbt1394","article-title":"Discovering microRNAs from deep sequencing data using miRDeep","volume":"26","author":"Friedlander","year":"2008","journal-title":"Nat. Biotechnol."},{"key":"2023012508222008600_B12","doi-asserted-by":"crossref","first-page":"370","DOI":"10.1093\/nar\/gkp988","article-title":"The scaRNA2 is produced by an independent transcription unit and its processing is directed by the encoding region","volume":"38","author":"Gerard","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012508222008600_B13","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1016\/0022-2836(82)90398-9","article-title":"An improved algorithm for matching biological sequences","volume":"162","author":"Gotoh","year":"1982","journal-title":"J. Mol. Biol."},{"key":"2023012508222008600_B14","doi-asserted-by":"crossref","first-page":"D154","DOI":"10.1093\/nar\/gkm952","article-title":"miRBase: tools for microRNA genomics","volume":"36","author":"Griffiths-Jones","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012508222008600_B15","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/j.molcel.2007.06.017","article-title":"MicroRNA targeting specificity in mammals: determinants beyond seed pairing","volume":"27","author":"Grimson","year":"2007","journal-title":"Mol. Cell"},{"key":"2023012508222008600_B16","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1261\/rna.2000810","article-title":"Human tRNA-derived small RNAs in the global regulation of RNA silencing","volume":"16","author":"Haussecker","year":"2010","journal-title":"RNA"},{"key":"2023012508222008600_B17","doi-asserted-by":"crossref","first-page":"852","DOI":"10.1007\/3-540-59496-5_348","article-title":"Thermodynamics of RNA folding. when is an RNA molecule in equilibrium?","volume-title":"Advances in Artificial Life","author":"Higgs","year":"1995"},{"key":"2023012508222008600_B18","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1007\/BF00818163","article-title":"Fast folding and comparison of RNA secondary structures","volume":"125","author":"Hofacker","year":"1994","journal-title":"Monatsh. Chem. Chem. Mon."},{"key":"2023012508222008600_B19","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1093\/bioinformatics\/btg388","article-title":"Prediction of locally stable RNA secondary structures for genome-wide surveys","volume":"20","author":"Hofacker","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508222008600_B20","doi-asserted-by":"crossref","first-page":"R54","DOI":"10.1186\/gb-2009-10-5-r54","article-title":"Dynamic expression of small non-coding RNAs, including novel microRNAs and piRNAs\/21U-RNAs, during Caenorhabditis elegans development","volume":"10","author":"Kato","year":"2009","journal-title":"Genome Biol."},{"key":"2023012508222008600_B21","doi-asserted-by":"crossref","first-page":"R25","DOI":"10.1186\/gb-2009-10-3-r25","article-title":"Ultrafast and memory-efficient alignment of short DNA sequences to the human genome","volume":"10","author":"Langmead","year":"2009","journal-title":"Genome Biol."},{"key":"2023012508222008600_B22","doi-asserted-by":"crossref","first-page":"12751","DOI":"10.1128\/JVI.01325-09","article-title":"Characterization of viral and human RNAs smaller than canonical MicroRNAs","volume":"83","author":"Li","year":"2009","journal-title":"J. Virol."},{"key":"2023012508222008600_B23","doi-asserted-by":"crossref","first-page":"934","DOI":"10.1038\/nsmb1293","article-title":"Structural determinants of RNA recognition and cleavage by Dicer","volume":"14","author":"MacRae","year":"2007","journal-title":"Nat. Struct. Mol. Biol."},{"key":"2023012508222008600_B24","doi-asserted-by":"crossref","first-page":"610","DOI":"10.1101\/gr.7179508","article-title":"Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells","volume":"18","author":"Morin","year":"2008","journal-title":"Genome Res."},{"key":"2023012508222008600_B25","doi-asserted-by":"crossref","first-page":"1422","DOI":"10.1016\/j.febslet.2009.03.048","article-title":"High throughput sequencing of microRNAs in chicken somites","volume":"583","author":"Rathjen","year":"2009","journal-title":"FEBS Lett."},{"key":"2023012508222008600_B26","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1038\/nsmb.1536","article-title":"A distinct class of small RNAs arises from pre-miRNA-proximal regions in a simple chordate","volume":"16","author":"Shi","year":"2009","journal-title":"Nat. Struct. Mol. Biol."},{"key":"2023012508222008600_B27","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","article-title":"Identification of common molecular subsequences","volume":"147","author":"Smith","year":"1981","journal-title":"J. Mol. Biol."},{"key":"2023012508222008600_B28","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1016\/j.cell.2009.07.001","article-title":"Stressing out over tRNA cleavage","volume":"138","author":"Thompson","year":"2009","journal-title":"Cell"},{"key":"2023012508222008600_B29","doi-asserted-by":"crossref","first-page":"e65","DOI":"10.1371\/journal.pcbi.0030065","article-title":"Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering","volume":"3","author":"Will","year":"2007","journal-title":"PLoS Comput. Biol."},{"key":"2023012508222008600_B30","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1089\/106652700750050907","article-title":"A simple iterative approach to parameter optimization","volume":"7","author":"Zien","year":"2000","journal-title":"J. Comput. Biol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/18\/i426\/48858246\/bioinformatics_26_18_i426.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/18\/i426\/48858246\/bioinformatics_26_18_i426.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:22:49Z","timestamp":1674634969000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/18\/i426\/204894"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,9,4]]},"references-count":30,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2010,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq363","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,9,15]]},"published":{"date-parts":[[2010,9,4]]}}}