{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:45:38Z","timestamp":1740185138689,"version":"3.37.3"},"reference-count":55,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2019,12,20]],"date-time":"2019-12-20T00:00:00Z","timestamp":1576800000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100005746","name":"Universidad Nacional del Litoral","doi-asserted-by":"publisher","award":["2016 082"],"award-info":[{"award-number":["2016 082"]}],"id":[{"id":"10.13039\/501100005746","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003074","name":"Agencia Nacional de Promoci\u00f3n Cient\u00edfica y Tecnol\u00f3gica","doi-asserted-by":"publisher","award":["PICT 2014-2627","PICT 2018-3384"],"award-info":[{"award-number":["PICT 2014-2627","PICT 2018-3384"]}],"id":[{"id":"10.13039\/501100003074","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The discovery of microRNA (miRNA) in the last decade has certainly changed the understanding of gene regulation in the cell. Although a large number of algorithms with different features have been proposed, they still predict an impractical amount of false positives. Most of the proposed features are based on the structure of precursors of the miRNA only, not considering the important and relevant information contained in the mature miRNA. Such new kind of features could certainly improve the performance of the predictors of new miRNAs.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>This paper presents three new features that are based on the sequence information contained in the mature miRNA. We will show how these new features, when used by a classical supervised machine learning approach as well as by more recent proposals based on deep learning, improve the prediction performance in a significant way. Moreover, several experimental conditions were defined and tested to evaluate the novel features impact in situations close to genome-wide analysis. The results show that the incorporation of new features based on the mature miRNA allows to improve the detection of new miRNAs independently of the classifier used.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>https:\/\/sourceforge.net\/projects\/sourcesinc\/files\/cplxmirna\/.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz940","type":"journal-article","created":{"date-parts":[[2019,12,17]],"date-time":"2019-12-17T20:17:10Z","timestamp":1576613830000},"page":"2319-2327","source":"Crossref","is-referenced-by-count":6,"title":["Complexity measures of the mature miRNA for improving pre-miRNAs prediction"],"prefix":"10.1093","volume":"36","author":[{"given":"Jonathan","family":"Raad","sequence":"first","affiliation":[{"name":"Research Institute for Signals, Systems and Computational Intelligence sinc(i) (FICH-UNL\/CONICET), Ciudad Universitaria , Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4459-4560","authenticated-orcid":false,"given":"Georgina","family":"Stegmayer","sequence":"additional","affiliation":[{"name":"Research Institute for Signals, Systems and Computational Intelligence sinc(i) (FICH-UNL\/CONICET), Ciudad Universitaria , Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Diego H","family":"Milone","sequence":"additional","affiliation":[{"name":"Research Institute for Signals, Systems and Computational Intelligence sinc(i) (FICH-UNL\/CONICET), Ciudad Universitaria , Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2019,12,20]]},"reference":[{"key":"2023013110164695800_btz940-B1","doi-asserted-by":"crossref","first-page":"687","DOI":"10.1038\/nmeth0910-687","article-title":"MicroRNA profiling: separating signal from noise","volume":"7","author":"Baker","year":"2010","journal-title":"Nat. Methods"},{"key":"2023013110164695800_btz940-B2","doi-asserted-by":"crossref","first-page":"174102","DOI":"10.1103\/PhysRevLett.88.174102","article-title":"Permutation entropy: a natural complexity measure for time series","volume":"88","author":"Bandt","year":"2002","journal-title":"Phys. Rev. Lett"},{"key":"2023013110164695800_btz940-B3","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1016\/S0092-8674(04)00045-5","article-title":"MicroRNAs: genomics, biogenesis, mechanism, and function","volume":"116","author":"Bartel","year":"2004","journal-title":"Cell"},{"key":"2023013110164695800_btz940-B4","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1016\/j.cell.2009.01.002","article-title":"MicroRNAs: target recognition and regulatory functions","volume":"136","author":"Bartel","year":"2009","journal-title":"Cell"},{"key":"2023013110164695800_btz940-B5","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.cell.2018.03.006","article-title":"Metazoan microRNAs","volume":"173","author":"Bartel","year":"2018","journal-title":"Cell"},{"key":"2023013110164695800_btz940-B6","doi-asserted-by":"crossref","first-page":"989","DOI":"10.1093\/bioinformatics\/btp107","article-title":"microPred: effective classification of pre-miRNAs for human miRNA gene prediction","volume":"25","author":"Batuwita","year":"2009","journal-title":"Bioinformatics"},{"key":"2023013110164695800_btz940-B7","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1093\/nar\/gkt856","article-title":"Computational prediction and experimental validation of microRNAs in the brown alga Ectocarpus siliculosus","volume":"42","author":"Billoud","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023013110164695800_btz940-B8","doi-asserted-by":"crossref","first-page":"918","DOI":"10.1093\/bib\/bbx148","article-title":"A survey of software tools for microRNA discovery and characterization using RNA-seq","volume":"20","author":"Bortolomeazzi","year":"2019","journal-title":"Brief. Bioinform"},{"key":"2023013110164695800_btz940-B9","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn"},{"key":"2023013110164695800_btz940-B10","doi-asserted-by":"crossref","first-page":"e85","DOI":"10.1371\/journal.pbio.0030085","article-title":"Principles of microRNA\u2014target recognition","volume":"3","author":"Brennecke","year":"2005","journal-title":"PLoS Biol"},{"key":"2023013110164695800_btz940-B11","doi-asserted-by":"crossref","first-page":"i54","DOI":"10.1093\/bioinformatics\/btg1005","article-title":"Glocal alignment: finding rearrangements during alignment","volume":"19","author":"Brudno","year":"2003","journal-title":"Bioinformatics"},{"key":"2023013110164695800_btz940-B12","first-page":"1","article-title":"Deep neural architectures for highly imbalanced data in bioinformatics","volume":"6","author":"Bugnon","year":"2019","journal-title":"IEEE Trans. Neural Netw. Learn. Syst"},{"key":"2023013110164695800_btz940-B13","doi-asserted-by":"crossref","first-page":"1836","DOI":"10.1093\/bib\/bby054","article-title":"Trends in the development of miRNA bioinformatics tools","author":"Chen","year":"2019","journal-title":"Brief. Bioinform"},{"key":"2023013110164695800_btz940-B14","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1038\/s41467-017-00403-z","article-title":"On the performance of pre-microRNA detection algorithms","volume":"8","author":"Demirci","year":"2017","journal-title":"Nat. Commun"},{"key":"2023013110164695800_btz940-B15","first-page":"1","article-title":"Statistical comparisons of classifiers over multiple data sets","volume":"7","author":"Dem\u0161ar","year":"2006","journal-title":"J. Mach. Learn. Res"},{"key":"2023013110164695800_btz940-B16","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1186\/1471-2105-15-124","article-title":"The discriminant power of RNA features for pre-miRNA recognition","volume":"15","author":"de ON Lopes","year":"2014","journal-title":"BMC Bioinform"},{"key":"2023013110164695800_btz940-B17","doi-asserted-by":"crossref","first-page":"S11","DOI":"10.1186\/1471-2105-11-S11-S11","article-title":"MiRenSVM: towards better prediction of microRNA precursors using an ensemble SVM classifier with multi-loop features","volume":"11","author":"Ding","year":"2010","journal-title":"BMC Bioinform"},{"key":"2023013110164695800_btz940-B18","doi-asserted-by":"crossref","first-page":"6207","DOI":"10.1021\/cr300362f","article-title":"MicroRNA: function, detection, and bioanalysis","volume":"113","author":"Dong","year":"2013","journal-title":"Chem. Rev"},{"key":"2023013110164695800_btz940-B19","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1007\/978-3-642-33275-3_2","volume-title":"Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications","author":"Fischer","year":"2012"},{"key":"2023013110164695800_btz940-B20","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1093\/nar\/gkr688","article-title":"miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades","volume":"40","author":"Friedl\u00e4nder","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023013110164695800_btz940-B21","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1101\/gr.082701.108","article-title":"Most mammalian mRNAs are conserved targets of micrornas","volume":"19","author":"Friedman","year":"2008","journal-title":"Genome Res"},{"key":"2023013110164695800_btz940-B22","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1186\/1471-2105-14-83","article-title":"HuntMi: an efficient and taxon-specific approach in pre-miRNA identification","volume":"14","author":"Gudy\u015b","year":"2013","journal-title":"BMC Bioinform"},{"key":"2023013110164695800_btz940-B23","doi-asserted-by":"crossref","first-page":"e197","DOI":"10.1093\/bioinformatics\/btl257","article-title":"Hairpins in a haystack: recognizing microRNA precursors in comparative genomics data","volume":"22","author":"Hertel","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013110164695800_btz940-B24","doi-asserted-by":"crossref","first-page":"W339","DOI":"10.1093\/nar\/gkm368","article-title":"MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features","volume":"35","author":"Jiang","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023013110164695800_btz940-B25","doi-asserted-by":"crossref","first-page":"D152","DOI":"10.1093\/nar\/gkq1027","article-title":"miRBase: integrating microRNA annotation and deep-sequencing data","volume":"39","author":"Kozomara","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023013110164695800_btz940-B26","doi-asserted-by":"crossref","first-page":"D155","DOI":"10.1093\/nar\/gky1141","article-title":"miRBase: from microRNA sequences to function","volume":"47","author":"Kozomara","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023013110164695800_btz940-B27","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1186\/1471-2105-6-298","article-title":"Kalign\u2014an accurate and fast multiple sequence alignment algorithm","volume":"6","author":"Lassmann","year":"2005","journal-title":"BMC Bioinform"},{"key":"2023013110164695800_btz940-B28","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1109\/TIT.1976.1055501","article-title":"On the complexity of finite sequences","volume":"22","author":"Lempel","year":"1976","journal-title":"IEEE Trans. Inf. Theory"},{"key":"2023013110164695800_btz940-B29","first-page":"707","article-title":"Binary codes capable of correcting deletions, insertions, and reversals","volume":"10","author":"Levenshtein","year":"1966","journal-title":"Sov. Phys. Dokl"},{"key":"2023013110164695800_btz940-B30","doi-asserted-by":"crossref","first-page":"787","DOI":"10.1016\/S0092-8674(03)01018-3","article-title":"Prediction of mammalian microRNA targets","volume":"115","author":"Lewis","year":"2003","journal-title":"Cell"},{"key":"2023013110164695800_btz940-B31","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1016\/j.cell.2004.12.035","article-title":"Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets","volume":"120","author":"Lewis","year":"2005","journal-title":"Cell"},{"key":"2023013110164695800_btz940-B32","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s00335-009-9241-2","article-title":"Computational approaches for microRNA studies: a review","volume":"21","author":"Li","year":"2010","journal-title":"Mamm. Genome"},{"key":"2023013110164695800_btz940-B33","doi-asserted-by":"crossref","first-page":"1280","DOI":"10.1093\/bib\/bbx165","article-title":"BioSeq-Analysis: a platform for DNA, RNA and protein sequence analysis based on machine learning approaches","volume":"20","author":"Liu","year":"2019","journal-title":"Brief. Bioinform"},{"key":"2023013110164695800_btz940-B34","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1186\/1748-7188-6-26","article-title":"ViennaRNA package 2.0","volume":"6","author":"Lorenz","year":"2011","journal-title":"Algorithms Mol. Biol"},{"key":"2023013110164695800_btz940-B35","doi-asserted-by":"crossref","first-page":"2226","DOI":"10.1093\/bioinformatics\/btq329","article-title":"MIReNA: finding microRNAs with high accuracy and no learning at genome scale and from deep sequencing data","volume":"26","author":"Mathelier","year":"2010","journal-title":"Bioinformatics"},{"key":"2023013110164695800_btz940-B36","doi-asserted-by":"crossref","first-page":"1181","DOI":"10.1093\/bib\/bbx136","article-title":"Computational tools for plant small RNA detection and categorization","volume":"20","author":"Morgado","year":"2019","journal-title":"Brief. Bioinform"},{"key":"2023013110164695800_btz940-B37","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1016\/0022-2836(70)90057-4","article-title":"A general method applicable to the search for similarities in the amino acid sequence of two proteins","volume":"48","author":"Needleman","year":"1970","journal-title":"J. Mol. Biol"},{"key":"2023013110164695800_btz940-B38","doi-asserted-by":"crossref","first-page":"1321","DOI":"10.1093\/bioinformatics\/btm026","article-title":"De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures","volume":"23","author":"Ng","year":"2007","journal-title":"Bioinformatics"},{"key":"2023013110164695800_btz940-B39","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1186\/1748-7188-6-25","article-title":"Comparative analysis of the quality of a global algorithm and a local algorithm for alignment of two sequences","volume":"6","author":"Polyanovsky","year":"2011","journal-title":"Algorithm. Mol. Biol"},{"key":"2023013110164695800_btz940-B40","doi-asserted-by":"crossref","first-page":"358","DOI":"10.1038\/nrg3198","article-title":"MicroRNA profiling: approaches and considerations","volume":"13","author":"Pritchard","year":"2012","journal-title":"Nat. Rev. Genet"},{"key":"2023013110164695800_btz940-B41","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1145\/584091.584093","article-title":"A mathematical theory of communication","volume":"5","author":"Shannon","year":"2001","journal-title":"SIGMOBILE Mob. Comput. Commun. Rev"},{"key":"2023013110164695800_btz940-B42","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1093\/bfgp\/elw042","article-title":"A compilation of web-based research tools for miRNA analysis","volume":"16","author":"Shukla","year":"2017","journal-title":"Brief. Funct. Genomics"},{"key":"2023013110164695800_btz940-B43","first-page":"20, 1607\u20131620","article-title":"Predicting novel microRNA: a comprehensive comparison of machine learning approaches","author":"Stegmayer","year":"2019","journal-title":"Brief. Bioinform"},{"key":"2023013110164695800_btz940-B44","doi-asserted-by":"crossref","first-page":"7318","DOI":"10.1038\/ncomms8318","article-title":"Loss of microRNA-27b contributes to breast cancer stem cell generation by activating ENPP1","volume":"6","author":"Takahashi","year":"2015","journal-title":"Nat. Commun"},{"key":"2023013110164695800_btz940-B45","doi-asserted-by":"crossref","first-page":"1108","DOI":"10.1093\/bioinformatics\/btx725","article-title":"microRPM: a microRNA prediction model based only on plant small RNA sequencing data","volume":"34","author":"Tseng","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013110164695800_btz940-B46","doi-asserted-by":"crossref","first-page":"e177","DOI":"10.1093\/nar\/gkx836","article-title":"Mirnovo: genome-free prediction of microRNAs from small RNA sequencing data and single-cells using decision forests","volume":"45","author":"Vitsios","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023013110164695800_btz940-B47","doi-asserted-by":"crossref","DOI":"10.1002\/0470854774","volume-title":"Statistical Pattern Recognition","author":"Webb","year":"2002"},{"key":"2023013110164695800_btz940-B48","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1111\/j.1525-142X.2008.00302.x","article-title":"The deep evolution of metazoan microRNAs","volume":"11","author":"Wheeler","year":"2009","journal-title":"Evol. Dev"},{"key":"2023013110164695800_btz940-B49","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1186\/1471-2105-6-310","article-title":"Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine","volume":"6","author":"Xue","year":"2005","journal-title":"BMC Bioinform"},{"key":"2023013110164695800_btz940-B50","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.biosystems.2015.10.003","article-title":"miRNAfe: a comprehensive tool for feature extraction in microRNA prediction","volume":"138","author":"Yones","year":"2015","journal-title":"Biosystems"},{"key":"2023013110164695800_btz940-B51","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1093\/bioinformatics\/btx612","article-title":"Genome-wide pre-miRNA discovery from few labeled examples","volume":"34","author":"Yones","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013110164695800_btz940-B52","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1093\/bioinformatics\/btl094","article-title":"Combining multi-species genomic data for microRNA identification using a Naive Bayes classifier","volume":"22","author":"Yousef","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013110164695800_btz940-B53","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1109\/TIT.1978.1055934","article-title":"Compression of individual sequences via variable-rate coding","volume":"24","author":"Ziv","year":"1978","journal-title":"IEEE Trans. Inf. Theory"},{"key":"2023013110164695800_btz940-B54","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1016\/j.compbiomed.2013.12.007","article-title":"miRClassify: an advanced web server for miRNA family classification and annotation","volume":"45","author":"Zou","year":"2014","journal-title":"Comput. Biol. Med"},{"key":"2023013110164695800_btz940-B55","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1007\/s10601-007-9033-9","article-title":"Darn! A weighted constraint solver for RNA motif localization","volume":"13","author":"Zytnicki","year":"2008","journal-title":"Constraints"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz940\/32358502\/btz940.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/8\/2319\/48984062\/bioinformatics_36_8_2319.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/8\/2319\/48984062\/bioinformatics_36_8_2319.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T20:22:07Z","timestamp":1675196527000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/8\/2319\/5682407"}},"subtitle":[],"editor":[{"given":"Yann","family":"Ponty","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2019,12,20]]},"references-count":55,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2020,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz940","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2020,4,15]]},"published":{"date-parts":[[2019,12,20]]}}}