{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,6]],"date-time":"2026-03-06T01:48:45Z","timestamp":1772761725522,"version":"3.50.1"},"reference-count":77,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2018,5,23]],"date-time":"2018-05-23T00:00:00Z","timestamp":1527033600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100002923","name":"National Scientific and Technical Research Council","doi-asserted-by":"publisher","award":["PIP 2013 117"],"award-info":[{"award-number":["PIP 2013 117"]}],"id":[{"id":"10.13039\/501100002923","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005746","name":"Universidad Nacional del Litoral","doi-asserted-by":"publisher","award":["CAI\u2009+\u2009D 548"],"award-info":[{"award-number":["CAI\u2009+\u2009D 548"]}],"id":[{"id":"10.13039\/501100005746","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005746","name":"Universidad Nacional del Litoral","doi-asserted-by":"publisher","award":["082"],"award-info":[{"award-number":["082"]}],"id":[{"id":"10.13039\/501100005746","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005746","name":"Universidad Nacional del Litoral","doi-asserted-by":"publisher","award":["076"],"award-info":[{"award-number":["076"]}],"id":[{"id":"10.13039\/501100005746","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005746","name":"Universidad Nacional del Litoral","doi-asserted-by":"publisher","award":["042"],"award-info":[{"award-number":["042"]}],"id":[{"id":"10.13039\/501100005746","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,9,27]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The importance of microRNAs (miRNAs) is widely recognized in the community nowadays because these short segments of RNA can play several roles in almost all biological processes. The computational prediction of novel miRNAs involves training a classifier for identifying sequences having the highest chance of being precursors of miRNAs (pre-miRNAs). The big issue with this task is that well-known pre-miRNAs are usually few in comparison with the hundreds of thousands of candidate sequences in a genome, which results in high class imbalance. This imbalance has a strong influence on most standard classifiers, and if not properly addressed in the model and the experiments, not only performance reported can be completely unrealistic but also the classifier will not be able to work properly for pre-miRNA prediction. Besides, another important issue is that for most of the machine learning (ML) approaches already used (supervised methods), it is necessary to have both positive and negative examples. The selection of positive examples is straightforward (well-known pre-miRNAs). However, it is difficult to build a representative set of negative examples because they should be sequences with hairpin structure that do not contain a pre-miRNA.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>This review provides a comprehensive study and comparative assessment of methods from these two ML approaches for dealing with the prediction of novel pre-miRNAs: supervised and unsupervised training. We present and analyze the ML proposals that have appeared during the past 10\u2009years in literature. They have been compared in several prediction tasks involving two model genomes and increasing imbalance levels. This work provides a review of existing ML approaches for pre-miRNA prediction and fair comparisons of the classifiers with same features and data sets, instead of just a revision of published software tools. The results and the discussion can help the community to select the most adequate bioinformatics approach according to the prediction task at hand. The comparative results obtained suggest that from low to mid-imbalance levels between classes, supervised methods can be the best. However, at very high imbalance levels, closer to real case scenarios, models including unsupervised and deep learning can provide better performance.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bib\/bby037","type":"journal-article","created":{"date-parts":[[2018,4,19]],"date-time":"2018-04-19T06:57:09Z","timestamp":1524121029000},"page":"1607-1620","source":"Crossref","is-referenced-by-count":39,"title":["Predicting novel microRNA: a comprehensive comparison of machine learning approaches"],"prefix":"10.1093","volume":"20","author":[{"given":"Georgina","family":"Stegmayer","sequence":"first","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Leandro E","family":"Di Persia","sequence":"additional","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mariano","family":"Rubiolo","sequence":"additional","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Matias","family":"Gerard","sequence":"additional","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Milton","family":"Pividori","sequence":"additional","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cristian","family":"Yones","sequence":"additional","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5702-946X","authenticated-orcid":false,"given":"Leandro A","family":"Bugnon","sequence":"additional","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tadeo","family":"Rodriguez","sequence":"additional","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jonathan","family":"Raad","sequence":"additional","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Diego H","family":"Milone","sequence":"additional","affiliation":[{"name":"sinc(i), Research Institute for Signals, Systems and Computational Intelligence (CONICET-UNL), Ciudad Universitaria, Santa Fe, Argentina"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2018,5,23]]},"reference":[{"issue":"2","key":"2020080807562367100_bby037-B1","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1016\/S0092-8674(04)00045-5","article-title":"MicroRNAs: genomics, biogenesis, mechanism, and function","volume":"116","author":"Bartel","year":"2004","journal-title":"Cell"},{"issue":"1","key":"2020080807562367100_bby037-B2","doi-asserted-by":"crossref","first-page":"6601.","DOI":"10.1038\/ncomms7601","article-title":"Genome-wide identification of microRNA expression quantitative trait loci","volume":"6","author":"Huan","year":"2015","journal-title":"Nat Commun"},{"issue":"1","key":"2020080807562367100_bby037-B3","doi-asserted-by":"crossref","first-page":"7318","DOI":"10.1038\/ncomms8318","article-title":"Loss of microRNA-27b contributes to breast cancer stem cell generation by activating ENPP1","volume":"6","author":"Takahashi","year":"2015","journal-title":"Nat Commun"},{"issue":"7537","key":"2020080807562367100_bby037-B4","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1038\/nature13905","article-title":"MicroRNA silencing for cancer therapy targeted to the tumour microenvironment","volume":"518","author":"Cheng","year":"2015","journal-title":"Nature"},{"issue":"6","key":"2020080807562367100_bby037-B5","doi-asserted-by":"crossref","first-page":"e21635.","DOI":"10.1371\/journal.pone.0021635","article-title":"MicroRNA expression aberration as potential peripheral blood biomarkers for schizophrenia","volume":"6","author":"Lai","year":"2011","journal-title":"PLoS One"},{"issue":"1","key":"2020080807562367100_bby037-B6","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1093\/bib\/bbs010","article-title":"Detecting miRNAs in deep-sequencing data: a software performance comparison and evaluation","volume":"14","author":"Williamson","year":"2013","journal-title":"Brief Bioinform"},{"issue":"1\u20132","key":"2020080807562367100_bby037-B7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s00335-009-9241-2","article-title":"Computational approaches for microRNA studies: a review","volume":"21","author":"Li","year":"2010","journal-title":"Mamm Genome"},{"key":"2020080807562367100_bby037-B8","doi-asserted-by":"crossref","first-page":"124.","DOI":"10.1186\/1471-2105-15-124","article-title":"The discriminant power of RNA features for pre-miRNA recognition","volume":"15","author":"Lopes","year":"2014","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2020080807562367100_bby037-B9","first-page":"1","article-title":"A compilation of Web-based research tools for miRNA analysis","volume":"1","author":"Shukla","year":"2017","journal-title":"Brief Funct Genomics"},{"key":"2020080807562367100_bby037-B10","doi-asserted-by":"crossref","first-page":"81","DOI":"10.3389\/fgene.2013.00081","article-title":"A review of computational tools in microRNA discovery","volume":"4","author":"Gomes","year":"2013","journal-title":"Front Genet"},{"key":"2020080807562367100_bby037-B11","doi-asserted-by":"crossref","first-page":"D152","DOI":"10.1093\/nar\/gkq1027","article-title":"miRBase: integrating microRNA annotation and deep-sequencing data","volume":"39","author":"Kozomara","year":"2011","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2020080807562367100_bby037-B12","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1186\/1471-2105-6-310","article-title":"Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine","volume":"6","author":"Xue","year":"2005","journal-title":"BMC Bioinformatics"},{"issue":"14","key":"2020080807562367100_bby037-B13","doi-asserted-by":"crossref","first-page":"e197","DOI":"10.1093\/bioinformatics\/btl257","article-title":"Hairpins in a Haystack: recognizing microRNA precursors in comparative genomics data","volume":"22","author":"Hertel","year":"2006","journal-title":"Bioinformatics"},{"issue":"1","key":"2020080807562367100_bby037-B14","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1186\/1471-2105-8-341","article-title":"MiRFinder: an improved approach and software implementation for genome-wide fast microRNA precursor scans","volume":"8","author":"Huang","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2020080807562367100_bby037-B15","doi-asserted-by":"crossref","first-page":"W339","DOI":"10.1093\/nar\/gkm368","article-title":"MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features","volume":"35","author":"Jiang","year":"2007","journal-title":"Nucleic Acids Res"},{"issue":"13","key":"2020080807562367100_bby037-B16","doi-asserted-by":"crossref","first-page":"i50","DOI":"10.1093\/bioinformatics\/btn175","article-title":"MicroRNA prediction with a novel ranking algorithm based on random walks","volume":"24","author":"Xu","year":"2008","journal-title":"Bioinformatics"},{"issue":"8","key":"2020080807562367100_bby037-B17","doi-asserted-by":"crossref","first-page":"e11843","DOI":"10.1371\/journal.pone.0011843","article-title":"MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors","volume":"5","author":"Gkirtzou","year":"2010","journal-title":"PLoS One"},{"issue":"1","key":"2020080807562367100_bby037-B18","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1186\/1471-2105-14-83","article-title":"HuntMi: an efficient and taxon-specific approach in pre-miRNA identification","volume":"14","author":"Gudy\u015b","year":"2013","journal-title":"BMC Bioinformatics"},{"issue":"4","key":"2020080807562367100_bby037-B19","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/j.ygeno.2012.02.001","article-title":"MiRANN: a reliable approach for improved classification of precursor microRNA using Artificial Neural Network model","volume":"99","author":"Rahman","year":"2012","journal-title":"Genomics"},{"issue":"11","key":"2020080807562367100_bby037-B20","doi-asserted-by":"crossref","first-page":"1321","DOI":"10.1093\/bioinformatics\/btm026","article-title":"De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures","volume":"23","author":"Ng","year":"2007","journal-title":"Bioinformatics"},{"issue":"1","key":"2020080807562367100_bby037-B21","first-page":"209","article-title":"Computational methods for ab initio detection of microRNAs","volume":"3","author":"Allmer","year":"2012","journal-title":"Front Genet"},{"issue":"6","key":"2020080807562367100_bby037-B22","doi-asserted-by":"crossref","first-page":"274","DOI":"10.1016\/j.ygeno.2016.04.002","article-title":"MicroRNA discovery in the human parasite Echinococcus multilocularis from genome-wide data","volume":"107","author":"Kamenetzky","year":"2016","journal-title":"Genomics"},{"issue":"6","key":"2020080807562367100_bby037-B23","doi-asserted-by":"crossref","first-page":"1316","DOI":"10.1109\/TCBB.2016.2576459","article-title":"High class-imbalance in pre-miRNA prediction: a novel approach based on deepSOM","volume":"14","author":"Stegmayer","year":"2017","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"S19","key":"2020080807562367100_bby037-B24","doi-asserted-by":"crossref","first-page":"507.","DOI":"10.1186\/s12859-016-1367-0","article-title":"Grouping miRNAs of similar functions via weighted information content of gene ontology","volume":"17","author":"Lan","year":"2016","journal-title":"BMC Bioinformatics"},{"issue":"23","key":"2020080807562367100_bby037-B25","doi-asserted-by":"crossref","first-page":"3034","DOI":"10.1093\/bioinformatics\/bts574","article-title":"Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches","volume":"28","author":"Mendes","year":"2012","journal-title":"Bioinformatics"},{"key":"2020080807562367100_bby037-B26","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1186\/1471-2105-11-133","article-title":"MapMi: automated mapping of microRNA loci","volume":"11","author":"Guerra-Assuncao","year":"2010","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2020080807562367100_bby037-B27","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1038\/s41467-017-00403-z","article-title":"On the performance of pre-microRNA detection algorithms","volume":"8","author":"Demirci","year":"2017","journal-title":"Nat Commun"},{"issue":"1","key":"2020080807562367100_bby037-B28","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/bib\/bbs075","article-title":"Identifying miRNAs, targets and functions","volume":"15","author":"Liu","year":"2014","journal-title":"Brief Bioinform"},{"issue":"1","key":"2020080807562367100_bby037-B29","doi-asserted-by":"crossref","first-page":"437","DOI":"10.1007\/978-1-62703-709-9_20","article-title":"Computational prediction of microRNA genes","volume":"1097","author":"Hertel","year":"2014","journal-title":"Methods Mol Biol"},{"issue":"8","key":"2020080807562367100_bby037-B30","doi-asserted-by":"crossref","first-page":"2419","DOI":"10.1093\/nar\/gkp145","article-title":"Current tools for the identification of miRNA genes and their targets","volume":"37","author":"Mendes","year":"2009","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2020080807562367100_bby037-B31","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1101\/gr.2908205","article-title":"Computational prediction of miRNAs in Arabidopsis thaliana","volume":"15","author":"Adai","year":"2005","journal-title":"Genome Res"},{"issue":"1","key":"2020080807562367100_bby037-B32","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1186\/1471-2105-6-267","article-title":"Identification of clustered microRNAs using an ab initio prediction method","volume":"6","author":"Sewer","year":"2005","journal-title":"BMC Bioinformatics"},{"issue":"2","key":"2020080807562367100_bby037-B33","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1093\/bioinformatics\/btl570","article-title":"Reliable prediction of Drosha processing sites improves microRNA gene prediction","volume":"23","author":"Helvik","year":"2007","journal-title":"Bioinformatics"},{"issue":"Suppl 11","key":"2020080807562367100_bby037-B34","doi-asserted-by":"crossref","first-page":"S11.","DOI":"10.1186\/1471-2105-11-S11-S11","article-title":"MiRenSVM: towards better prediction of microRNA precursors using an ensemble SVM classifier with multi-loop features","volume":"11","author":"Ding","year":"2010","journal-title":"BMC Bioinformatics"},{"issue":"9","key":"2020080807562367100_bby037-B35","doi-asserted-by":"crossref","first-page":"e946.","DOI":"10.1371\/journal.pone.0000946","article-title":"Mammalian MicroRNA prediction through a Support Vector Machine model of sequence and structure","volume":"2","author":"Sheng","year":"2007","journal-title":"PLoS One"},{"issue":"8","key":"2020080807562367100_bby037-B36","doi-asserted-by":"crossref","first-page":"989","DOI":"10.1093\/bioinformatics\/btp107","article-title":"microPred: effective classification of pre-miRNAs for human miRNA gene prediction","volume":"25","author":"Batuwita","year":"2009","journal-title":"Bioinformatics"},{"issue":"10","key":"2020080807562367100_bby037-B37","doi-asserted-by":"crossref","first-page":"1368","DOI":"10.1093\/bioinformatics\/btr153","article-title":"PlantMiRNAPred: efficient classification of real and pseudo plant pre-miRNAs","volume":"27","author":"Xuan","year":"2011","journal-title":"Bioinformatics"},{"issue":"1","key":"2020080807562367100_bby037-B38","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1186\/1471-2105-12-107","article-title":"MiRPara: a SVM-based software tool for prediction of most probable microRNA coding regions in genome scale sequences","volume":"12","author":"Wu","year":"2011","journal-title":"BMC Bioinformatics"},{"issue":"20","key":"2020080807562367100_bby037-B39","first-page":"e138","article-title":"A framework for improving microRNA prediction in non-human genomes","volume":"43","author":"Peace","year":"2015","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2020080807562367100_bby037-B40","doi-asserted-by":"crossref","first-page":"19062","DOI":"10.1038\/srep19062","article-title":"iMiRNA-SSF: improving the identification of microRNA precursors by combining negative sets with different distributions","volume":"6","author":"Chen","year":"2016","journal-title":"Sci Rep"},{"issue":"Suppl 1","key":"2020080807562367100_bby037-B41","doi-asserted-by":"crossref","first-page":"S9","DOI":"10.1186\/1471-2105-16-S1-S9","article-title":"ViralmiR: a support-vector-machine-based method for predicting viral microRNA precursors","volume":"16","author":"Huang","year":"2015","journal-title":"BMC Bioinformatics"},{"issue":"5","key":"2020080807562367100_bby037-B42","doi-asserted-by":"crossref","first-page":"1183","DOI":"10.1109\/TCBB.2014.2388227","article-title":"YamiPred: a novel evolutionary method for predicting pre-miRNAs and selecting relevant features","volume":"12","author":"Kleftogiannis","year":"2015","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"3","key":"2020080807562367100_bby037-B43","doi-asserted-by":"crossref","first-page":"e0121501","DOI":"10.1371\/journal.pone.0121501","article-title":"Identification of real microRNA Precursors with a Pseudo structure status composition approach","volume":"10","author":"Liu","year":"2015","journal-title":"PLoS One"},{"issue":"4","key":"2020080807562367100_bby037-B44","doi-asserted-by":"crossref","first-page":"1194","DOI":"10.1039\/C5MB00050E","article-title":"miRNA-dis: microRNA precursor identification based on distance structure status pairs","volume":"11","author":"Liu","year":"2015","journal-title":"Mol Biosyst"},{"issue":"11","key":"2020080807562367100_bby037-B45","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1093\/bioinformatics\/btl094","article-title":"Combining multi-species genomic data for microRNA identification using a naive Bayes classifier","volume":"22","author":"Yousef","year":"2006","journal-title":"Bioinformatics"},{"issue":"1","key":"2020080807562367100_bby037-B46","doi-asserted-by":"crossref","first-page":"e21","DOI":"10.1093\/nar\/gks878","article-title":"Heterogeneous ensemble approach with discriminative features and modified-smotebagging for pre-miRNA classification","volume":"41","author":"Lertampaiporn","year":"2013","journal-title":"Nucleic Acids Res"},{"issue":"9","key":"2020080807562367100_bby037-B47","doi-asserted-by":"crossref","first-page":"e45782-15","DOI":"10.1371\/journal.pone.0045782","article-title":"miR-BAG: bagging based identification of microRNA precursors","volume":"7","author":"Jha","year":"2012","journal-title":"PLoS One"},{"key":"2020080807562367100_bby037-B48","first-page":"96","volume-title":"IEEE International Conference on Big Data and Smart Computing, Korea","author":"Thomas","year":"2017"},{"key":"2020080807562367100_bby037-B49","author":"Thomas","year":"2017"},{"key":"2020080807562367100_bby037-B50","doi-asserted-by":"crossref","DOI":"10.1002\/0470854774","volume-title":"Statistical Pattern Recognition","author":"Webb","year":"2002"},{"key":"2020080807562367100_bby037-B51","volume-title":"Pattern Classification","author":"Duda","year":"2001","edition":"2nd edn."},{"key":"2020080807562367100_bby037-B52","volume-title":"Machine Learning","author":"Mitchell","year":"1997"},{"key":"2020080807562367100_bby037-B53","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-2440-0","volume-title":"The Nature of Statistical Learning Theory","author":"Vapnik","year":"1995"},{"issue":"1","key":"2020080807562367100_bby037-B54","first-page":"1889","article-title":"Working set selection using second order information for training support vector machines","volume":"6","author":"Fan","year":"2005","journal-title":"J Mach Learn Res"},{"key":"2020080807562367100_bby037-B55","volume-title":"Pattern Recognition and Machine Learning","author":"Bishop","year":"2006"},{"key":"2020080807562367100_bby037-B56","first-page":"249","volume-title":"Proceedings of the 5th Annual International Conference on Computational Biology","author":"Pavlidis","year":"2001"},{"issue":"1","key":"2020080807562367100_bby037-B57","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1613\/jair.953","article-title":"SMOTE: synthetic minority over-sampling","volume":"16","author":"Chawla","year":"2002","journal-title":"J Artif Intell Res"},{"issue":"1","key":"2020080807562367100_bby037-B58","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach Learn"},{"issue":"6","key":"2020080807562367100_bby037-B59","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1016\/j.ygeno.2012.04.003","article-title":"Random forests for genomic data analysis","volume":"99","author":"Chen","year":"2012","journal-title":"Genomics"},{"key":"2020080807562367100_bby037-B60","volume-title":"Machine Learning. A Probabilistic Approach","author":"Murphy","year":"2012"},{"key":"2020080807562367100_bby037-B61","volume-title":"Clustering","author":"Xu","year":"2009"},{"issue":"15","key":"2020080807562367100_bby037-B62","doi-asserted-by":"crossref","first-page":"3201.","DOI":"10.1093\/bioinformatics\/bti517","article-title":"Computational cluster validation in post-genomic data analysis","volume":"21","author":"Handl","year":"2005","journal-title":"Bioinformatics"},{"key":"2020080807562367100_bby037-B63","volume-title":"Clustering Methods. Data Mining and Knowledge Discovery Handbook","author":"Rokach","year":"2005"},{"issue":"8","key":"2020080807562367100_bby037-B64","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1016\/j.patrec.2009.09.011","article-title":"Data clustering: 50 years beyond k-means","volume":"31","author":"Jain","year":"2010","journal-title":"Pattern Recogn Lett"},{"key":"2020080807562367100_bby037-B65","first-page":"849","volume-title":"Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic","author":"Ng","year":"2001"},{"issue":"4","key":"2020080807562367100_bby037-B66","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1007\/s11222-007-9033-z","article-title":"A tutorial on spectral clustering","volume":"17","author":"von Luxburg","year":"2007","journal-title":"Stat Comput"},{"issue":"1","key":"2020080807562367100_bby037-B67","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1007\/BF00337288","article-title":"Self-organized formation of topologically correct feature maps","volume":"43","author":"Kohonen","year":"1982","journal-title":"Biological Cybernetics"},{"key":"2020080807562367100_bby037-B68","volume-title":"Self-Organizing Maps","author":"Kohonen","year":"2005"},{"issue":"4","key":"2020080807562367100_bby037-B69","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/MCI.2012.2215122","article-title":"Data mining over biological datasets: an integrated approach based on computational intelligence","volume":"7","author":"Stegmayer","year":"2012","journal-title":"IEEE Comput Intell Mag"},{"issue":"1","key":"2020080807562367100_bby037-B70","doi-asserted-by":"crossref","first-page":"438","DOI":"10.1186\/1471-2105-11-438","article-title":"omeSOM: a software for clustering and visualization of transcriptional and metabolite data mined from interspecific crosses of crop plants","volume":"11","author":"Milone","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2020080807562367100_bby037-B71","first-page":"14","volume-title":"An Introduction to Restricted Boltzmann Machines in Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Lecture Notes in Computer Science","author":"Fischer","year":"2012"},{"issue":"6","key":"2020080807562367100_bby037-B72","doi-asserted-by":"crossref","first-page":"1631","DOI":"10.1162\/neco.2008.04-07-510","article-title":"Representational power of restricted Boltzmann machines and deep belief networks","volume":"20","author":"Le Roux","year":"2008","journal-title":"Neural Comput"},{"key":"2020080807562367100_bby037-B73","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.biosystems.2015.10.003","article-title":"miRNAfe: a comprehensive tool for feature extraction in microRNA prediction","volume":"138","author":"Yones","year":"2015","journal-title":"Biosystems"},{"issue":"1","key":"2020080807562367100_bby037-B74","doi-asserted-by":"crossref","first-page":"106.","DOI":"10.1186\/1471-2105-14-106","article-title":"SMOTE for high-dimensional class-imbalanced data","volume":"14","author":"Blagus","year":"2013","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2020080807562367100_bby037-B75","first-page":"1","article-title":"Statistical comparisons of classifiers over multiple data sets","volume":"7","author":"Demsar","year":"2006","journal-title":"J Mach Learn Res"},{"issue":"1","key":"2020080807562367100_bby037-B76","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1007\/s10115-014-0794-3","article-title":"Class imbalance revisited: a new experimental setup to assess the performance of treatment methods","volume":"45","author":"Prati","year":"2015","journal-title":"Knowl Inform Syst"},{"issue":"1","key":"2020080807562367100_bby037-B77","doi-asserted-by":"crossref","first-page":"192","DOI":"10.1109\/TCBB.2013.146","article-title":"Improved and promising identification of human micrornas by incorporating a high-quality negative set","volume":"11","author":"Wei","year":"2014","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/20\/5\/1607\/33616701\/bby037.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/20\/5\/1607\/33616701\/bby037.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,8,8]],"date-time":"2020-08-08T11:59:09Z","timestamp":1596887949000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/20\/5\/1607\/5001762"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,5,23]]},"references-count":77,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2018,5,23]]},"published-print":{"date-parts":[[2019,9,27]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bby037","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,9]]},"published":{"date-parts":[[2018,5,23]]}}}