{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,24]],"date-time":"2026-01-24T14:17:19Z","timestamp":1769264239036,"version":"3.49.0"},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Algorithms Mol Biol"],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>The application of machine learning to classification problems that depend only on positive examples is gaining attention in the computational biology community. We and others have described the use of two-class machine learning to identify novel miRNAs. These methods require the generation of an artificial negative class. However, designation of the negative class can be problematic and if it is not properly done can affect the performance of the classifier dramatically and\/or yield a biased estimate of performance. We present a study using one-class machine learning for microRNA (miRNA) discovery and compare one-class to two-class approaches using na\u00efve Bayes and Support Vector Machines. These results are compared to published two-class miRNA prediction approaches. We also examine the ability of the one-class and two-class techniques to identify miRNAs in newly sequenced species.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>Of all methods tested, we found that 2-class naive Bayes and Support Vector Machines gave the best accuracy using our selected features and optimally chosen negative examples. One class methods showed average accuracies of 70\u201380% versus 90% for the two 2-class methods on the same feature sets. However, some one-class methods outperform some recently published two-class approaches with different selected features. Using the EBV genome as and external validation of the method we found one-class machine learning to work as well as or better than a two-class approach in identifying true miRNAs as well as predicting new miRNAs.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>One and two class methods can both give useful classification accuracies when the negative class is well characterized. The advantage of one class methods is that it eliminates guessing at the optimal features for the negative class when they are not well defined. In these cases one-class methods can be superior to two-class methods when the features which are chosen as representative of that positive class are well defined.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Availability<\/jats:title>\n            <jats:p>The OneClassmiRNA program is available at: [1]<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1748-7188-3-2","type":"journal-article","created":{"date-parts":[[2008,1,29]],"date-time":"2008-01-29T07:15:08Z","timestamp":1201590908000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":53,"title":["Learning from positive examples when the negative class is undetermined- microRNA gene identification"],"prefix":"10.1186","volume":"3","author":[{"given":"Malik","family":"Yousef","sequence":"first","affiliation":[]},{"given":"Segun","family":"Jung","sequence":"additional","affiliation":[]},{"given":"Louise C","family":"Showe","sequence":"additional","affiliation":[]},{"given":"Michael K","family":"Showe","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2008,1,28]]},"reference":[{"key":"43_CR1","unstructured":"One Class MiRNAfind Gene Prediction Web Server. http:\/\/wotan.wistar.upenn.edu\/OneClassmiRNA\/"},{"issue":"2","key":"43_CR2","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1016\/S0092-8674(04)00045-5","volume":"116","author":"DP Bartel","year":"2004","unstructured":"Bartel DP: MicroRNAs: Genomics, Biogenesis, Mechanism, and Function. Cell. 2004, 116 (2): 281-","journal-title":"Cell"},{"issue":"5612","key":"43_CR3","doi-asserted-by":"publisher","first-page":"1540","DOI":"10.1126\/science.1080372","volume":"299","author":"LP Lim","year":"2003","unstructured":"Lim LP, Glasner ME, Yekta S, Burge CB, Bartel DP: Vertebrate MicroRNA Genes. Science. 2003, 299 (5612): 1540-","journal-title":"Science"},{"issue":"8","key":"43_CR4","doi-asserted-by":"publisher","first-page":"991","DOI":"10.1101\/gad.1074403","volume":"17","author":"LP Lim","year":"2003","unstructured":"Lim LP, Lau NC, Weinstein EG, Abdelhakim A, Yekta S, Rhoades MW, Burge CB, Bartel DP: The microRNAs of Caenorhabditis elegans. Genes Dev. 2003, 17 (8): 991-1008.","journal-title":"Genes Dev"},{"issue":"1","key":"43_CR5","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1111\/j.1432-1033.2004.04389.x","volume":"272","author":"MJ Weber","year":"2005","unstructured":"Weber MJ: New human and mouse microRNA genes found by homology search. FEBS Journal. 2005, 272 (1): 59-73.","journal-title":"FEBS Journal"},{"issue":"7","key":"43_CR6","doi-asserted-by":"publisher","first-page":"R42","DOI":"10.1186\/gb-2003-4-7-r42","volume":"4","author":"E Lai","year":"2003","unstructured":"Lai E, Tomancak P, Williams R, Rubin G: Computational identification of Drosophila microRNA genes. Genome Biology. 2003, 4 (7): R42-","journal-title":"Genome Biology"},{"issue":"5","key":"43_CR7","doi-asserted-by":"publisher","first-page":"1253","DOI":"10.1016\/S1097-2765(03)00153-9","volume":"11","author":"Y Grad","year":"2003","unstructured":"Grad Y, Aach J, Hayes GD, Reinhart BJ, Church GM, Ruvkun G, Kim J: Computational and Experimental Identification of C. elegans microRNAs. Molecular Cell. 2003, 11 (5): 1253-","journal-title":"Molecular Cell"},{"issue":"11","key":"43_CR8","doi-asserted-by":"publisher","first-page":"3570","DOI":"10.1093\/nar\/gki668","volume":"33","author":"J-W Nam","year":"2005","unstructured":"Nam J-W, Shin K-R, Han J, Lee Y, Kim VN, Zhang B-T: Human microRNA prediction through a probabilistic co-learning model of sequence and structure. Nucl Acids Res. 2005, 33 (11): 3570-3581.","journal-title":"Nucl Acids Res"},{"issue":"4","key":"43_CR9","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1038\/nmeth746","volume":"2","author":"S Pfeffer","year":"2005","unstructured":"Pfeffer S, Sewer A, Lagos-Quintana M, Sheridan R, Sander C, Grasser FA, van Dyk LF, Ho CK, Shuman S, Chien M: Identification of microRNAs of the herpesvirus family. Nat Meth. 2005, 2 (4): 269-","journal-title":"Nat Meth"},{"issue":"1","key":"43_CR10","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1186\/1471-2105-6-267","volume":"6","author":"A Sewer","year":"2005","unstructured":"Sewer A, Paul N, Landgraf P, Aravin A, Pfeffer S, Brownstein M, Tuschl T, van Nimwegen E, Zavolan M: Identification of clustered microRNAs using an ab initio prediction method. BMC Bioinformatics. 2005, 6 (1): 267-","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"43_CR11","doi-asserted-by":"publisher","first-page":"310","DOI":"10.1186\/1471-2105-6-310","volume":"6","author":"C Xue","year":"2005","unstructured":"Xue C, Li F, He T, Liu G-P, Li Y, Zhang X: Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine. BMC Bioinformatics. 2005, 6 (1): 310-","journal-title":"BMC Bioinformatics"},{"key":"43_CR12","volume-title":"Nat Genet","author":"E Berezikov","year":"2006","unstructured":"Berezikov E, Cuppen E, Plasterk RHA: Approaches to microRNA discovery. Nat Genet. 2006"},{"issue":"11","key":"43_CR13","doi-asserted-by":"publisher","first-page":"1325","DOI":"10.1093\/bioinformatics\/btl094","volume":"22","author":"M Yousef","year":"2006","unstructured":"Yousef M, Nebozhyn M, Shatkay H, Kanterakis S, Showe LC, Showe MK: Combining multi-species genomic data for microRNA identification using a Naive Bayes classifier. Bioinformatics. 2006, 22 (11): 1325-1334.","journal-title":"Bioinformatics"},{"key":"43_CR14","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1186\/1471-2105-7-411","volume":"7","author":"S-K Kim","year":"2006","unstructured":"Kim S-K, Nam J-W, Rhee J-K, Lee W-J, Zhang B-T: miTarget: microRNA target gene prediction using a support vector machine. BMC Bioinformatics. 2006, 7: 411-","journal-title":"BMC Bioinformatics"},{"key":"43_CR15","first-page":"46","volume-title":"IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology: 2005","author":"K Sung-Kyu","year":"2005","unstructured":"Sung-Kyu K, Jin-Wu N, Wha-Jin L, Byoung-Tak Z: A Kernel Method for MicroRNA Target Prediction Using Sensible Data and Position-Based Features. IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology: 2005. 2005, 46-52."},{"key":"43_CR16","first-page":"btl441","volume-title":"Bioinformatics","author":"C Wang","year":"2006","unstructured":"Wang C, Ding C, Meraz RF, Holbrook SR: PSoL: a positive sample only learning algorithm for finding non-coding RNA genes. Bioinformatics. 2006, btl441-"},{"issue":"14","key":"43_CR17","doi-asserted-by":"publisher","first-page":"e197","DOI":"10.1093\/bioinformatics\/btl257","volume":"22","author":"J Hertel","year":"2006","unstructured":"Hertel J, Stadler PF: Hairpins in a Haystack: recognizing microRNA precursors in comparative genomics data. Bioinformatics. 2006, 22 (14): e197-e202.","journal-title":"Bioinformatics"},{"issue":"3","key":"43_CR18","doi-asserted-by":"publisher","first-page":"e23","DOI":"10.1371\/journal.ppat.0020023","volume":"2","author":"X Cai","year":"2006","unstructured":"Cai X, Sch , auml , fer A, Lu S, Bilello JP, Desrosiers RC, Edwards R, Raab-Traub N, Cullen BR: Epstein-Barr Virus MicroRNAs Are Evolutionarily Conserved and Differentially Expressed. PLoS Pathogens. 2006, 2 (3): e23-","journal-title":"PLoS Pathogens"},{"issue":"5","key":"43_CR19","doi-asserted-by":"publisher","first-page":"733","DOI":"10.1261\/rna.2326106","volume":"12","author":"A Grundhoff","year":"2006","unstructured":"Grundhoff A, Sullivan CS, Ganem D: A combined computational and microarray-based approach identifies novel microRNAs encoded by human gamma-herpesviruses. RNA. 2006, 12 (5): 733-750.","journal-title":"RNA"},{"key":"43_CR20","unstructured":"NCBI. http:\/\/www.ncbi.nlm.nih.gov"},{"issue":"90001","key":"43_CR21","doi-asserted-by":"publisher","first-page":"D109","DOI":"10.1093\/nar\/gkh023","volume":"32","author":"S Griffiths-Jones","year":"2004","unstructured":"Griffiths-Jones S: The microRNA Registry. Nucl Acids Res. 2004, 32 (90001): D109-111.","journal-title":"Nucl Acids Res"},{"issue":"2","key":"43_CR22","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1145\/772862.772878","volume":"4","author":"A Kowalczyk","year":"2002","unstructured":"Kowalczyk A, Raskutti B: One Class SVM for Yeast Regulation Prediction. SIGKDD Explorations. 2002, 4 (2): 99-100.","journal-title":"SIGKDD Explorations"},{"issue":"3","key":"43_CR23","first-page":"608","volume":"4","author":"EJ Spinosa","year":"2005","unstructured":"Spinosa EJ, Carvalho ACPLFd: Support vector machines for novel class detection in Bioinformatics. Genetics and Molecular Research (GMR). 2005, 4 (3): 608-615.","journal-title":"Genetics and Molecular Research (GMR)"},{"key":"43_CR24","volume-title":"Proceedings of the Twenty-First International Conference on Machine Learning (ICML): 2004","author":"K Crammer","year":"2004","unstructured":"Crammer K, Chechik G: A Needle in a Haystack: Local One-Class Optimization. Proceedings of the Twenty-First International Conference on Machine Learning (ICML): 2004. 2004"},{"key":"43_CR25","first-page":"273","volume-title":"Proceedings of the 22nd international conference on Machine learning 2005 Bonn, Germany","author":"G Gupta","year":"2005","unstructured":"Gupta G, Ghosh J: Robust one-class clustering using hybrid global and local search. Proceedings of the 22nd international conference on Machine learning 2005 Bonn, Germany. 2005, 273-280. ACM Press"},{"key":"43_CR26","first-page":"139","volume-title":"Journal of Machine Learning Research","author":"LM Manevitz","year":"2001","unstructured":"Manevitz LM, Yousef M: One-Class SVMs for Document Classification. Journal of Machine Learning Research. 2001, 139-154."},{"issue":"4","key":"43_CR27","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/j.media.2004.09.001","volume":"8","author":"B Thirion","year":"2004","unstructured":"Thirion B, Faugeras O: Feature characterization in fMRI data: the Information Bottleneck approach. Medical Image Analysis. 2004, 8 (4): 403-","journal-title":"Medical Image Analysis"},{"key":"43_CR28","first-page":"62","volume-title":"Proceedings of the twenty-first international conference on Machine learning 2004; Banff, Alberta, Canada","author":"M Koppel","year":"2004","unstructured":"Koppel M, Schler J: Authorship verification as a one-class classification problem. Proceedings of the twenty-first international conference on Machine learning 2004; Banff, Alberta, Canada. 2004, 62-ACM Press"},{"issue":"13","key":"43_CR29","doi-asserted-by":"publisher","first-page":"3406","DOI":"10.1093\/nar\/gkg595","volume":"31","author":"M Zuker","year":"2003","unstructured":"Zuker M: Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 2003, 31 (13): 3406-3415.","journal-title":"Nucleic Acids Res"},{"key":"43_CR30","volume-title":"One-class classification; Concept-learning in the absence of counter-examples","author":"DMJ Tax","year":"2001","unstructured":"Tax DMJ: One-class classification; Concept-learning in the absence of counter-examples. 2001"},{"issue":"7","key":"43_CR31","doi-asserted-by":"publisher","first-page":"1443","DOI":"10.1162\/089976601750264965","volume":"13","author":"B Scholkopf","year":"2001","unstructured":"Scholkopf B, Platt JC, Shawe-Taylor J, Smola AJ, Williamson RC: Estimating the Support of a High-Dimensional Distribution. Neural Comp. 2001, 13 (7): 1443-1471.","journal-title":"Neural Comp"},{"key":"43_CR32","volume-title":"LIBSVM: a library for support vector machines","author":"C-C Chang","year":"2001","unstructured":"Chang C-C, Lin C-J: LIBSVM: a library for support vector machines. 2001"},{"key":"43_CR33","volume-title":"DDtools, the Data Description Toolbox for Matlab","author":"DMJ Tax","year":"2005","unstructured":"Tax DMJ: DDtools, the Data Description Toolbox for Matlab. 2005"},{"key":"43_CR34","volume-title":"Advances in Kernel Methods","author":"B Sch\u00f6lkopf","year":"1999","unstructured":"Sch\u00f6lkopf B, Burges CJC, Smola AJ: Advances in Kernel Methods. 1999, Cambridge, MA: MIT Press"},{"key":"43_CR35","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-2440-0","volume-title":"The Nature of Statistical Learning Theory","author":"V Vapnik","year":"1995","unstructured":"Vapnik V: The Nature of Statistical Learning Theory. 1995, Springer"},{"issue":"2","key":"43_CR36","doi-asserted-by":"publisher","first-page":"442","DOI":"10.1016\/0005-2795(75)90109-9","volume":"405","author":"B Matthews","year":"1975","unstructured":"Matthews B: Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta. 1975, 405 (2): 442-451.","journal-title":"Biochim Biophys Acta"}],"container-title":["Algorithms for Molecular Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1748-7188-3-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T21:35:41Z","timestamp":1630445741000},"score":1,"resource":{"primary":{"URL":"https:\/\/almob.biomedcentral.com\/articles\/10.1186\/1748-7188-3-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,1,28]]},"references-count":36,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["43"],"URL":"https:\/\/doi.org\/10.1186\/1748-7188-3-2","relation":{},"ISSN":["1748-7188"],"issn-type":[{"value":"1748-7188","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,1,28]]},"assertion":[{"value":"22 June 2007","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 January 2008","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 January 2008","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"2"}}