{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T20:44:53Z","timestamp":1778273093198,"version":"3.51.4"},"reference-count":72,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,1,15]],"date-time":"2024-01-15T00:00:00Z","timestamp":1705276800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,1,15]],"date-time":"2024-01-15T00:00:00Z","timestamp":1705276800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>In recent years, human microbiome studies have received increasing attention as this field is considered a potential source for clinical applications. With the advancements in omics technologies and AI, research focused on the discovery for potential biomarkers in the human microbiome using machine learning tools has produced positive outcomes. Despite the promising results, several issues can still be found in these studies such as datasets with small number of samples, inconsistent results, lack of uniform processing and methodologies, and other additional factors lead to lack of reproducibility in biomedical research. In this work, we propose a methodology that combines the DADA2 pipeline for 16s rRNA sequences processing and the Recursive Ensemble Feature Selection (REFS) in multiple datasets to increase reproducibility and obtain robust and reliable results in biomedical research.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Three experiments were performed analyzing microbiome data from patients\/cases in Inflammatory Bowel Disease (IBD), Autism Spectrum Disorder (ASD), and Type 2 Diabetes (T2D). In each experiment, we found a biomarker signature in one dataset and applied to 2 other as further validation. The effectiveness of the proposed methodology was compared with other feature selection methods such as K-Best with F-score and random selection as a base line. The Area Under the Curve (AUC) was employed as a measure of diagnostic accuracy and used as a metric for comparing the results of the proposed methodology with other feature selection methods. Additionally, we use the Matthews Correlation Coefficient (MCC) as a metric to evaluate the performance of the methodology as well as for comparison with other feature selection methods.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>We developed a methodology for reproducible biomarker discovery for 16s rRNA microbiome sequence analysis, addressing the issues related with data dimensionality, inconsistent results and validation across independent datasets. The findings from the three experiments, across 9 different datasets, show that the proposed methodology achieved higher accuracy compared to other feature selection methods. This methodology is a first approach to increase reproducibility, to provide robust and reliable results.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12859-024-05639-3","type":"journal-article","created":{"date-parts":[[2024,1,15]],"date-time":"2024-01-15T05:02:19Z","timestamp":1705294939000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Methodology for biomarker discovery with reproducibility in microbiome data using machine learning"],"prefix":"10.1186","volume":"25","author":[{"given":"David","family":"Rojas-Velazquez","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sarah","family":"Kidwai","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Aletta D.","family":"Kraneveld","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alberto","family":"Tonda","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Oberski","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Johan","family":"Garssen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alejandro","family":"Lopez-Rincon","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,1,15]]},"reference":[{"issue":"9","key":"5639_CR1","doi-asserted-by":"publisher","first-page":"1716","DOI":"10.1136\/gutjnl-2018-316723","volume":"67","author":"PD Cani","year":"2018","unstructured":"Cani PD. Human gut microbiome: hopes, threats and promises. Gut. 2018;67(9):1716\u201325.","journal-title":"Gut"},{"issue":"3","key":"5639_CR2","doi-asserted-by":"publisher","first-page":"126","DOI":"10.3390\/pathogens8030126","volume":"8","author":"I Khan","year":"2019","unstructured":"Khan I, Ullah N, Zha L, Bai Y, Khan A, Zhao T, Che T, Zhang C. Alteration of gut microbiota in inflammatory bowel disease (ibd): Cause or consequence? ibd treatment targeting the gut microbiome. Pathogens. 2019;8(3):126.","journal-title":"Pathogens"},{"issue":"4","key":"5639_CR3","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1038\/nrgastro.2017.25","volume":"14","author":"I Dickson","year":"2017","unstructured":"Dickson I. Diagnosing ibd with the gut microbiome. Nat Rev Gastroenterol Hepatol. 2017;14(4):195\u2013195.","journal-title":"Nat Rev Gastroenterol Hepatol"},{"issue":"1","key":"5639_CR4","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1111\/apt.14384","volume":"47","author":"J McIlroy","year":"2018","unstructured":"McIlroy J, Ianiro G, Mukhopadhya I, Hansen R, Hold G. the gut microbiome in inflammatory bowel disease-avenues for microbial management. Aliment Pharmacol Ther. 2018;47(1):26\u201342.","journal-title":"Aliment Pharmacol Ther"},{"issue":"10","key":"5639_CR5","doi-asserted-by":"publisher","first-page":"1799","DOI":"10.1002\/ibd.22860","volume":"18","author":"S Michail","year":"2012","unstructured":"Michail S, Durbin M, Turner D, Griffiths AM, Mack DR, Hyams J, Leleiko N, Kenche H, Stolfi A, Wine E. Alterations in the gut microbiome of children with severe ulcerative colitis. Inflamm Bowel Dis. 2012;18(10):1799\u2013808.","journal-title":"Inflamm Bowel Dis"},{"issue":"1","key":"5639_CR6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-019-56847-4","volume":"10","author":"Q Li","year":"2020","unstructured":"Li Q, Chang Y, Zhang K, Chen H, Tao S, Zhang Z. Implication of the gut microbiome composition of type 2 diabetic patients from northern china. Sci Rep. 2020;10(1):1\u20138.","journal-title":"Sci Rep"},{"key":"5639_CR7","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1016\/j.jnutbio.2018.10.003","volume":"63","author":"S Sharma","year":"2019","unstructured":"Sharma S, Tripathi P. Gut microbiome and type 2 diabetes: Where we are and where to go? J Nutr Biochem. 2019;63:101\u20138.","journal-title":"J Nutr Biochem"},{"key":"5639_CR8","first-page":"1","volume":"18","author":"\u00d6 Aydin","year":"2018","unstructured":"Aydin \u00d6, Nieuwdorp M, Gerdes V. The gut microbiome as a target for the treatment of type 2 diabetes. Curr DiabRep. 2018;18:1\u201311.","journal-title":"Curr DiabRep"},{"issue":"2","key":"5639_CR9","doi-asserted-by":"publisher","first-page":"258","DOI":"10.1016\/j.chom.2020.06.004","volume":"28","author":"S Reitmeier","year":"2020","unstructured":"Reitmeier S, Kiessling S, Clavel T, List M, Almeida EL, Ghosh TS, Neuhaus K, Grallert H, Linseisen J, Skurk T, et al. Arrhythmic gut microbiome signatures predict risk of type 2 diabetes. Cell Host Microbe. 2020;28(2):258\u201372.","journal-title":"Cell Host Microbe"},{"key":"5639_CR10","doi-asserted-by":"crossref","unstructured":"Pulikkan J, Mazumder A, Grace T. Role of the gut microbiome in autism spectrum disorders. Rev Biomark Stud Psychiatr Neurodegener Disorders 2019;253\u2013269","DOI":"10.1007\/978-3-030-05542-4_13"},{"key":"5639_CR11","doi-asserted-by":"crossref","unstructured":"Li Q, Han Y, Dy ABC, Hagerman RJ. The gut microbiota and autism spectrum disorders. Front Cell Neurosci 2017;120","DOI":"10.3389\/fncel.2017.00120"},{"issue":"8","key":"5639_CR12","doi-asserted-by":"publisher","first-page":"2251","DOI":"10.3390\/ijms19082251","volume":"19","author":"G Fowlie","year":"2018","unstructured":"Fowlie G, Cohen N, Ming X. The perturbance of microbiome and gut-brain axis in autism spectrum disorders. Int J Mol Sci. 2018;19(8):2251.","journal-title":"Int J Mol Sci"},{"issue":"5","key":"5639_CR13","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1016\/j.biopsych.2016.08.024","volume":"81","author":"HE Vuong","year":"2017","unstructured":"Vuong HE, Hsiao EY. Emerging roles for the gut microbiome in autism spectrum disorder. Biol Psychiat. 2017;81(5):411\u201323.","journal-title":"Biol Psychiat"},{"key":"5639_CR14","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1016\/j.gpb.2022.07.002","volume":"21","author":"Y Liu","year":"2022","unstructured":"Liu Y, Lau HC-H, Cheng WY, Yu J. Gut microbiome in colorectal cancer: clinical diagnosis and treatment. Genom Proteom Bioinform. 2022;21:84\u201396.","journal-title":"Genom Proteom Bioinform"},{"key":"5639_CR15","doi-asserted-by":"publisher","first-page":"803","DOI":"10.1007\/s00253-020-11043-7","volume":"105","author":"Y Zhang","year":"2021","unstructured":"Zhang Y, Shen J, Shi X, Du Y, Niu Y, Jin G, Wang Z, Lyu J. Gut microbiome analysis as a predictive marker for the gastric cancer patients. Appl Microbiol Biotechnol. 2021;105:803\u201314.","journal-title":"Appl Microbiol Biotechnol"},{"issue":"1","key":"5639_CR16","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1007\/s12072-021-10279-3","volume":"16","author":"T Deng","year":"2022","unstructured":"Deng T, Li J, He B, Chen B, Liu F, Chen Z, Zheng J, Shi Z, Zhang T, Deng L, et al. Gut microbiome alteration as a diagnostic tool and associated with inflammatory response marker in primary liver cancer. Hep Intl. 2022;16(1):99\u2013111.","journal-title":"Hep Intl"},{"key":"5639_CR17","doi-asserted-by":"publisher","DOI":"10.3389\/fcimb.2021.725284","volume":"11","author":"F Zhao","year":"2021","unstructured":"Zhao F, An R, Wang L, Shan J, Wang X. Specific gut microbiome and serum metabolome changes in lung cancer patients. Front Cell Infect Microbiol. 2021;11: 725284.","journal-title":"Front Cell Infect Microbiol"},{"issue":"4","key":"5639_CR18","doi-asserted-by":"publisher","first-page":"570","DOI":"10.1016\/j.ccell.2018.03.015","volume":"33","author":"V Gopalakrishnan","year":"2018","unstructured":"Gopalakrishnan V, Helmink BA, Spencer CN, Reuben A, Wargo JA. The influence of the gut microbiome on cancer, immunity, and cancer immunotherapy. Cancer Cell. 2018;33(4):570\u201380.","journal-title":"Cancer Cell"},{"issue":"7","key":"5639_CR19","doi-asserted-by":"publisher","first-page":"850","DOI":"10.1038\/nm.4345","volume":"23","author":"H Wu","year":"2017","unstructured":"Wu H, Esteve E, Tremaroli V, Khan MT, Caesar R, Manner\u00e5s-Holm L, St\u00e5hlman M, Olsson LM, Serino M, Planas-F\u00e8lix M, et al. Metformin alters the gut microbiome of individuals with treatment-naive type 2 diabetes, contributing to the therapeutic effects of the drug. Nat Med. 2017;23(7):850\u20138.","journal-title":"Nat Med"},{"key":"5639_CR20","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1016\/j.lfs.2017.12.027","volume":"194","author":"Y Yang","year":"2018","unstructured":"Yang Y, Tian J, Yang B. Targeting gut microbiome: a novel and potential therapy for autism. Life Sci. 2018;194:111\u20139.","journal-title":"Life Sci"},{"key":"5639_CR21","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1007\/s11938-019-00221-w","volume":"17","author":"NC Knox","year":"2019","unstructured":"Knox NC, Forbes JD, Van Domselaar G, Bernstein CN. The gut microbiome as a target for ibd treatment: Are we there yet? Curr Treat Options Gastroenterol. 2019;17:115\u201326.","journal-title":"Curr Treat Options Gastroenterol"},{"issue":"1","key":"5639_CR22","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1097\/MOG.0000000000000139","volume":"31","author":"AB Shreiner","year":"2015","unstructured":"Shreiner AB, Kao JY, Young VB. The gut microbiome in health and in disease. Curr Opin Gastroenterol. 2015;31(1):69.","journal-title":"Curr Opin Gastroenterol"},{"issue":"8","key":"5639_CR23","doi-asserted-by":"publisher","first-page":"852","DOI":"10.1038\/s41587-019-0209-9","volume":"37","author":"E Bolyen","year":"2019","unstructured":"Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, Alexander H, Alm EJ, Arumugam M, Asnicar F, et al. Reproducible, interactive, scalable and extensible microbiome data science using qiime 2. Nat Biotechnol. 2019;37(8):852\u20137.","journal-title":"Nat Biotechnol"},{"key":"5639_CR24","doi-asserted-by":"publisher","first-page":"2584","DOI":"10.7717\/peerj.2584","volume":"4","author":"T Rognes","year":"2016","unstructured":"Rognes T, Flouri T, Nichols B, Quince C, Mah\u00e9 F. Vsearch: a versatile open source tool for metagenomics. PeerJ. 2016;4:2584.","journal-title":"PeerJ"},{"issue":"7","key":"5639_CR25","doi-asserted-by":"publisher","first-page":"581","DOI":"10.1038\/nmeth.3869","volume":"13","author":"BJ Callahan","year":"2016","unstructured":"Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. Dada2: high-resolution sample inference from illumina amplicon data. Nat Methods. 2016;13(7):581\u20133.","journal-title":"Nat Methods"},{"issue":"15","key":"5639_CR26","doi-asserted-by":"publisher","first-page":"2114","DOI":"10.1093\/bioinformatics\/btu170","volume":"30","author":"AM Bolger","year":"2014","unstructured":"Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics. 2014;30(15):2114\u201320.","journal-title":"Bioinformatics"},{"issue":"23","key":"5639_CR27","doi-asserted-by":"publisher","first-page":"7537","DOI":"10.1128\/AEM.01541-09","volume":"75","author":"PD Schloss","year":"2009","unstructured":"Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, Lesniewski RA, Oakley BB, Parks DH, Robinson CJ, et al. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol. 2009;75(23):7537\u201341.","journal-title":"Appl Environ Microbiol"},{"issue":"21","key":"5639_CR28","doi-asserted-by":"publisher","first-page":"2957","DOI":"10.1093\/bioinformatics\/btr507","volume":"27","author":"T Mago\u010d","year":"2011","unstructured":"Mago\u010d T, Salzberg SL. Flash: fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27(21):2957\u201363.","journal-title":"Bioinformatics"},{"key":"5639_CR29","doi-asserted-by":"publisher","first-page":"579","DOI":"10.3389\/fgene.2019.00579","volume":"10","author":"Y-H Zhou","year":"2019","unstructured":"Zhou Y-H, Gallins P. A review and tutorial of machine learning methods for microbiome host trait prediction. Front Genet. 2019;10:579.","journal-title":"Front Genet"},{"key":"5639_CR30","doi-asserted-by":"crossref","unstructured":"Chavira A, Wang EH-J, Mills RH. Meta-analysis of the autism gut microbiome identifies factors influencing study discrepancies and machine learning classification. bioRxiv 2022;2022\u201303","DOI":"10.1101\/2022.03.18.484910"},{"key":"5639_CR31","doi-asserted-by":"publisher","DOI":"10.1016\/j.micpath.2020.104568","volume":"149","author":"T Wu","year":"2020","unstructured":"Wu T, Wang H, Lu W, Zhai Q, Zhang Q, Yuan W, Gu Z, Zhao J, Zhang H, Chen W. Potential of gut microbiome for detection of autism spectrum disorder. Microb Pathog. 2020;149: 104568.","journal-title":"Microb Pathog"},{"issue":"5","key":"5639_CR32","doi-asserted-by":"publisher","first-page":"1555","DOI":"10.1161\/HYPERTENSIONAHA.120.15885","volume":"76","author":"S Aryal","year":"2020","unstructured":"Aryal S, Alimadadi A, Manandhar I, Joe B, Cheng X. Machine learning strategy for gut microbiome-based diagnostic screening of cardiovascular disease. Hypertension. 2020;76(5):1555\u201362.","journal-title":"Hypertension"},{"issue":"4","key":"5639_CR33","doi-asserted-by":"publisher","first-page":"1","DOI":"10.3892\/etm.2022.11234","volume":"23","author":"X Ge","year":"2022","unstructured":"Ge X, Zhang A, Li L, Sun Q, He J, Wu Y, Tan R, Pan Y, Zhao J, Xu Y, et al. Application of machine learning tools: potential and useful approach for the prediction of type 2 diabetes mellitus based on the gut microbiome profile. Exp Ther Med. 2022;23(4):1\u201310.","journal-title":"Exp Ther Med"},{"issue":"9","key":"5639_CR34","doi-asserted-by":"publisher","first-page":"3289","DOI":"10.3390\/nu13093289","volume":"13","author":"M Balvers","year":"2021","unstructured":"Balvers M, Deschasaux M, van den Born B-J, Zwinderman K, Nieuwdorp M, Levin E. Analyzing type 2 diabetes associations with the gut microbiome in individuals from two ethnic backgrounds living in the same geographic area. Nutrients. 2021;13(9):3289.","journal-title":"Nutrients"},{"key":"5639_CR35","doi-asserted-by":"publisher","DOI":"10.3389\/fcimb.2021.711884","volume":"11","author":"X Wang","year":"2021","unstructured":"Wang X, Xiao Y, Xu X, Guo L, Yu Y, Li N, Xu C. Characteristics of fecal microbiota and machine learning strategy for fecal invasive biomarkers in pediatric inflammatory bowel disease. Front Cell Infect Microbiol. 2021;11: 711884.","journal-title":"Front Cell Infect Microbiol"},{"issue":"3","key":"5639_CR36","doi-asserted-by":"publisher","first-page":"328","DOI":"10.1152\/ajpgi.00360.2020","volume":"320","author":"I Manandhar","year":"2021","unstructured":"Manandhar I, Alimadadi A, Aryal S, Munroe PB, Joe B, Cheng X. Gut microbiome-based supervised machine learning for clinical diagnosis of inflammatory bowel diseases. Am J Physiol Gastrointest Liver Physiol. 2021;320(3):328\u201337.","journal-title":"Am J Physiol Gastrointest Liver Physiol"},{"key":"5639_CR37","doi-asserted-by":"publisher","DOI":"10.3389\/fcimb.2021.645951","volume":"11","author":"Y Liu","year":"2021","unstructured":"Liu Y, Li W, Yang H, Zhang X, Wang W, Jia S, Xiang B, Wang Y, Miao L, Zhang H, et al. Leveraging 16s rrna microbiome sequencing data to identify bacterial signatures for irritable bowel syndrome. Front Cell Infect Microbiol. 2021;11: 645951.","journal-title":"Front Cell Infect Microbiol"},{"issue":"8","key":"5639_CR38","doi-asserted-by":"publisher","first-page":"2403","DOI":"10.3390\/jcm9082403","volume":"9","author":"H Fukui","year":"2020","unstructured":"Fukui H, Nishida A, Matsuda S, Kira F, Watanabe S, Kuriyama M, Kawakami K, Aikawa Y, Oda N, Arai K, et al. Usefulness of machine learning-based gut microbiome analysis for identifying patients with irritable bowels syndrome. J Clin Med. 2020;9(8):2403.","journal-title":"J Clin Med"},{"issue":"4","key":"5639_CR39","doi-asserted-by":"publisher","first-page":"242","DOI":"10.3390\/brainsci10040242","volume":"10","author":"D Pietrucci","year":"2020","unstructured":"Pietrucci D, Teofani A, Unida V, Cerroni R, Biocca S, Stefani A, Desideri A. Can gut microbiota be a good predictor for Parkinson\u2019s disease? A machine learning approach. Brain Sci. 2020;10(4):242.","journal-title":"Brain Sci"},{"key":"5639_CR40","doi-asserted-by":"publisher","DOI":"10.3389\/fonc.2021.604584","volume":"11","author":"CW Wong","year":"2021","unstructured":"Wong CW, Yost SE, Lee JS, Gillece JD, Folkerts M, Reining L, Highlander SK, Eftekhari Z, Mortimer J, Yuan Y. Analysis of gut microbiome using explainable machine learning predicts risk of diarrhea associated with tyrosine kinase inhibitor neratinib: a pilot study. Front Oncol. 2021;11: 604584.","journal-title":"Front Oncol"},{"key":"5639_CR41","doi-asserted-by":"publisher","first-page":"876","DOI":"10.18632\/oncotarget.28252","volume":"13","author":"H Liang","year":"2022","unstructured":"Liang H, Jo J-H, Zhang Z, MacGibeny MA, Han J, Proctor DM, Taylor ME, Che Y, Juneau P, Apolo AB, et al. Predicting cancer immunotherapy response from gut microbiomes using machine learning models. Oncotarget. 2022;13:876.","journal-title":"Oncotarget"},{"issue":"11","key":"5639_CR42","doi-asserted-by":"publisher","first-page":"0224365","DOI":"10.1371\/journal.pone.0224365","volume":"14","author":"A Vabalas","year":"2019","unstructured":"Vabalas A, Gowen E, Poliakoff E, Casson AJ. Machine learning algorithm validation with a limited sample size. PLoS ONE. 2019;14(11):0224365.","journal-title":"PLoS ONE"},{"issue":"12","key":"5639_CR43","doi-asserted-by":"publisher","first-page":"2639","DOI":"10.1038\/ismej.2017.119","volume":"11","author":"BJ Callahan","year":"2017","unstructured":"Callahan BJ, McMurdie PJ, Holmes SP. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. ISME J. 2017;11(12):2639\u201343.","journal-title":"ISME J"},{"issue":"4","key":"5639_CR44","doi-asserted-by":"publisher","first-page":"146","DOI":"10.3390\/bioengineering9040146","volume":"9","author":"JT Jeske","year":"2022","unstructured":"Jeske JT, Gallert C. Microbiome analysis via otu and asv-based pipelines-a comparative interpretation of ecological data in wwtp systems. Bioengineering. 2022;9(4):146.","journal-title":"Bioengineering"},{"key":"5639_CR45","doi-asserted-by":"crossref","unstructured":"Loganathan T, et al. The influence of machine learning technologies in gut microbiome research and cancer studies-a review. Life Sci. 2022;121118","DOI":"10.1016\/j.lfs.2022.121118"},{"key":"5639_CR46","doi-asserted-by":"publisher","first-page":"206","DOI":"10.1007\/s12275-020-0066-8","volume":"58","author":"J Namkung","year":"2020","unstructured":"Namkung J. Machine learning methods for microbiome studies. J Microbiol. 2020;58:206\u201316.","journal-title":"J Microbiol"},{"key":"5639_CR47","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1007881","volume-title":"Improving reproducibility in computational biology research","author":"JA Papin","year":"2020","unstructured":"Papin JA, Mac Gabhann F, Sauro HM, Nickerson D, Rampadarath A. Improving reproducibility in computational biology research. San Francisco: Public Library of Science; 2020."},{"key":"5639_CR48","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12859-019-3050-8","volume":"20","author":"A Lopez-Rincon","year":"2019","unstructured":"Lopez-Rincon A, Martinez-Archundia M, Martinez-Ruiz GU, Schoenhuth A, Tonda A. Automatic discovery of 100-mirna signature for cancer classification using ensemble feature selection. BMC Bioinform. 2019;20:1\u201317.","journal-title":"BMC Bioinform"},{"issue":"2","key":"5639_CR49","doi-asserted-by":"publisher","first-page":"00193","DOI":"10.1128\/mSystems.00193-20","volume":"6","author":"MM David","year":"2021","unstructured":"David MM, Tataru C, Daniels J, Schwartz J, Keating J, Hampton-Marcell J, Gottel N, Gilbert JA, Wall DP. Children with autism and their typically developing siblings differ in amplicon sequence variants and predicted functions of stool-associated microbes. Msystems. 2021;6(2):00193\u201320.","journal-title":"Msystems"},{"issue":"4","key":"5639_CR50","first-page":"203","volume":"19","author":"A-M \u0160imundi\u0107","year":"2009","unstructured":"\u0160imundi\u0107 A-M. Measures of diagnostic accuracy: basic definitions. ejifcc. 2009;19(4):203.","journal-title":"ejifcc"},{"issue":"24","key":"5639_CR51","doi-asserted-by":"publisher","first-page":"5916","DOI":"10.1016\/j.cell.2021.10.015","volume":"184","author":"CX Yap","year":"2021","unstructured":"Yap CX, Henders AK, Alvares GA, Wood DL, Krause L, Tyson GW, Restuadi R, Wallace L, McLaren T, Hansell NK, et al. Autism-related dietary preferences mediate autism-gut microbiome associations. Cell. 2021;184(24):5916\u201331.","journal-title":"Cell"},{"issue":"2","key":"5639_CR52","doi-asserted-by":"publisher","first-page":"398","DOI":"10.4049\/jimmunol.2001296","volume":"207","author":"C-S Hsieh","year":"2021","unstructured":"Hsieh C-S, Rengarajan S, Kau A, Tarazona-Meza C, Nicholson A, Checkley W, Romero K, Hansel NN. Altered iga response to gut bacteria is associated with childhood asthma in peru. J Immunol. 2021;207(2):398\u2013407.","journal-title":"J Immunol"},{"key":"5639_CR53","doi-asserted-by":"crossref","unstructured":"Chun Y, Do A, Grishina G, Grishin A, Fang G, Rose S, Spencer C, Vicencio A, Schadt E, Bunyavanich S. Integrative study of the upper and lower airway microbiome and transcriptome in asthma. JCI insight. 2020;5(5).","DOI":"10.1172\/jci.insight.133707"},{"issue":"3","key":"5639_CR54","doi-asserted-by":"publisher","first-page":"683","DOI":"10.3390\/microorganisms11030683","volume":"11","author":"M P\u00e9rez-Losada","year":"2023","unstructured":"P\u00e9rez-Losada M, Castro-Nallar E, Laerte Boechat J, Delgado L, Azenha Rama T, Berrios-Far\u00edas V, Oliveira M. Nasal bacteriomes of patients with asthma and allergic rhinitis show unique composition, structure, function and interactions. Microorganisms. 2023;11(3):683.","journal-title":"Microorganisms"},{"issue":"6","key":"5639_CR55","doi-asserted-by":"publisher","first-page":"1954","DOI":"10.1093\/bib\/bbz105","volume":"21","author":"Y Wang","year":"2020","unstructured":"Wang Y, L\u00eaCao K-A. Managing batch effects in microbiome data. Brief Bioinform. 2020;21(6):1954\u201370.","journal-title":"Brief Bioinform"},{"key":"5639_CR56","doi-asserted-by":"crossref","unstructured":"Rincon AL, Kraneveld AD, Tonda A. Batch correction of genomic data in chronic fatigue syndrome using cma-es. In: Proceedings of the 2020 genetic and evolutionary computation conference companion, 2020;277\u2013278","DOI":"10.1145\/3377929.3389947"},{"issue":"7","key":"5639_CR57","doi-asserted-by":"publisher","first-page":"1785","DOI":"10.3390\/cancers12071785","volume":"12","author":"A Lopez-Rincon","year":"2020","unstructured":"Lopez-Rincon A, Mendoza-Maldonado L, Martinez-Archundia M, Sch\u00f6nhuth A, Kraneveld AD, Garssen J, Tonda A. Machine learning-based ensemble recursive feature selection of circulating mirnas for cancer tumor classification. Cancers. 2020;12(7):1785.","journal-title":"Cancers"},{"issue":"1","key":"5639_CR58","doi-asserted-by":"publisher","first-page":"4541","DOI":"10.1038\/s41598-021-83660-9","volume":"11","author":"PI Metselaar","year":"2021","unstructured":"Metselaar PI, Mendoza-Maldonado L, Li Yim AYF, Abarkan I, Henneman P, Te Velde AA, Sch\u00f6nhuth A, Bosch JA, Kraneveld AD, Lopez-Rincon A. Recursive ensemble feature selection provides a robust mrna expression signature for myalgic encephalomyelitis\/chronic fatigue syndrome. Sci Rep. 2021;11(1):4541.","journal-title":"Sci Rep"},{"key":"5639_CR59","doi-asserted-by":"publisher","DOI":"10.1016\/j.phanu.2022.100326","volume":"23","author":"K Kamphorst","year":"2023","unstructured":"Kamphorst K, Lopez-Rincon A, Vlieger AM, Garssen J, van\u2019t Riet E, van Elburg RM. Predictive factors for allergy at 4\u20136 years of age based on machine learning: a pilot study. PharmaNutrition. 2023;23: 100326.","journal-title":"PharmaNutrition"},{"key":"5639_CR60","doi-asserted-by":"publisher","DOI":"10.3389\/fimmu.2021.685742","volume":"12","author":"M Benner","year":"2021","unstructured":"Benner M, Lopez-Rincon A, Thijssen S, Garssen J, Ferwerda G, Joosten I, van der Molen RG, Hogenkamp A. Antibiotic intervention affects maternal immunity during gestation in mice. Front Immunol. 2021;12: 685742.","journal-title":"Front Immunol"},{"issue":"2","key":"5639_CR61","doi-asserted-by":"publisher","first-page":"13919","DOI":"10.1111\/pai.13919","volume":"34","author":"JM Blankestijn","year":"2023","unstructured":"Blankestijn JM, Lopez-Rincon A, Neerincx AH, Vijverberg SJ, Hashimoto S, Gorenjak M, Sard\u00f3n Prado O, Corcuera-Elosegui P, Korta-Murua J, Pino-Yanes M, et al. Classifying asthma control using salivary and fecal bacterial microbiome in children with moderate-to-severe asthma. Pediatr Allergy Immunol. 2023;34(2):13919.","journal-title":"Pediatr Allergy Immunol"},{"key":"5639_CR62","doi-asserted-by":"crossref","unstructured":"Rojas-Velazquez D, Tonda A, Rodriguez-Guerra I, Kraneveld AD, Lopez-Rincon A. Multi-objective evolutionary discretization of gene expression profiles: application to covid-19 severity prediction. In: International conference on the applications of evolutionary computation (part of EvoStar), 2023;703\u2013717","DOI":"10.1007\/978-3-031-30229-9_45"},{"key":"5639_CR63","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825\u201330.","journal-title":"J Mach Learn Res"},{"issue":"1","key":"5639_CR64","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12864-019-6413-7","volume":"21","author":"D Chicco","year":"2020","unstructured":"Chicco D, Jurman G. The advantages of the Matthews correlation coefficient (mcc) over f1 score and accuracy in binary classification evaluation. BMC Genom. 2020;21(1):1\u201313.","journal-title":"BMC Genom"},{"key":"5639_CR65","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1016\/j.jpsychires.2020.06.032","volume":"129","author":"X Ding","year":"2020","unstructured":"Ding X, Xu Y, Zhang X, Zhang L, Duan G, Song C, Li Z, Yang Y, Wang Y, Wang X, et al. Gut microbiota changes in patients with autism spectrum disorders. J Psychiatr Res. 2020;129:149\u201359.","journal-title":"J Psychiatr Res"},{"issue":"9","key":"5639_CR66","doi-asserted-by":"publisher","first-page":"1614","DOI":"10.1002\/aur.2358","volume":"13","author":"R Zou","year":"2020","unstructured":"Zou R, Xu F, Wang Y, Duan M, Guo M, Zhang Q, Zhao H, Zheng H. Changes in the gut microbiota of children with autism spectrum disorder. Autism Res. 2020;13(9):1614\u201325.","journal-title":"Autism Res"},{"issue":"7681","key":"5639_CR67","doi-asserted-by":"publisher","first-page":"507","DOI":"10.1038\/nature24460","volume":"551","author":"D Vandeputte","year":"2017","unstructured":"Vandeputte D, Kathagen G, D\u2019hoe K, Vieira-Silva S, Valles-Colomer M, Sabino J, Wang J, Tito RY, De Commer L, Darzi Y, et al. Quantitative microbiome profiling links gut community variation to microbial load. Nature. 2017;551(7681):507\u201311.","journal-title":"Nature"},{"key":"5639_CR68","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1016\/j.ebiom.2019.09.008","volume":"48","author":"T Yamada","year":"2019","unstructured":"Yamada T, Hino S, Iijima H, Genda T, Aoki R, Nagata R, Han K-H, Hirota M, Kinashi Y, Oguchi H, et al. Mucin o-glycans facilitate symbiosynthesis to maintain gut immune homeostasis. EBioMedicine. 2019;48:513\u201325.","journal-title":"EBioMedicine"},{"key":"5639_CR69","doi-asserted-by":"crossref","unstructured":"Imai J, Ichikawa H, Kitamoto S, Golob JL, Kaneko M, Nagata J, Takahashi M, Gillilland MG III, Tanaka R, Nagao-Kitamoto H, et al. A potential pathogenic association between periodontal disease and Crohn\u2019s disease. JCI Insight. 2021;6(23).","DOI":"10.1172\/jci.insight.148543"},{"issue":"1","key":"5639_CR70","doi-asserted-by":"publisher","first-page":"54","DOI":"10.2337\/dc16-1324","volume":"40","author":"J De La Cuesta-Zuluaga","year":"2017","unstructured":"De La Cuesta-Zuluaga J, Mueller NT, Corrales-Agudelo V, Vel\u00e1squez-Mej\u00eda EP, Carmona JA, Abad JM, Escobar JS. Metformin is associated with higher relative abundance of mucin-degrading akkermansia muciniphila and several short-chain fatty acid-producing microbiota in the gut. Diabetes Care. 2017;40(1):54\u201362.","journal-title":"Diabetes Care"},{"issue":"12","key":"5639_CR71","doi-asserted-by":"publisher","first-page":"0226372","DOI":"10.1371\/journal.pone.0226372","volume":"14","author":"A Ahmad","year":"2019","unstructured":"Ahmad A, Yang W, Chen G, Shafiq M, Javed S, Ali Zaidi SS, Shahid R, Liu C, Bokhari H. Analysis of gut microbiota of obese individuals with type 2 diabetes and healthy individuals. PLoS ONE. 2019;14(12):0226372.","journal-title":"PLoS ONE"},{"issue":"1","key":"5639_CR72","doi-asserted-by":"publisher","first-page":"2142009","DOI":"10.1080\/19490976.2022.2142009","volume":"14","author":"A Saleem","year":"2022","unstructured":"Saleem A, Ikram A, Dikareva E, Lahtinen E, Matharu D, Pajari A-M, de Vos WM, Hasan F, Salonen A, Jian C. Unique pakistani gut microbiota highlights population-specific microbiota signatures of type 2 diabetes mellitus. Gut Microbes. 2022;14(1):2142009.","journal-title":"Gut Microbes"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-024-05639-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-024-05639-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-024-05639-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,1,16]],"date-time":"2024-01-16T22:04:04Z","timestamp":1705442644000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-024-05639-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,15]]},"references-count":72,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["5639"],"URL":"https:\/\/doi.org\/10.1186\/s12859-024-05639-3","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-3699085\/v1","asserted-by":"object"}]},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1,15]]},"assertion":[{"value":"3 December 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 January 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 January 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"26"}}