{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T02:29:12Z","timestamp":1773109752876,"version":"3.50.1"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2019,11,15]],"date-time":"2019-11-15T00:00:00Z","timestamp":1573776000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2019,11,15]],"date-time":"2019-11-15T00:00:00Z","timestamp":1573776000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100003478","name":"Ministry of Health, Labour and Welfare","doi-asserted-by":"publisher","award":["15654110"],"award-info":[{"award-number":["15654110"]}],"id":[{"id":"10.13039\/501100003478","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100009619","name":"Japan Agency for Medical Research and Development","doi-asserted-by":"publisher","award":["17ek0210078h0002"],"award-info":[{"award-number":["17ek0210078h0002"]}],"id":[{"id":"10.13039\/100009619","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2019,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>To increase the accuracy of microbiome data analysis, solving the technical limitations of the existing sequencing machines is required. Quality trimming is suggested to reduce the effect of the progressive decrease in sequencing quality with the increased length of the sequenced library. In this study, we examined the effect of the trimming thresholds (0\u201320 for QIIME1 and 0\u201330 for QIIME2) on the number of reads that remained after the quality control and chimera removal (the good reads). We also examined the distance of the analysis results to the gold standard using simulated samples.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Quality trimming increased the number of good reads and abundance measurement accuracy in Illumina paired-end reads of the V3-V4 hypervariable region.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>Our results suggest that the pre-analysis trimming step should be included before the application of QIIME1 or QIIME2.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12859-019-3187-5","type":"journal-article","created":{"date-parts":[[2019,11,15]],"date-time":"2019-11-15T17:02:47Z","timestamp":1573837367000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":87,"title":["Impact of quality trimming on the efficiency of reads joining and diversity analysis of Illumina paired-end reads in the context of QIIME1 and QIIME2 microbiome analysis frameworks"],"prefix":"10.1186","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0690-8012","authenticated-orcid":false,"given":"Attayeb","family":"Mohsen","sequence":"first","affiliation":[]},{"given":"Jonguk","family":"Park","sequence":"additional","affiliation":[]},{"given":"Yi-An","family":"Chen","sequence":"additional","affiliation":[]},{"given":"Hitoshi","family":"Kawashima","sequence":"additional","affiliation":[]},{"given":"Kenji","family":"Mizuguchi","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2019,11,15]]},"reference":[{"key":"3187_CR1","doi-asserted-by":"publisher","first-page":"3720","DOI":"10.3390\/ijms19123720","volume":"19","author":"K Ganesan","year":"2018","unstructured":"Ganesan K, Chung SK, Vanamala J, Xu B. Causal relationship between diet-induced gut microbiota changes and diabetes: a novel strategy to transplant Faecalibacterium prausnitzii in preventing diabetes. Int J Mol Sci. 2018;19:3720.","journal-title":"Int J Mol Sci"},{"key":"3187_CR2","doi-asserted-by":"publisher","first-page":"1704","DOI":"10.1101\/gr.151803.112","volume":"23","author":"CA Lozupone","year":"2013","unstructured":"Lozupone CA, Stombaugh J, Gonzalez A, Ackermann G, Wendel D, V\u00e1zquez-Baeza Y, et al. Meta-analyses of studies of the human microbiota. Genome Res. 2013;23:1704\u201314.","journal-title":"Genome Res"},{"key":"3187_CR3","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1097\/MOG.0000000000000139","volume":"31","author":"AB Shreiner","year":"2015","unstructured":"Shreiner AB, Kao JY, Young VB. The gut microbiome in health and in disease. Curr Opin Gastroenterol. 2015;31:69\u201375.","journal-title":"Curr Opin Gastroenterol"},{"key":"3187_CR4","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1038\/nature24621","volume":"551","author":"LR Thompson","year":"2017","unstructured":"Thompson LR, Sanders JG, McDonald D, Amir A, Ladau J, Locey KJ, et al. A communal catalogue reveals Earth\u2019s multiscale microbial diversity. Nature. 2017;551:457\u201363.","journal-title":"Nature."},{"key":"3187_CR5","doi-asserted-by":"publisher","first-page":"4151","DOI":"10.1128\/JB.00345-12","volume":"194","author":"EJ Stewart","year":"2012","unstructured":"Stewart EJ. Growing Unculturable Bacteria. J Bacteriol. 2012;194:4151\u201360.","journal-title":"J Bacteriol"},{"key":"3187_CR6","doi-asserted-by":"publisher","first-page":"198","DOI":"10.1038\/nature09796","volume":"470","author":"ER Mardis","year":"2011","unstructured":"Mardis ER. A decade\u2019s perspective on DNA sequencing technology. Nature. 2011;470:198\u2013203.","journal-title":"Nature."},{"key":"3187_CR7","doi-asserted-by":"publisher","first-page":"927","DOI":"10.1038\/nature03062","volume":"431","author":"X She","year":"2004","unstructured":"She X, Jiang Z, Clark RA, Liu G, Cheng Z, Tuzun E, et al. Shotgun sequence assembly and recent segmental duplications within the human genome. Nature. 2004;431:927.","journal-title":"Nature."},{"key":"3187_CR8","doi-asserted-by":"publisher","first-page":"209","DOI":"10.3389\/fpls.2014.00209","volume":"5","author":"TJ Sharpton","year":"2014","unstructured":"Sharpton TJ. An introduction to the analysis of shotgun metagenomic data. Front Plant Sci. 2014;5:209.","journal-title":"Front Plant Sci"},{"key":"3187_CR9","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1038\/nature11234","volume":"486","author":"C Huttenhower","year":"2012","unstructured":"The Human Microbiome Project Consortium, Huttenhower C, Gevers D, Knight R, Abubucker S, Badger JH, et al. Structure, function and diversity of the healthy human microbiome. Nature. 2012;486:207\u201314.","journal-title":"Nature"},{"key":"3187_CR10","doi-asserted-by":"publisher","first-page":"767","DOI":"10.3389\/fmicb.2018.00767","volume":"9","author":"M-A Osman","year":"2018","unstructured":"Osman M-A, Neoh H-M, Ab Mutalib N-S, Chin S-F, Jamal R. 16S rRNA gene sequencing for deciphering the colorectal Cancer gut microbiome: current protocols and workflows. Front Microbiol. 2018;9:767.","journal-title":"Front Microbiol"},{"key":"3187_CR11","doi-asserted-by":"publisher","first-page":"485","DOI":"10.1186\/1471-2105-11-485","volume":"11","author":"MP Cox","year":"2010","unstructured":"Cox MP, Peterson DA, Biggs PJ. SolexaQA: at-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics. 2010;11:485.","journal-title":"BMC Bioinformatics"},{"key":"3187_CR12","doi-asserted-by":"publisher","first-page":"335","DOI":"10.1038\/nmeth.f.303","volume":"7","author":"JG Caporaso","year":"2010","unstructured":"Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, et al. QIIME allows analysis of high-throughput community sequencing data. Nat Methods. 2010;7:335\u20136.","journal-title":"Nat Methods"},{"key":"3187_CR13","unstructured":"QIIME. http:\/\/qiime.org\/. Accessed 19 Mar 2019."},{"key":"3187_CR14","unstructured":"QIIME 2. https:\/\/qiime2.org\/. Accessed 19 Mar 2019."},{"key":"3187_CR15","doi-asserted-by":"publisher","first-page":"2460","DOI":"10.1093\/bioinformatics\/btq461","volume":"26","author":"RC Edgar","year":"2010","unstructured":"Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460\u20131.","journal-title":"Bioinformatics."},{"issue":"1","key":"3187_CR16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.2174\/1875036201307010001","volume":"7","author":"Erik Aronesty","year":"2013","unstructured":"Aronesty E. Comparison of sequencing utility programs. The Open Bioinformatics Journal. 2013;7.","journal-title":"The Open Bioinformatics Journal"},{"key":"3187_CR17","unstructured":"Genomics EA. ea-utils. C++. 2019. https:\/\/github.com\/ExpressionAnalysis\/ea-utils. Accessed 19 Mar 2019."},{"key":"3187_CR18","unstructured":"Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat Biotechnol. 2019;37:852\u20137.\u00a0"},{"key":"3187_CR19","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0085024","volume":"8","author":"CD Fabbro","year":"2013","unstructured":"Fabbro CD, Scalabrin S, Morgante M, Giorgi FM. An extensive evaluation of read trimming effects on Illumina NGS data analysis. PLoS One. 2013;8:e85024.","journal-title":"PLoS One"},{"key":"3187_CR20","doi-asserted-by":"publisher","first-page":"e61217","DOI":"10.1371\/journal.pone.0061217","volume":"8","author":"PJ McMurdie","year":"2013","unstructured":"McMurdie PJ, Holmes S. phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data. PLOS ONE. 2013;8:e61217.","journal-title":"PLOS ONE"},{"key":"3187_CR21","unstructured":"NCBI-SRA (Sequence Read Archive). https:\/\/www.ncbi.nlm.nih.gov\/sra. Accessed 19 Mar 2019."},{"key":"3187_CR22","doi-asserted-by":"publisher","first-page":"581","DOI":"10.1038\/nmeth.3869","volume":"13","author":"BJ Callahan","year":"2016","unstructured":"Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. DADA2: high-resolution sample inference from Illumina amplicon data. Nat Methods. 2016;13:581\u20133.","journal-title":"Nat Methods"},{"key":"3187_CR23","doi-asserted-by":"publisher","first-page":"2639","DOI":"10.1038\/ismej.2017.119","volume":"11","author":"BJ Callahan","year":"2017","unstructured":"Callahan BJ, McMurdie PJ, Holmes SP. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. ISME J. 2017;11:2639\u201343.","journal-title":"ISME J"},{"key":"3187_CR24","doi-asserted-by":"publisher","first-page":"1558","DOI":"10.1038\/s41467-017-01544-x","volume":"8","author":"M Kleiner","year":"2017","unstructured":"Kleiner M, Thorson E, Sharp CE, Dong X, Liu D, Li C, et al. Assessing species biomass contributions in microbial communities via metaproteomics. Nat Commun. 2017;8:1558.","journal-title":"Nat Commun"},{"key":"3187_CR25","doi-asserted-by":"publisher","unstructured":"Mohsen A, Park J, Kawashima H, Chen Y-A, Natsume-Kitatani Y. Mizuguchi K. Auto-q Qiime Analysis Automating Script. 2018. https:\/\/doi.org\/10.5281\/zenodo.1439555.","DOI":"10.5281\/zenodo.1439555"},{"issue":"1","key":"3187_CR26","doi-asserted-by":"publisher","first-page":"e1","DOI":"10.1093\/nar\/gks808","volume":"41","author":"Anna Klindworth","year":"2012","unstructured":"Klindworth A, Pruesse E, Schweer T, Peplies J, Quast C, Horn M, et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res 2013;41:e1\u2013e1.","journal-title":"Nucleic Acids Research"},{"key":"3187_CR27","doi-asserted-by":"publisher","first-page":"1422","DOI":"10.1093\/bioinformatics\/btp163","volume":"25","author":"PJA Cock","year":"2009","unstructured":"Cock PJA, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009;25:1422\u20133.","journal-title":"Bioinformatics."},{"key":"3187_CR28","unstructured":"Oliphant TE. A guide to NumPy: Trelgol Publishing USA; 2006."},{"key":"3187_CR29","unstructured":"BBMap Guide. DOE Joint Genome Institute. https:\/\/jgi.doe.gov\/data-and-tools\/bbtools\/bb-tools-user-guide\/bbmap-guide\/. Accessed 19 Mar 2019."},{"key":"3187_CR30","doi-asserted-by":"publisher","first-page":"D590","DOI":"10.1093\/nar\/gks1219","volume":"41","author":"C Quast","year":"2013","unstructured":"Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013;41:D590\u20136.","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"3187_CR31","doi-asserted-by":"publisher","first-page":"D643","DOI":"10.1093\/nar\/gkt1209","volume":"42","author":"P Yilmaz","year":"2014","unstructured":"Yilmaz P, Parfrey LW, Yarza P, Gerken J, Pruesse E, Quast C, et al. The SILVA and \u201call-species living tree project (LTP)\u201d taxonomic frameworks. Nucleic Acids Res. 2014;42(Database issue):D643\u20138.","journal-title":"Nucleic Acids Res"},{"key":"3187_CR32","doi-asserted-by":"publisher","first-page":"1492","DOI":"10.12688\/f1000research.8986.1","volume":"5","author":"Ben J. Callahan","year":"2016","unstructured":"Callahan BJ, Sankaran K, Fukuyama JA, McMurdie PJ, Holmes SP. Bioconductor workflow for microbiome data analysis: from raw reads to community analyses. F1000Res. 2016;5. doi:https:\/\/doi.org\/10.12688\/f1000research.8986.1.","journal-title":"F1000Research"},{"key":"3187_CR33","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v086.i01","volume":"86","author":"S Bougeard","year":"2018","unstructured":"Bougeard S, Dray S. Supervised multiblock analysis in R with the ade4 package. J Stat Softw. 2018;86:1\u201317.","journal-title":"J Stat Softw"},{"key":"3187_CR34","unstructured":"Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D, et al. vegan: Community Ecology Package. 2019. https:\/\/CRAN.R-project.org\/package=vegan."},{"key":"3187_CR35","doi-asserted-by":"crossref","unstructured":"Wickham H. Ggplot2 elegant graphics for data analysis. Dordrecht; New York: Springer; 2009. http:\/\/public.eblib.com\/EBLPublic\/PublicView.do?ptiID=511468. Accessed 3 Dec 2012.","DOI":"10.1007\/978-0-387-98141-3"},{"key":"3187_CR36","doi-asserted-by":"crossref","unstructured":"McKinney W. Data Structures for Statistical Computing in Python. In: Walt S van der, Millman J, editors. Proceedings of the 9th Python in Science Conference. 2010. p. 51\u20136.","DOI":"10.25080\/Majora-92bf1922-00a"},{"key":"3187_CR37","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1109\/MCSE.2007.55","volume":"9","author":"JD Hunter","year":"2007","unstructured":"Hunter JD. Matplotlib: a 2D graphics environment. Computing in Science & Engineering. 2007;9:90\u20135.","journal-title":"Computing in Science & Engineering"},{"key":"3187_CR38","unstructured":"SRA Links for BioProject (Select 382861) - SRA - NCBI. https:\/\/www.ncbi.nlm.nih.gov\/sra?linkname=bioproject_sra_all&from_uid=382861. Accessed 19 Mar 2019."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3187-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12859-019-3187-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3187-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,23]],"date-time":"2023-09-23T00:55:16Z","timestamp":1695430516000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-019-3187-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,15]]},"references-count":38,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,12]]}},"alternative-id":["3187"],"URL":"https:\/\/doi.org\/10.1186\/s12859-019-3187-5","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,15]]},"assertion":[{"value":"2 April 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 November 2019","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 November 2019","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"581"}}