{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:40Z","timestamp":1772138080951,"version":"3.50.1"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":500,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Motivation: Deep sequencing of clinical samples is now an established tool for the detection of infectious pathogens, with direct medical applications. The large amount of data generated produces an opportunity to detect species even at very low levels, provided that computational tools can effectively profile the relevant metagenomic communities. Data interpretation is complicated by the fact that short sequencing reads can match multiple organisms and by the lack of completeness of existing databases, in particular for viral pathogens. Here we present metaMix, a Bayesian mixture model framework for resolving complex metagenomic mixtures. We show that the use of parallel Monte Carlo Markov chains for the exploration of the species space enables the identification of the set of species most likely to contribute to the mixture.<\/jats:p>\n                  <jats:p>Results: We demonstrate the greater accuracy of metaMix compared with relevant methods, particularly for profiling complex communities consisting of several related species. We designed metaMix specifically for the analysis of deep transcriptome sequencing datasets, with a focus on viral pathogen detection; however, the principles are generally applicable to all types of metagenomic mixtures.<\/jats:p>\n                  <jats:p>Availability and implementation: metaMix is implemented as a user friendly R package, freely available on CRAN: http:\/\/cran.r-project.org\/web\/packages\/metaMix<\/jats:p>\n                  <jats:p>Contact: \u00a0sofia.morfopoulou.10@ucl.ac.uk<\/jats:p>\n                  <jats:p>Supplementary information: \u00a0Supplementary data are available at Bionformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv317","type":"journal-article","created":{"date-parts":[[2015,5,22]],"date-time":"2015-05-22T20:50:24Z","timestamp":1432327824000},"page":"2930-2938","source":"Crossref","is-referenced-by-count":37,"title":["Bayesian mixture analysis for metagenomic community profiling"],"prefix":"10.1093","volume":"31","author":[{"given":"Sofia","family":"Morfopoulou","sequence":"first","affiliation":[{"name":"UCL Genetics Institute, University College London, London WC1E 6BT, UK"}]},{"given":"Vincent","family":"Plagnol","sequence":"additional","affiliation":[{"name":"UCL Genetics Institute, University College London, London WC1E 6BT, UK"}]}],"member":"286","published-online":{"date-parts":[[2015,5,21]]},"reference":[{"key":"2023020202225517600_btv317-B1","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol."},{"key":"2023020202225517600_btv317-B2","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.jcv.2013.03.003","article-title":"Next-generation sequencing technologies in diagnostic virology","volume":"58","author":"Barzon","year":"2013","journal-title":"J. Clin. Virol."},{"key":"2023020202225517600_btv317-B3","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1038\/nmeth.1358","article-title":"Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models","volume":"6","author":"Brady","year":"2009","journal-title":"Nat. Methods"},{"key":"2023020202225517600_btv317-B4","doi-asserted-by":"crossref","first-page":"881","DOI":"10.1093\/cid\/ciu940","article-title":"Astrovirus VA1\/HMO-C: an increasingly recognised neurotropic pathogen in immunocompromised patients","volume":"60","author":"Brown","year":"2014","journal-title":"Clin. Infect. Dis."},{"key":"2023020202225517600_btv317-B5","doi-asserted-by":"crossref","first-page":"468","DOI":"10.1016\/j.mib.2013.05.001","article-title":"Viral pathogen discovery","volume":"16","author":"Chiu","year":"2013","journal-title":"Curr. Opin. Microbiol."},{"key":"2023020202225517600_btv317-B6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the EM algorithm","volume":"39","author":"Dempster","year":"1977","journal-title":"J. R. Stat. Soc.."},{"key":"2023020202225517600_btv317-B7","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1111\/j.2517-6161.1994.tb01985.x","article-title":"Estimation of finite mixture distributions through Bayesian sampling","volume":"56","author":"Diebolt","year":"1994","journal-title":"J. R. Stat. Soc. Ser. B Methodol."},{"key":"2023020202225517600_btv317-B8","doi-asserted-by":"crossref","first-page":"646","DOI":"10.1093\/bib\/bbs031","article-title":"Taxonomic binning of metagenome samples generated by next-generation sequencing technologies","volume":"13","author":"Dr\u00f6ge","year":"2012","journal-title":"Brief. Bioinform."},{"key":"2023020202225517600_btv317-B9","doi-asserted-by":"crossref","first-page":"3910","DOI":"10.1039\/b509983h","article-title":"Parallel tempering: theory, applications, and new perspectives","volume":"7","author":"Earl","year":"2005","journal-title":"Phys. Chem. Chem. Phys."},{"key":"2023020202225517600_btv317-B10","doi-asserted-by":"crossref","first-page":"162","DOI":"10.1016\/j.virol.2012.09.025","article-title":"Computational tools for viral metagenomics and their application in clinical research","volume":"434","author":"Fancello","year":"2012","journal-title":"Virology"},{"key":"2023020202225517600_btv317-B11","doi-asserted-by":"crossref","first-page":"1721","DOI":"10.1101\/gr.150151.112","article-title":"Pathoscope: species identification and strain attribution with unassembled sequencing data","volume":"23","author":"Francis","year":"2013","journal-title":"Genome Research"},{"key":"2023020202225517600_btv317-B12","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1080\/00401706.1995.10484303","article-title":"Weighted average importance sampling and defensive mixture distributions","volume":"37","author":"Hesterberg","year":"1995","journal-title":"Technometrics"},{"key":"2023020202225517600_btv317-B13","first-page":"382","article-title":"Bayesian model averaging: a tutorial","volume":"14","author":"Hoeting","year":"1999","journal-title":"Stat. Sci."},{"key":"2023020202225517600_btv317-B14","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1101\/gr.5969107","article-title":"MEGAN analysis of metagenomic data","volume":"17","author":"Huson","year":"2007","journal-title":"Genome Res."},{"key":"2023020202225517600_btv317-B15","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1007\/s11222-007-9028-9","article-title":"On population-based simulation for static inference","volume":"17","author":"Jasra","year":"2007","journal-title":"Stat. Comput."},{"key":"2023020202225517600_btv317-B16","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1128\/MMBR.00009-08","article-title":"A bioinformatician\u2019s guide to metagenomics","volume":"72","author":"Kunin","year":"2008","journal-title":"Microbiol. Mol. Biol. Rev."},{"key":"2023020202225517600_btv317-B17","doi-asserted-by":"crossref","first-page":"e111","DOI":"10.1093\/nar\/gks335","article-title":"Rapid identification of high-confidence taxonomic assignments for metagenomic data","volume":"40","author":"MacDonald","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2023020202225517600_btv317-B18","first-page":"223","article-title":"Bayesian modelling and inference on mixtures of distributions","volume-title":"Handbook of Statistics","author":"Marin","year":"2005"},{"key":"2023020202225517600_btv317-B19","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1038\/nmeth976","article-title":"Accurate phylogenetic classification of variable-length DNA fragments","volume":"4","author":"McHardy","year":"2007","journal-title":"Nat. Methods"},{"key":"2023020202225517600_btv317-B20","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1056\/NEJMoa1203378","article-title":"A new phlebovirus associated with severe febrile illness in Missouri","volume":"367","author":"McMullan","year":"2012","journal-title":"N. Engl. J. Med."},{"key":"2023020202225517600_btv317-B21","doi-asserted-by":"crossref","first-page":"1616","DOI":"10.1101\/gr.122705.111","article-title":"The human gut virome: inter-individual variation and dynamic response to diet","volume":"21","author":"Minot","year":"2011","journal-title":"Genome Res."},{"key":"2023020202225517600_btv317-B22","doi-asserted-by":"crossref","first-page":"e1003987","DOI":"10.1371\/journal.pgen.1003987","article-title":"Expanding the marine virosphere using metagenomics","volume":"9","author":"Mizuno","year":"2013","journal-title":"PLoS Genet."},{"key":"2023020202225517600_btv317-B23","doi-asserted-by":"crossref","first-page":"e1002304","DOI":"10.1371\/journal.ppat.1002304","article-title":"Discovery of an ebolavirus-like filovirus in europe","volume":"7","author":"Negredo","year":"2011","journal-title":"PLoS Pathogens"},{"key":"2023020202225517600_btv317-B24","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nature08821","article-title":"A human gut microbial gene catalogue established by metagenomic sequencing","volume":"464","author":"Qin","year":"2010","journal-title":"Nature"},{"key":"2023020202225517600_btv317-B25","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1186\/1471-2164-13-341","article-title":"A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers","volume":"13","author":"Quail","year":"2012","journal-title":"BMC Genomics"},{"key":"2023020202225517600_btv317-B26","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1186\/s12915-014-0087-z","article-title":"Reagent and laboratory contamination can critically impact sequence-based microbiome analyses","volume":"12","author":"Salter","year":"2014","journal-title":"BMC Biol."},{"key":"2023020202225517600_btv317-B27","doi-asserted-by":"crossref","first-page":"863","DOI":"10.1093\/bioinformatics\/btr026","article-title":"Quality control and preprocessing of metagenomic datasets","volume":"27","author":"Schmieder","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020202225517600_btv317-B28","doi-asserted-by":"crossref","first-page":"e7370","DOI":"10.1371\/journal.pone.0007370","article-title":"Metagenomic analysis of respiratory tract DNA viral communities in cystic fibrosis and non-cystic fibrosis individuals","volume":"4","author":"Willner","year":"2009","journal-title":"PLoS One"},{"key":"2023020202225517600_btv317-B29","doi-asserted-by":"crossref","first-page":"e27992","DOI":"10.1371\/journal.pone.0027992","article-title":"Accurate genome relative abundance estimation based on shotgun metagenomic reads","volume":"6","author":"Xia","year":"2011","journal-title":"PLoS One"},{"key":"2023020202225517600_btv317-B30","doi-asserted-by":"crossref","first-page":"821","DOI":"10.1101\/gr.074492.107","article-title":"Velvet: algorithms for de novo short read assembly using de Bruijn graphs","volume":"18","author":"Zerbino","year":"2008","journal-title":"Genome Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/18\/2930\/49034853\/bioinformatics_31_18_2930.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/18\/2930\/49034853\/bioinformatics_31_18_2930.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,9]],"date-time":"2024-06-09T01:22:49Z","timestamp":1717896169000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/18\/2930\/241140"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,5,21]]},"references-count":30,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2015,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv317","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/007476","asserted-by":"object"}]},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,9,15]]},"published":{"date-parts":[[2015,5,21]]}}}