{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,14]],"date-time":"2026-03-14T08:27:27Z","timestamp":1773476847218,"version":"3.50.1"},"reference-count":19,"publisher":"Oxford University Press (OUP)","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Compared with composition-based binning algorithms, the binning accuracy and specificity of alignment-based binning algorithms is significantly higher. However, being alignment-based, the latter class of algorithms require enormous amount of time and computing resources for binning huge metagenomic datasets. The motivation was to develop a binning approach that can analyze metagenomic datasets as rapidly as composition-based approaches, but nevertheless has the accuracy and specificity of alignment-based algorithms. This article describes a hybrid binning approach (SPHINX) that achieves high binning efficiency by utilizing the principles of both \u2018composition\u2019- and \u2018alignment\u2019-based binning algorithms.<\/jats:p>\n               <jats:p>Results: Validation results with simulated sequence datasets indicate that SPHINX is able to analyze metagenomic sequences as rapidly as composition-based algorithms. Furthermore, the binning efficiency (in terms of accuracy and specificity of assignments) of SPHINX is observed to be comparable with results obtained using alignment-based algorithms.<\/jats:p>\n               <jats:p>Availability: A web server for the SPHINX algorithm is available at http:\/\/metagenomics.atc.tcs.com\/SPHINX\/.<\/jats:p>\n               <jats:p>Contact: \u00a0sharmila@atc.tcs.com<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq608","type":"journal-article","created":{"date-parts":[[2010,10,29]],"date-time":"2010-10-29T02:13:37Z","timestamp":1288318417000},"page":"22-30","source":"Crossref","is-referenced-by-count":58,"title":["SPHINX\u2014an algorithm for taxonomic binning of metagenomic sequences"],"prefix":"10.1093","volume":"27","author":[{"given":"Monzoorul Haque","family":"Mohammed","sequence":"first","affiliation":[{"name":"Bio-sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, 1 Software Units Layout, Madhapur, Hyderabad 500081, Andhra Pradesh, India"}]},{"given":"Tarini Shankar","family":"Ghosh","sequence":"additional","affiliation":[{"name":"Bio-sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, 1 Software Units Layout, Madhapur, Hyderabad 500081, Andhra Pradesh, India"}]},{"given":"Nitin Kumar","family":"Singh","sequence":"additional","affiliation":[{"name":"Bio-sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, 1 Software Units Layout, Madhapur, Hyderabad 500081, Andhra Pradesh, India"}]},{"given":"Sharmila S.","family":"Mande","sequence":"additional","affiliation":[{"name":"Bio-sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, 1 Software Units Layout, Madhapur, Hyderabad 500081, Andhra Pradesh, India"}]}],"member":"286","published-online":{"date-parts":[[2010,10,28]]},"reference":[{"key":"2023012511155879000_B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped Blast and PSIBlast: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012511155879000_B2","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1128\/mr.59.1.143-169.1995","article-title":"Phylogenetic identification and in situ detection of individual microbial cells without cultivation","volume":"59","author":"Amann","year":"1995","journal-title":"Microbiol. Rev."},{"key":"2023012511155879000_B3","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1038\/nmeth.1358","article-title":"Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models","volume":"6","author":"Brady","year":"2009","journal-title":"Nat. Methods"},{"key":"2023012511155879000_B4","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1186\/1471-2105-10-56","article-title":"TACOA: taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach","volume":"10","author":"Diaz","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012511155879000_B5","doi-asserted-by":"crossref","first-page":"629","DOI":"10.1038\/nature06810","article-title":"Functional metagenomic profiling of nine biomes","volume":"452","author":"Dinsdale","year":"2008","journal-title":"Nature"},{"key":"2023012511155879000_B6","doi-asserted-by":"crossref","first-page":"100","DOI":"10.2307\/2346830","article-title":"A K-means clustering algorithm","volume":"28","author":"Hartigan","year":"1979","journal-title":"Appl. Stat."},{"key":"2023012511155879000_B7","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-2-reviews0003","article-title":"Exploring prokaryotic diversity in the genomic era","volume":"3","author":"Hugenholtz","year":"2002","journal-title":"Genome Biol."},{"key":"2023012511155879000_B8","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1101\/gr.5969107","article-title":"MEGAN analysis of metagenomic data","volume":"17","author":"Huson","year":"2007","journal-title":"Genome Res."},{"key":"2023012511155879000_B9","doi-asserted-by":"crossref","first-page":"2230","DOI":"10.1093\/nar\/gkn038","article-title":"Phylogenetic classification of short environmental DNA fragments","volume":"36","author":"Krause","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012511155879000_B10","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature03959","article-title":"Genome sequencing in micro-fabricated high-density pico-litre reactors","volume":"437","author":"Margulies","year":"2005","journal-title":"Nature"},{"key":"2023012511155879000_B11","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1038\/nmeth976","article-title":"Accurate phylogenetic classification of variable-length DNA fragments","volume":"4","author":"McHardy","year":"2007","journal-title":"Nat. Methods"},{"key":"2023012511155879000_B12","doi-asserted-by":"crossref","first-page":"1722","DOI":"10.1093\/bioinformatics\/btp317","article-title":"SOrt-ITEMS: sequence orthology based approach for improved taxonomic estimation of metagenomic sequences","volume":"25","author":"Monzoorul","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012511155879000_B13","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1101\/gr.335003","article-title":"Evolutionary implications of microbial genome tetranucleotide frequency biases","volume":"13","author":"Pride","year":"2003","journal-title":"Genome Res."},{"key":"2023012511155879000_B14","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1146\/annurev.micro.57.030502.090759","article-title":"The uncultured microbial majority","volume":"57","author":"Rappe","year":"2003","journal-title":"Annu. Rev. Microbiol."},{"key":"2023012511155879000_B15","doi-asserted-by":"crossref","first-page":"e3373","DOI":"10.1371\/journal.pone.0003373","article-title":"MetaSim - a sequencing simulator for genomics and metagenomics","volume":"3","author":"Richter","year":"2008","journal-title":"PLoS ONE"},{"key":"2023012511155879000_B16","doi-asserted-by":"crossref","first-page":"1027","DOI":"10.1038\/nature05414","article-title":"An obesity-associated gut microbiome with increased capacity for energy harvest","volume":"444","author":"Turnbaugh","year":"2006","journal-title":"Nature"},{"key":"2023012511155879000_B17","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1126\/science.1093857","article-title":"Environmental genome shotgun sequencing of the Sargasso Sea","volume":"11","author":"Venter","year":"2004","journal-title":"Science"},{"key":"2023012511155879000_B18","doi-asserted-by":"crossref","first-page":"e1000352","DOI":"10.1371\/journal.pcbi.1000352","article-title":"Statistical methods for detecting differentially abundant features in clinical metagenomic samples","volume":"5","author":"White","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023012511155879000_B19","doi-asserted-by":"crossref","first-page":"1752","DOI":"10.1111\/j.1462-2920.2009.01901.x","article-title":"Metagenomic signatures of 86 microbial and viral metagenomes","volume":"11","author":"Willner","year":"2009","journal-title":"Env. Microbiol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/1\/22\/48861482\/bioinformatics_27_1_22.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/1\/22\/48861482\/bioinformatics_27_1_22.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T11:26:24Z","timestamp":1674645984000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/1\/22\/201666"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,10,28]]},"references-count":19,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,1,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq608","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2011,1,1]]},"published":{"date-parts":[[2010,10,28]]}}}