{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T08:14:34Z","timestamp":1760170474652},"reference-count":48,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2008,4,10]],"date-time":"2008-04-10T00:00:00Z","timestamp":1207785600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,12]]},"DOI":"10.1186\/1471-2105-9-182","type":"journal-article","created":{"date-parts":[[2008,4,10]],"date-time":"2008-04-10T18:13:53Z","timestamp":1207851233000},"source":"Crossref","is-referenced-by-count":40,"title":["Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering"],"prefix":"10.1186","volume":"9","author":[{"given":"Shibu","family":"Yooseph","sequence":"first","affiliation":[]},{"given":"Weizhong","family":"Li","sequence":"additional","affiliation":[]},{"given":"Granger","family":"Sutton","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2008,4,10]]},"reference":[{"key":"2167_CR1","unstructured":"Genome Project Statistic[ http:\/\/www.ncbi.nlm.nih.gov\/genomes\/static\/gpstat.html ]"},{"issue":"6","key":"2167_CR2","doi-asserted-by":"publisher","first-page":"459","DOI":"10.1038\/nrmicro1158","volume":"3","author":"EF DeLong","year":"2005","unstructured":"DeLong EF: Microbial community genomics in the ocean. Nat Rev Microbiol 2005, 3(6):459\u2013469. 10.1038\/nrmicro1158","journal-title":"Nat Rev Microbiol"},{"issue":"3","key":"2167_CR3","doi-asserted-by":"publisher","first-page":"e82","DOI":"10.1371\/journal.pbio.0050082","volume":"5","author":"JA Eisen","year":"2007","unstructured":"Eisen JA: Environmental Shotgun Sequencing: Its Potential and Challenges for Studying the Hidden World of Microbes. PLoS Biol 2007, 5(3):e82. 10.1371\/journal.pbio.0050082","journal-title":"PLoS Biol"},{"issue":"5721","key":"2167_CR4","doi-asserted-by":"publisher","first-page":"554","DOI":"10.1126\/science.1107851","volume":"308","author":"SG Tringe","year":"2005","unstructured":"Tringe SG, von Mering C, Kobayashi A, Salamov AA, Chen K, Chang HW, Podar M, Short JM, Mathur EJ, Detter JC, Bork P, Hugenholtz P, Rubin EM: Comparative metagenomics of microbial communities. Science 2005, 308(5721):554\u2013557. 10.1126\/science.1107851","journal-title":"Science"},{"issue":"6978","key":"2167_CR5","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1038\/nature02340","volume":"428","author":"GW Tyson","year":"2004","unstructured":"Tyson GW, Chapman J, Hugenholtz P, Allen EE, Ram RJ, Richardson PM, Solovyev VV, Rubin EM, Rokhsar DS, Banfield JF: Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature 2004, 428(6978):37\u201343. 10.1038\/nature02340","journal-title":"Nature"},{"issue":"5667","key":"2167_CR6","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1126\/science.1093857","volume":"304","author":"JC Venter","year":"2004","unstructured":"Venter JC, Remington K, Heidelberg JF, Halpern AL, Rusch D, Eisen JA, Wu D, Paulsen I, Nelson KE, Nelson W, Fouts DE, Levy S, Knap AH, Lomas MW, Nealson K, White O, Peterson J, Hoffman J, Parsons R, Baden-Tillson H, Pfannkoch C, Rogers YH, Smith HO: Environmental genome shotgun sequencing of the Sargasso Sea. Science 2004, 304(5667):66\u201374. 10.1126\/science.1093857","journal-title":"Science"},{"issue":"5760","key":"2167_CR7","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1126\/science.1120250","volume":"311","author":"EF DeLong","year":"2006","unstructured":"DeLong EF, Preston CM, Mincer T, Rich V, Hallam SJ, Frigaard NU, Martinez A, Sullivan MB, Edwards R, Brito BR, Chisholm SW, Karl DM: Community genomics among stratified microbial assemblages in the ocean's interior. Science 2006, 311(5760):496\u2013503. 10.1126\/science.1120250","journal-title":"Science"},{"issue":"3","key":"2167_CR8","doi-asserted-by":"publisher","first-page":"e77","DOI":"10.1371\/journal.pbio.0050077","volume":"5","author":"DB Rusch","year":"2007","unstructured":"Rusch DB, Halpern AL, Sutton G, Heidelberg KB, Williamson S, Yooseph S, Wu D, Eisen JA, Hoffman JM, Remington K, Beeson K, Tran B, Smith H, Baden-Tillson H, Stewart C, Thorpe J, Freeman J, Andrews-Pfannkoch C, Venter JE, Li K, Kravitz S, Heidelberg JF, Utterback T, Rogers YH, Falcon LI, Souza V, Bonilla-Rosso G, Eguiarte LE, Karl DM, Sathyendranath S, Platt T, Bermingham E, Gallardo V, Tamayo-Castillo G, Ferrari MR, Strausberg RL, Nealson K, Friedman R, Frazier M, Venter JC: The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific. PLoS Biol 2007, 5(3):e77. 10.1371\/journal.pbio.0050077","journal-title":"PLoS Biol"},{"issue":"3","key":"2167_CR9","doi-asserted-by":"publisher","first-page":"e16","DOI":"10.1371\/journal.pbio.0050016","volume":"5","author":"S Yooseph","year":"2007","unstructured":"Yooseph S, Sutton G, Rusch DB, Halpern AL, Williamson SJ, Remington K, Eisen JA, Heidelberg KB, Manning G, Li W, Jaroszewski L, Cieplak P, Miller CS, Li H, Mashiyama ST, Joachimiak MP, van Belle C, Chandonia JM, Soergel DA, Zhai Y, Natarajan K, Lee S, Raphael BJ, Bafna V, Friedman R, Brenner SE, Godzik A, Eisenberg D, Dixon JE, Taylor SS, Strausberg RL, Frazier M, Venter JC: The Sorcerer II Global Ocean Sampling Expedition: Expanding the Universe of Protein Families. PLoS Biol 2007, 5(3):e16. 10.1371\/journal.pbio.0050016","journal-title":"PLoS Biol"},{"issue":"2","key":"2167_CR10","doi-asserted-by":"publisher","first-page":"106","DOI":"10.1371\/journal.pcbi.0010024","volume":"1","author":"K Chen","year":"2005","unstructured":"Chen K, Pachter L: Bioinformatics for whole-genome shotgun sequencing of microbial communities. PLoS Comput Biol 2005, 1(2):106\u2013112. 10.1371\/journal.pcbi.0010024","journal-title":"PLoS Comput Biol"},{"issue":"6","key":"2167_CR11","doi-asserted-by":"publisher","first-page":"495","DOI":"10.1038\/nmeth1043","volume":"4","author":"K Mavromatis","year":"2007","unstructured":"Mavromatis K, Ivanova N, Barry K, Shapiro H, Goltsman E, McHardy AC, Rigoutsos I, Salamov A, Korzeniewski F, Land M, Lapidus A, Grigoriev I, Richardson P, Hugenholtz P, Kyrpides NC: Use of simulated data sets to evaluate the fidelity of metagenomic processing methods. Nat Methods 2007, 4(6):495\u2013500. 10.1038\/nmeth1043","journal-title":"Nat Methods"},{"issue":"5778","key":"2167_CR12","doi-asserted-by":"publisher","first-page":"1355","DOI":"10.1126\/science.1124234","volume":"312","author":"SR Gill","year":"2006","unstructured":"Gill SR, Pop M, Deboy RT, Eckburg PB, Turnbaugh PJ, Samuel BS, Gordon JI, Relman DA, Fraser-Liggett CM, Nelson KE: Metagenomic analysis of the human distal gut microbiome. Science 2006, 312(5778):1355\u20131359. 10.1126\/science.1124234","journal-title":"Science"},{"issue":"1","key":"2167_CR13","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1038\/nmeth976","volume":"4","author":"AC McHardy","year":"2007","unstructured":"McHardy AC, Martin HG, Tsirigos A, Hugenholtz P, Rigoutsos I: Accurate phylogenetic classification of variable-length DNA fragments. Nat Methods 2007, 4(1):63\u201372. 10.1038\/nmeth976","journal-title":"Nat Methods"},{"issue":"14","key":"2167_CR14","doi-asserted-by":"publisher","first-page":"e281","DOI":"10.1093\/bioinformatics\/btl247","volume":"22","author":"L Krause","year":"2006","unstructured":"Krause L, Diaz NN, Bartels D, Edwards RA, Puhler A, Rohwer F, Meyer F, Stoye J: Finding novel genes in bacterial communities isolated from the environment. Bioinformatics 2006, 22(14):e281\u20139. 10.1093\/bioinformatics\/btl247","journal-title":"Bioinformatics"},{"issue":"19","key":"2167_CR15","doi-asserted-by":"publisher","first-page":"5623","DOI":"10.1093\/nar\/gkl723","volume":"34","author":"H Noguchi","year":"2006","unstructured":"Noguchi H, Park J, Takagi T: MetaGene: prokaryotic gene finding from environmental genome shotgun sequences. Nucleic Acids Res 2006, 34(19):5623\u20135630. 10.1093\/nar\/gkl723","journal-title":"Nucleic Acids Res"},{"issue":"35","key":"2167_CR16","doi-asserted-by":"publisher","first-page":"13913","DOI":"10.1073\/pnas.0702636104","volume":"104","author":"ED Harrington","year":"2007","unstructured":"Harrington ED, Singh AH, Doerks T, Letunic I, von Mering C, Jensen LJ, Raes J, Bork P: Quantitative assessment of protein function prediction from metagenomics shotgun sequences. Proc Natl Acad Sci U S A 2007, 104(35):13913\u201313918. 10.1073\/pnas.0702636104","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"4","key":"2167_CR17","doi-asserted-by":"publisher","first-page":"536","DOI":"10.1006\/jmbi.1995.0159","volume":"247","author":"AG Murzin","year":"1995","unstructured":"Murzin AG, Brenner SE, Hubbard T, Chothia C: SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 1995, 247(4):536\u2013540. 10.1006\/jmbi.1995.0159","journal-title":"J Mol Biol"},{"issue":"8","key":"2167_CR18","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1016\/S0969-2126(97)00260-8","volume":"5","author":"CA Orengo","year":"1997","unstructured":"Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, Thornton JM: CATH--a hierarchic classification of protein domain structures. Structure 1997, 5(8):1093\u20131108. 10.1016\/S0969-2126(97)00260-8","journal-title":"Structure"},{"issue":"Database issue","key":"2167_CR19","doi-asserted-by":"publisher","first-page":"D138","DOI":"10.1093\/nar\/gkh121","volume":"32","author":"A Bateman","year":"2004","unstructured":"Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL, Studholme DJ, Yeats C, Eddy SR: The Pfam protein families database. Nucleic Acids Res 2004, 32(Database issue):D138\u201341. 10.1093\/nar\/gkh121","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2167_CR20","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1093\/nar\/26.1.323","volume":"26","author":"F Corpet","year":"1998","unstructured":"Corpet F, Gouzy J, Kahn D: The ProDom database of protein domain families. Nucleic Acids Res 1998, 26(1):323\u2013326. 10.1093\/nar\/26.1.323","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2167_CR21","doi-asserted-by":"publisher","first-page":"371","DOI":"10.1093\/nar\/gkg128","volume":"31","author":"DH Haft","year":"2003","unstructured":"Haft DH, Selengut JD, White O: The TIGRFAMs database of protein families. Nucleic Acids Res 2003, 31(1):371\u2013373. 10.1093\/nar\/gkg128","journal-title":"Nucleic Acids Res"},{"issue":"3","key":"2167_CR22","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"SF Altschul","year":"1990","unstructured":"Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215(3):403\u2013410.","journal-title":"J Mol Biol"},{"issue":"3","key":"2167_CR23","doi-asserted-by":"publisher","first-page":"e75","DOI":"10.1371\/journal.pbio.0050075","volume":"5","author":"R Seshadri","year":"2007","unstructured":"Seshadri R, Kravitz SA, Smarr L, Gilna P, Frazier M: CAMERA: A Community Resource for Metagenomics. PLoS Biol 2007, 5(3):e75. 10.1371\/journal.pbio.0050075","journal-title":"PLoS Biol"},{"issue":"Database issue","key":"2167_CR24","doi-asserted-by":"publisher","first-page":"D5","DOI":"10.1093\/nar\/gkl1031","volume":"35","author":"DL Wheeler","year":"2007","unstructured":"Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Edgar R, Federhen S, Geer LY, Kapustin Y, Khovayko O, Landsman D, Lipman DJ, Madden TL, Maglott DR, Ostell J, Miller V, Pruitt KD, Schuler GD, Sequeira E, Sherry ST, Sirotkin K, Souvorov A, Starchenko G, Tatusov RL, Tatusova TA, Wagner L, Yaschenko E: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2007, 35(Database issue):D5\u201312. 10.1093\/nar\/gkl1031","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"2167_CR25","doi-asserted-by":"publisher","first-page":"D556","DOI":"10.1093\/nar\/gkj133","volume":"34","author":"E Birney","year":"2006","unstructured":"Birney E, Andrews D, Caccamo M, Chen Y, Clarke L, Coates G, Cox T, Cunningham F, Curwen V, Cutts T, Down T, Durbin R, Fernandez-Suarez XM, Flicek P, Graf S, Hammond M, Herrero J, Howe K, Iyer V, Jekosch K, Kahari A, Kasprzyk A, Keefe D, Kokocinski F, Kulesha E, London D, Longden I, Melsopp C, Meidl P, Overduin B, Parker A, Proctor G, Prlic A, Rae M, Rios D, Redmond S, Schuster M, Sealy I, Searle S, Severin J, Slater G, Smedley D, Smith J, Stabenau A, Stalker J, Trevanion S, Ureta-Vidal A, Vogel J, White S, Woodwark C, Hubbard TJ: Ensembl 2006. Nucleic Acids Res 2006, 34(Database issue):D556\u201361. 10.1093\/nar\/gkj133","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2167_CR26","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1093\/nar\/28.1.141","volume":"28","author":"J Quackenbush","year":"2000","unstructured":"Quackenbush J, Liang F, Holt I, Pertea G, Upton J: The TIGR gene indices: reconstruction and representation of expressed gene sequences. Nucleic Acids Res 2000, 28(1):141\u2013145. 10.1093\/nar\/28.1.141","journal-title":"Nucleic Acids Res"},{"issue":"17","key":"2167_CR27","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997, 25(17):3389\u20133402. 10.1093\/nar\/25.17.3389","journal-title":"Nucleic Acids Res"},{"issue":"2","key":"2167_CR28","doi-asserted-by":"publisher","first-page":"232","DOI":"10.1110\/ps.9.2.232","volume":"9","author":"L Rychlewski","year":"2000","unstructured":"Rychlewski L, Jaroszewski L, Li W, Godzik A: Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Sci 2000, 9(2):232\u2013241.","journal-title":"Protein Sci"},{"issue":"7","key":"2167_CR29","doi-asserted-by":"publisher","first-page":"335","DOI":"10.1016\/S0168-9525(02)02668-9","volume":"18","author":"H Ochman","year":"2002","unstructured":"Ochman H: Distinguishing the ORFs from the ELFs: short bacterial genes and the annotation of genomes. Trends Genet 2002, 18(7):335\u2013337. 10.1016\/S0168-9525(02)02668-9","journal-title":"Trends Genet"},{"issue":"5","key":"2167_CR30","first-page":"555","volume":"13","author":"Z Yang","year":"1997","unstructured":"Yang Z: PAML: a program package for phylogenetic analysis by maximum likelihood. Computer Applications in BioSciences 1997, 13(5):555\u2013556.","journal-title":"Computer Applications in BioSciences"},{"issue":"3","key":"2167_CR31","doi-asserted-by":"publisher","first-page":"282","DOI":"10.1093\/bioinformatics\/17.3.282","volume":"17","author":"W Li","year":"2001","unstructured":"Li W, Jaroszewski L, Godzik A: Clustering of highly homologous sequences to reduce the size of large protein databases. Bioinformatics 2001, 17(3):282\u2013283. 10.1093\/bioinformatics\/17.3.282","journal-title":"Bioinformatics"},{"issue":"1","key":"2167_CR32","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1093\/bioinformatics\/18.1.77","volume":"18","author":"W Li","year":"2002","unstructured":"Li W, Jaroszewski L, Godzik A: Tolerating some redundancy significantly speeds up clustering of large protein databases. Bioinformatics 2002, 18(1):77\u201382. 10.1093\/bioinformatics\/18.1.77","journal-title":"Bioinformatics"},{"issue":"13","key":"2167_CR33","doi-asserted-by":"publisher","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","volume":"22","author":"W Li","year":"2006","unstructured":"Li W, Godzik A: Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22(13):1658\u20131659. 10.1093\/bioinformatics\/btl158","journal-title":"Bioinformatics"},{"issue":"1","key":"2167_CR34","doi-asserted-by":"publisher","first-page":"198","DOI":"10.1101\/gr.200901","volume":"12","author":"A Nekrutenko","year":"2002","unstructured":"Nekrutenko A, Makova KD, Li WH: The K(A)\/K(S) ratio test for assessing the protein-coding potential of genomic regions: an empirical and simulation study. Genome Res 2002, 12(1):198\u2013202. 10.1101\/gr.200901","journal-title":"Genome Res"},{"issue":"5","key":"2167_CR35","doi-asserted-by":"publisher","first-page":"1792","DOI":"10.1093\/nar\/gkh340","volume":"32","author":"RC Edgar","year":"2004","unstructured":"Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 2004, 32(5):1792\u20131797. 10.1093\/nar\/gkh340","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2167_CR36","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1093\/genetics\/155.1.431","volume":"155","author":"Z Yang","year":"2000","unstructured":"Yang Z, Nielsen R, Goldman N, Pedersen AM: Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics 2000, 155(1):431\u2013449.","journal-title":"Genetics"},{"key":"2167_CR37","unstructured":"CAMERA[ http:\/\/camera.calit2.net\/ ]"},{"key":"2167_CR38","unstructured":"PANDA[ ftp:\/\/ftp.tigr.org\/pub\/software\/PANDA\/ ]"},{"issue":"12","key":"2167_CR39","doi-asserted-by":"publisher","first-page":"5463","DOI":"10.1073\/pnas.74.12.5463","volume":"74","author":"F Sanger","year":"1977","unstructured":"Sanger F, Nicklen S, Coulson AR: DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A 1977, 74(12):5463\u20135467. 10.1073\/pnas.74.12.5463","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"5738","key":"2167_CR40","doi-asserted-by":"publisher","first-page":"1242","DOI":"10.1126\/science.1114057","volume":"309","author":"SJ Giovannoni","year":"2005","unstructured":"Giovannoni SJ, Tripp HJ, Givan S, Podar M, Vergin KL, Baptista D, Bibbs L, Eads J, Richardson TH, Noordewier M, Rappe MS, Short JM, Carrington JC, Mathur EJ: Genome streamlining in a cosmopolitan oceanic bacterium. Science 2005, 309(5738):1242\u20131245. 10.1126\/science.1114057","journal-title":"Science"},{"issue":"11","key":"2167_CR41","doi-asserted-by":"publisher","first-page":"654","DOI":"10.1046\/j.1462-2920.2002.00352.x","volume":"4","author":"M Sait","year":"2002","unstructured":"Sait M, Hugenholtz P, Janssen PH: Cultivation of globally distributed soil bacteria from phylogenetic lineages previously only detected in cultivation-independent surveys. Environ Microbiol 2002, 4(11):654\u2013666. 10.1046\/j.1462-2920.2002.00352.x","journal-title":"Environ Microbiol"},{"issue":"9","key":"2167_CR42","doi-asserted-by":"publisher","first-page":"759","DOI":"10.1093\/bioinformatics\/15.9.759","volume":"15","author":"D Fischer","year":"1999","unstructured":"Fischer D, Eisenberg D: Finding families for genomic ORFans. Bioinformatics 1999, 15(9):759\u2013762. 10.1093\/bioinformatics\/15.9.759","journal-title":"Bioinformatics"},{"key":"2167_CR43","doi-asserted-by":"publisher","first-page":"432","DOI":"10.1002\/cfg.311","volume":"4","author":"N Siew","year":"2003","unstructured":"Siew N, Fischer D: Unravelling the ORFan puzzle. Comparative and Functional Genomics 2003, 4: 432\u2013441. 10.1002\/cfg.311","journal-title":"Comparative and Functional Genomics"},{"issue":"4","key":"2167_CR44","doi-asserted-by":"publisher","first-page":"569","DOI":"10.1002\/prot.10347","volume":"51","author":"R Unger","year":"2003","unstructured":"Unger R, Uliel S, Havlin S: Scaling law in sizes of protein sequence families: from super-families to orphan genes. Proteins 2003, 51(4):569\u2013576. 10.1002\/prot.10347","journal-title":"Proteins"},{"key":"2167_CR45","unstructured":"Microbial Genome Sequencing Project - Gordon and Betty Moore Foundation[ http:\/\/www.moore.org\/microgenome\/ ]"},{"key":"2167_CR46","first-page":"87","volume-title":"Computational Tools For Next-Generation Sequencing Applications","author":"FMDL Vega","year":"2008","unstructured":"Vega FMDL, Marth GT, Sutton G: Computational Tools For Next-Generation Sequencing Applications. 2008, 13: 87\u201389."},{"issue":"7057","key":"2167_CR47","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature03959","volume":"437","author":"M Margulies","year":"2005","unstructured":"Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Myers EW, Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, Volkmer GA, Wang SH, Wang Y, Weiner MP, Yu P, Begley RF, Rothberg JM: Genome sequencing in microfabricated high-density picolitre reactors. Nature 2005, 437(7057):376\u2013380.","journal-title":"Nature"},{"issue":"19","key":"2167_CR48","doi-asserted-by":"publisher","first-page":"3911","DOI":"10.1093\/nar\/27.19.3911","volume":"27","author":"J Besemer","year":"1999","unstructured":"Besemer J, Borodovsky M: Heuristic approach to deriving models for gene finding. Nucleic Acids Res 1999, 27(19):3911\u20133920. 10.1093\/nar\/27.19.3911","journal-title":"Nucleic Acids Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-182.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1471-2105-9-182\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-182.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,7]],"date-time":"2021-09-07T18:58:04Z","timestamp":1631041084000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-182"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,4,10]]},"references-count":48,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["2167"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-182","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,4,10]]},"article-number":"182"}}