{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,2,2]],"date-time":"2024-02-02T00:17:04Z","timestamp":1706833024388},"reference-count":17,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,9,15]],"date-time":"2016-09-15T00:00:00Z","timestamp":1473897600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,9,15]],"date-time":"2016-09-15T00:00:00Z","timestamp":1473897600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"MITRE Innovation Program"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Pathogen metadata includes information about where and when a pathogen was collected and the type of environment it came from. Along with genomic nucleotide sequence data, this metadata is growing rapidly and becoming a valuable resource not only for research but for biosurveillance and public health. However, current freely available tools for analyzing this data are geared towards bioinformaticians and\/or do not provide summaries and visualizations needed to readily interpret results.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We designed a platform to easily access and summarize data about pathogen samples. The software includes a PostgreSQL database that captures metadata useful for disease outbreak investigations, and scripts for downloading and parsing data from NCBI BioSample and BioProject into the database. The software provides a user interface to query metadata and obtain standardized results in an exportable, tab-delimited format. To visually summarize results, the user interface provides a 2D histogram for user-selected metadata types and mapping of geolocated entries. The software is built on the LabKey data platform, an open-source data management platform, which enables developers to add functionalities. We demonstrate the use of the software in querying for a pathogen serovar and for genome sequence identifiers.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>This software enables users to create a local database for pathogen metadata, populate it with data from NCBI, easily query the data, and obtain visual summaries. Some of the components, such as the database, are modular and can be incorporated into other data platforms. The source code is freely available for download at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/wchangmitre\/bioattribution\">https:\/\/github.com\/wchangmitre\/bioattribution<\/jats:ext-link>.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-016-1231-2","type":"journal-article","created":{"date-parts":[[2016,9,15]],"date-time":"2016-09-15T11:48:27Z","timestamp":1473940107000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Pathogen metadata platform: software for accessing and analyzing pathogen strain information"],"prefix":"10.1186","volume":"17","author":[{"given":"Wenling E.","family":"Chang","sequence":"first","affiliation":[]},{"given":"Matthew W.","family":"Peterson","sequence":"additional","affiliation":[]},{"given":"Christopher D.","family":"Garay","sequence":"additional","affiliation":[]},{"given":"Tonia","family":"Korves","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2016,9,15]]},"reference":[{"key":"1231_CR1","doi-asserted-by":"publisher","first-page":"415","DOI":"10.1038\/nbt.1823","volume":"29","author":"P Yilmaz","year":"2011","unstructured":"Yilmaz P, Kottmann R, Field D, Knight R, Cole JR, Amaral-Zettler L, et al. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications. Nat Biotech. 2011;29:415\u201320.","journal-title":"Nat Biotech"},{"key":"1231_CR2","doi-asserted-by":"publisher","first-page":"541","DOI":"10.1038\/nbt1360","volume":"26","author":"D Field","year":"2008","unstructured":"Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541\u20137.","journal-title":"Nat Biotechnol"},{"key":"1231_CR3","doi-asserted-by":"publisher","first-page":"e99979","DOI":"10.1371\/journal.pone.0099979","volume":"9","author":"VG Dugan","year":"2014","unstructured":"Dugan VG, Emrich SJ, Giraldo-Calder\u00f3n GI, Harb OS, Newman RM, Pickett BE, et al. Standardized Metadata for Human Pathogen\/Vector Genomic Sequences. PLoS One. 2014;9:e99979.","journal-title":"PLoS One"},{"key":"1231_CR4","doi-asserted-by":"publisher","first-page":"D57","DOI":"10.1093\/nar\/gkr1163","volume":"40","author":"T Barrett","year":"2012","unstructured":"Barrett T, Clark K, Gevorgyan R, Gorelenkov V, Gribov E, Karsch-Mizrachi I, et al. BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata. Nucleic Acids Res. 2012;40:D57\u201363.","journal-title":"Nucleic Acids Res"},{"key":"1231_CR5","doi-asserted-by":"publisher","first-page":"D64","DOI":"10.1093\/nar\/gkr937","volume":"40","author":"M Gostev","year":"2012","unstructured":"Gostev M, Faulconbridge A, Brandizi M, Fernandez-Banet J, Sarkans U, Brazma A, et al. The BioSample Database (BioSD) at the European Bioinformatics Institute. Nucleic Acids Res. 2012;40:D64\u201370.","journal-title":"Nucleic Acids Res"},{"key":"1231_CR6","doi-asserted-by":"publisher","first-page":"D593","DOI":"10.1093\/nar\/gkr859","volume":"40","author":"BE Pickett","year":"2012","unstructured":"Pickett BE, Sadat EL, Zhang Y, Noronha JM, Squires RB, Hunt V, et al. ViPR: an open bioinformatics database and analysis resource for virology research. Nucleic Acids Res. 2012;40:D593\u20138.","journal-title":"Nucleic Acids Res"},{"key":"1231_CR7","doi-asserted-by":"publisher","first-page":"D581","DOI":"10.1093\/nar\/gkt1099","volume":"42","author":"AR Wattam","year":"2014","unstructured":"Wattam AR, Abraham D, Dalay O, Disz TL, Driscoll T, Gabbard JL, et al. PATRIC, the bacterial bioinformatics database and analysis resource. Nucleic Acids Res. 2014;42:D581\u201391.","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"1231_CR8","doi-asserted-by":"publisher","first-page":"D1099","DOI":"10.1093\/nar\/gku950","volume":"43","author":"TBK Reddy","year":"2015","unstructured":"Reddy TBK, Thomas AD, Stamatis D, Bertsch J, Isbandi M, Jansson J, et al. The Genomes OnLine Database (GOLD) v. 5: a metadata management system based on a four level (meta) genome project classification. Nucleic Acids Res. 2015;43(Database issue):D1099\u20131106.","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"1231_CR9","doi-asserted-by":"publisher","first-page":"D6","DOI":"10.1093\/nar\/gku1130","volume":"43","author":"NCBI Resource Coordinators","year":"2015","unstructured":"NCBI Resource Coordinators. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2015;43(Database issue):D6\u2013D17.","journal-title":"Nucleic Acids Res"},{"key":"1231_CR10","doi-asserted-by":"publisher","first-page":"1422","DOI":"10.1093\/bioinformatics\/btp163","volume":"25","author":"PJA Cock","year":"2009","unstructured":"Cock PJA, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinforma Oxf Engl. 2009;25:1422\u20133.","journal-title":"Bioinforma Oxf Engl"},{"key":"1231_CR11","doi-asserted-by":"publisher","first-page":"2693","DOI":"10.1093\/bioinformatics\/bts494","volume":"28","author":"A Prli\u0107","year":"2012","unstructured":"Prli\u0107 A, Yates A, Bliven SE, Rose PW, Jacobsen J, Troshin PV, et al. BioJava: an open-source framework for bioinformatics in 2012. Bioinforma Oxf Engl. 2012;28:2693\u20135.","journal-title":"Bioinforma Oxf Engl"},{"key":"1231_CR12","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1186\/1471-2105-14-19","volume":"14","author":"Y Zhu","year":"2013","unstructured":"Zhu Y, Stephens RM, Meltzer PS, Davis SR. SRAdb: query and use public next-generation sequencing data from within R. BMC Bioinformatics. 2013;14:19.","journal-title":"BMC Bioinformatics"},{"key":"1231_CR13","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1186\/1471-2105-12-71","volume":"12","author":"EK Nelson","year":"2011","unstructured":"Nelson EK, Piehler B, Eckels J, Rauch A, Bellew M, Hussey P, et al. LabKey Server: an open source platform for scientific data integration, analysis and collaboration. BMC Bioinformatics. 2011;12:71.","journal-title":"BMC Bioinformatics"},{"key":"1231_CR14","doi-asserted-by":"publisher","first-page":"2301","DOI":"10.1109\/TVCG.2011.185","volume":"17","author":"M Bostock","year":"2011","unstructured":"Bostock M, Ogievetsky V, Heer J. D3 Data-Driven Documents. IEEE Trans Vis Comput Graph. 2011;17:2301\u20139.","journal-title":"IEEE Trans Vis Comput Graph"},{"key":"1231_CR15","doi-asserted-by":"publisher","first-page":"1312","DOI":"10.1093\/bioinformatics\/btu033","volume":"30","author":"A Stamatakis","year":"2014","unstructured":"Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312\u20133.","journal-title":"Bioinformatics"},{"key":"1231_CR16","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1111\/j.1096-0031.2010.00314.x","volume":"27","author":"DA Janies","year":"2011","unstructured":"Janies DA, Treseder T, Alexandrov B, Habib F, Chen JJ, Ferreira R, et al. The Supramap project: linking pathogen genomes with geography to fight emergent infectious diseases. Cladistics. 2011;27:61\u20136.","journal-title":"Cladistics"},{"key":"1231_CR17","doi-asserted-by":"publisher","first-page":"e92877","DOI":"10.1371\/journal.pone.0092877","volume":"9","author":"DP Sargeant","year":"2014","unstructured":"Sargeant DP, Hedden MW, Deverasetty S, Strong CL, Alaniz IJ, Bartlett AN, et al. The Geogenomic Mutational Atlas of Pathogens (GoMAP) web system. PloS One. 2014;9:e92877.","journal-title":"PloS One"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1231-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-016-1231-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1231-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T18:04:08Z","timestamp":1706810648000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-016-1231-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,9,15]]},"references-count":17,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2016,12]]}},"alternative-id":["1231"],"URL":"https:\/\/doi.org\/10.1186\/s12859-016-1231-2","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,9,15]]},"assertion":[{"value":"16 September 2015","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 August 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 September 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"379"}}