{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,8]],"date-time":"2026-02-08T22:29:01Z","timestamp":1770589741325,"version":"3.49.0"},"reference-count":29,"publisher":"Oxford University Press (OUP)","issue":"6","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Modern biomedical data collection is generating exponentially more data in a multitude of formats. This flood of complex data poses significant opportunities to discover and understand the critical interplay among such diverse domains as genomics, proteomics, metabolomics, and phenomics, including imaging, biometrics, and clinical data. The Big Data for Discovery Science Center is taking an \u201c-ome to home\u201d approach to discover linkages between these disparate data sources by mining existing databases of proteomic and genomic data, brain images, and clinical assessments. In support of this work, the authors developed new technological capabilities that make it easy for researchers to manage, aggregate, manipulate, integrate, and model large amounts of distributed data. 
Guided by biological domain expertise, the Center\u2019s computational resources and software will reveal relationships and patterns, aiding researchers in identifying biomarkers for the most confounding conditions and diseases, such as Parkinson\u2019s and Alzheimer\u2019s.<\/jats:p>","DOI":"10.1093\/jamia\/ocv077","type":"journal-article","created":{"date-parts":[[2015,7,22]],"date-time":"2015-07-22T02:00:48Z","timestamp":1437530448000},"page":"1126-1131","source":"Crossref","is-referenced-by-count":62,"title":["Big biomedical data as the key resource for discovery science"],"prefix":"10.1093","volume":"22","author":[{"given":"Arthur W","family":"Toga","sequence":"first","affiliation":[{"name":"Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, University of Southern California, Los Angeles, CA, USA"}]},{"given":"Ian","family":"Foster","sequence":"additional","affiliation":[{"name":"Computation Institute, University of Chicago and Argonne National Laboratory, Chicago, IL, USA"}]},{"given":"Carl","family":"Kesselman","sequence":"additional","affiliation":[{"name":"Information Sciences Institute, University of Southern California, Los Angeles, CA, USA"}]},{"given":"Ravi","family":"Madduri","sequence":"additional","affiliation":[{"name":"Computation Institute, University of Chicago and Argonne National Laboratory, Chicago, IL, USA"}]},{"given":"Kyle","family":"Chard","sequence":"additional","affiliation":[{"name":"Computation Institute, University of Chicago and Argonne National Laboratory, Chicago, IL, USA"}]},{"given":"Eric W","family":"Deutsch","sequence":"additional","affiliation":[{"name":"Institute for Systems Biology, Seattle, WA, USA"}]},{"given":"Nathan D","family":"Price","sequence":"additional","affiliation":[{"name":"Institute for Systems Biology, Seattle, WA, USA"}]},{"given":"Gustavo","family":"Glusman","sequence":"additional","affiliation":[{"name":"Institute for Systems Biology, Seattle, WA, USA"}]},{"given":"Benjamin 
D","family":"Heavner","sequence":"additional","affiliation":[{"name":"Institute for Systems Biology, Seattle, WA, USA"}]},{"given":"Ivo D","family":"Dinov","sequence":"additional","affiliation":[{"name":"Statistics Online Computational Resource (SOCR), UMSN, University of Michigan, Ann Arbor, MI, USA"}]},{"given":"Joseph","family":"Ames","sequence":"additional","affiliation":[{"name":"Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, University of Southern California, Los Angeles, CA, USA"}]},{"given":"John","family":"Van Horn","sequence":"additional","affiliation":[{"name":"Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, University of Southern California, Los Angeles, CA, USA"}]},{"given":"Roger","family":"Kramer","sequence":"additional","affiliation":[{"name":"Institute for Systems Biology, Seattle, WA, USA"}]},{"given":"Leroy","family":"Hood","sequence":"additional","affiliation":[{"name":"Institute for Systems Biology, Seattle, WA, USA"}]}],"member":"286","published-online":{"date-parts":[[2015,7,21]]},"reference":[{"issue":"2","key":"2020110613002977700_ocv077-B1","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1007\/s11682-013-9255-y","article-title":"Human neuroimaging as a \u201cBig Data\u201d science","volume":"8","author":"Van Horn","year":"2014","journal-title":"Brain Imaging Behav"},{"key":"2020110613002977700_ocv077-B2","doi-asserted-by":"crossref","first-page":"480","DOI":"10.1007\/978-3-642-22351-8_31","article-title":"Database-as-a-service for long-tail science","volume-title":"Proceedings of the 23rd International Conference on Scientific and Statistical Database Management","author":"Howe","year":"2011"},{"issue":"7317","key":"2020110613002977700_ocv077-B3","doi-asserted-by":"crossref","first-page":"S6","DOI":"10.1038\/467S6a","article-title":"Science brick by 
brick","volume":"467","author":"Smithies","year":"2010","journal-title":"Nature."},{"key":"2020110613002977700_ocv077-B4","doi-asserted-by":"crossref","DOI":"10.1109\/SSDM.2002.1029704","article-title":"Chimera: a virtual data system for representing, querying, and automating data derivation","volume-title":"14th International Conference on Scientific and Statistical Database Management","author":"Foster","year":"2002"},{"key":"2020110613002977700_ocv077-B5","first-page":"207","article-title":"Accelerating medical research using the swift workflow system","volume":"126","author":"Stef-Praun","year":"2007","journal-title":"Stud Health Technol Inform."},{"key":"2020110613002977700_ocv077-B6","article-title":"Digital asset management for heterogeneous biomedical data in an era of data-intensive science","author":"Schuler","year":"2014","journal-title":"Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on, IEEE, 2 Nov\u20135 Nov 2014, Belfast, United Kingdom"},{"key":"2020110613002977700_ocv077-B7","article-title":"The Alzheimer's Disease Neuroimaging Initiative Informatics Core: A Decade in Review","author":"Crawford","year":"2015","journal-title":"Alzheimer's & Dementia"},{"key":"2020110613002977700_ocv077-B8","first-page":"209","article-title":"Storage resource managers: Middleware components for grid storage","volume-title":"NASA Conference Publication","author":"Shoshani","year":"2002"},{"issue":"1","key":"2020110613002977700_ocv077-B9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.2200\/S00233ED1V01Y200912ICR012","article-title":"iRODS Primer: integrated rule-oriented data system","volume":"2","author":"Rajasekar","year":"2010","journal-title":"Synthesis Lectures on Information Concepts, Retrieval, and Services."},{"key":"2020110613002977700_ocv077-B10","article-title":"MERRA analytic services: meeting the big data challenges of climate science through cloud-enabled climate 
analytics-as-a-service","author":"Schnase","year":"2014","journal-title":"Comput, Environ Urban Sys"},{"issue":"8","key":"2020110613002977700_ocv077-B11","first-page":"1","article-title":"Practical management of heterogeneous neuroimaging metadata by global neuroimaging data repositories","volume":"6","author":"Neu","year":"2012","journal-title":"Front Neuroinform."},{"key":"2020110613002977700_ocv077-B12","article-title":"Data sharing in Alzheimer's disease research","author":"Toga","year":"2015","journal-title":"Alzheimer's Disease and Associated Disorders"},{"key":"2020110613002977700_ocv077-B13","doi-asserted-by":"crossref","DOI":"10.1016\/j.jalz.2015.07.023","article-title":"The Global Alzheimer\u2019s Association Interactive Network","author":"Toga","year":"2015","journal-title":"Alzheimer's & Dementia"},{"key":"2020110613002977700_ocv077-B14","article-title":"The FaceBase Hub: a resource for translational craniofacial genetics","volume-title":"Am J Med Genet Part A","author":"Marazita","year":"2014"},{"issue":"4","key":"2020110613002977700_ocv077-B15","doi-asserted-by":"crossref","first-page":"370","DOI":"10.1097\/WCO.0b013e32832d92de","article-title":"Multisite neuroimaging trials","volume":"22","author":"Van Horn","year":"2009","journal-title":"Curr Opin Neurol."},{"issue":"3","key":"2020110613002977700_ocv077-B16","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1109\/MIC.2011.64","article-title":"Globus online: accelerating and democratizing science through cloud-based services","volume":"15","author":"Foster","year":"2011","journal-title":"IEEE Internet Computing"},{"key":"2020110613002977700_ocv077-B17","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-319-08590-6_1","article-title":"An asset management approach to continuous integration of heterogeneous biomedical data","volume-title":"Data Integration in the Life 
Sciences","author":"Schuler","year":"2014"},{"issue":"6","key":"2020110613002977700_ocv077-B18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v044.i06","article-title":"Working with the DICOM and NIfTI Data Standards in R","volume":"44","author":"Whitcher","year":"2011","journal-title":"J Stat Softw."},{"issue":"4","key":"2020110613002977700_ocv077-B19","doi-asserted-by":"crossref","first-page":"464","DOI":"10.1093\/bioinformatics\/btr703","article-title":"Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental data","volume":"28","author":"Carver","year":"2012","journal-title":"Bioinformatics."},{"issue":"6","key":"2020110613002977700_ocv077-B20","doi-asserted-by":"crossref","first-page":"1150","DOI":"10.1002\/pmic.200900375","article-title":"A guided tour of the trans-proteomic pipeline","volume":"10","author":"Deutsch","year":"2010","journal-title":"Proteomics."},{"key":"2020110613002977700_ocv077-B21","doi-asserted-by":"crossref","DOI":"10.1038\/msb4100024","article-title":"A uniform proteomics MS\/MS analysis platform utilizing open XML file formats","volume-title":"Mol Syst Biol.","author":"Keller","year":"2005"},{"issue":"2","key":"2020110613002977700_ocv077-B22","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1074\/mcp.O114.043380","article-title":"Processing shotgun proteomics data on the Amazon Cloud with the Trans-Proteomic Pipeline","volume":"14","author":"Slagel","year":"2014","journal-title":"Mol Cell Proteomics"},{"issue":"9","key":"2020110613002977700_ocv077-B23","doi-asserted-by":"crossref","first-page":"e13070","DOI":"10.1371\/journal.pone.0013070","article-title":"Neuroimaging study designs, computational analyses and data provenance using the LONI Pipeline","volume":"5","author":"Dinov","year":"2010","journal-title":"PLoS 
ONE."},{"key":"2020110613002977700_ocv077-B24","doi-asserted-by":"crossref","first-page":"5383","DOI":"10.1021\/ac025747h","article-title":"Empirical statistical model to estimate the accuracy of peptide identifications made by MS\/MS and database search","volume":"74","author":"Keller","year":"2002","journal-title":"Anal Chem."},{"issue":"17","key":"2020110613002977700_ocv077-B25","doi-asserted-by":"crossref","first-page":"4646","DOI":"10.1021\/ac0341261","article-title":"A statistical model for identifying proteins by tandem mass spectrometry","volume":"75","author":"Nesvizhskii","year":"2003","journal-title":"Anal Chem."},{"key":"2020110613002977700_ocv077-B26","first-page":"45","article-title":"Identification of copy number variants in whole-genome data using Reference Coverage Profiles","volume":"6","author":"Glusman","year":"2015","journal-title":"Front Genet."},{"issue":"22","key":"2020110613002977700_ocv077-B27","doi-asserted-by":"crossref","first-page":"3216","DOI":"10.1093\/bioinformatics\/btr540","article-title":"Kaviar: an accessible system for testing SNV novelty","volume":"27","author":"Glusman","year":"2011","journal-title":"Bioinformatics."},{"issue":"5","key":"2020110613002977700_ocv077-B28","doi-asserted-by":"crossref","first-page":"482","DOI":"10.1093\/bioinformatics\/16.5.482","article-title":"GESTALT: a workbench for automatic integration and visualization of large-scale genomic sequence analyses","volume":"16","author":"Glusman","year":"2000","journal-title":"Bioinformatics."},{"issue":"5978","key":"2020110613002977700_ocv077-B29","doi-asserted-by":"crossref","first-page":"636","DOI":"10.1126\/science.1186802","article-title":"Analysis of Genetic Inheritance in a Family Quartet by Whole Genome Sequencing","volume":"328","author":"Roach","year":"2010","journal-title":"Science."}],"container-title":["Journal of the American Medical Informatics 
Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/22\/6\/1126\/34145802\/ocv077.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/22\/6\/1126\/34145802\/ocv077.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,6]],"date-time":"2020-11-06T18:43:55Z","timestamp":1604688235000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/22\/6\/1126\/2357756"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,7,21]]},"references-count":29,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2015,7,21]]},"published-print":{"date-parts":[[2015,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocv077","relation":{},"ISSN":["1527-974X","1067-5027"],"issn-type":[{"value":"1527-974X","type":"electronic"},{"value":"1067-5027","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,11]]},"published":{"date-parts":[[2015,7,21]]}}}