{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T13:59:58Z","timestamp":1780495198326,"version":"3.54.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2021,10,6]],"date-time":"2021-10-06T00:00:00Z","timestamp":1633478400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"National Center for Advancing Translational Sciences of the National Institutes of Health","award":["TR001857"],"award-info":[{"award-number":["TR001857"]}]},{"name":"National Center for Advancing Translational Sciences of the National Institutes of Health","award":["UL1 TR001857-01S1"],"award-info":[{"award-number":["UL1 TR001857-01S1"]}]},{"name":"National Center for Advancing Translational Sciences of the National Institutes of Health","award":["U01 TR002623"],"award-info":[{"award-number":["U01 TR002623"]}]},{"name":"Office of the Director of the NIH","award":["OT2 OD026554"],"award-info":[{"award-number":["OT2 OD026554"]}]},{"name":"National Library of Medicine of the NIH","award":["R01 LM012095"],"award-info":[{"award-number":["R01 LM012095"]}]},{"DOI":"10.13039\/100006093","name":"PCORI","doi-asserted-by":"publisher","award":["RI-CRN-2020-006"],"award-info":[{"award-number":["RI-CRN-2020-006"]}],"id":[{"id":"10.13039\/100006093","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,3,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Objective<\/jats:title>\n                    <jats:p>As a long-standing Clinical and Translational Science Awards (CTSA) Program hub, the University of Pittsburgh and the University of Pittsburgh Medical Center (UPMC) developed and implemented a modern research data warehouse (RDW) to efficiently provision electronic patient data for clinical and translational research.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Materials and Methods<\/jats:title>\n                    <jats:p>We designed and implemented an RDW named Neptune to serve the specific needs of our CTSA. Neptune uses an atomic design where data are stored at a high level of granularity as represented in source systems. Neptune contains robust patient identity management tailored for research; integrates patient data from multiple sources, including electronic health records (EHRs), health plans, and research studies; and includes knowledge for mapping to standard terminologies.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Neptune contains data for more than 5 million patients longitudinally organized as Health Insurance Portability and Accountability Act (HIPAA) Limited Data with dates and includes structured EHR data, clinical documents, health insurance claims, and research data. Neptune is used as a source for patient data for hundreds of institutional review board-approved research projects by local investigators and for national projects.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Discussion<\/jats:title>\n                    <jats:p>The design of Neptune was heavily influenced by the large size of UPMC, the varied data sources, and the rich partnership between the University and the healthcare system. It includes several unique aspects, including the physical warehouse straddling the University and UPMC networks and management under an HIPAA Business Associates Agreement.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>We describe the design and implementation of an RDW at a large academic healthcare system that uses a distinctive atomic design where data are stored at a high level of granularity.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/jamia\/ocab204","type":"journal-article","created":{"date-parts":[[2021,9,10]],"date-time":"2021-09-10T15:09:34Z","timestamp":1631286574000},"page":"601-608","source":"Crossref","is-referenced-by-count":36,"title":["An atomic approach to the design and implementation of a research data warehouse"],"prefix":"10.1093","volume":"29","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2079-8684","authenticated-orcid":false,"given":"Shyam","family":"Visweswaran","sequence":"first","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"},{"name":"Clinical and Translational Science Institute, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Brian","family":"McLay","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nickie","family":"Cappella","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Michele","family":"Morris","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"John T","family":"Milnes","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Steven E","family":"Reis","sequence":"additional","affiliation":[{"name":"Clinical and Translational Science Institute, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jonathan C","family":"Silverstein","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"},{"name":"Clinical and Translational Science Institute, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"},{"name":"Chief Research Information Officer, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5998-8074","authenticated-orcid":false,"given":"Michael J","family":"Becich","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"},{"name":"Clinical and Translational Science Institute, University of Pittsburgh, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2021,10,6]]},"reference":[{"issue":"5","key":"2022031509050109300_ocab204-B1","doi-asserted-by":"crossref","first-page":"382","DOI":"10.1056\/NEJMp0912825","article-title":"Launching HITECH","volume":"362","author":"Blumenthal","year":"2010","journal-title":"N Engl J Med"},{"key":"2022031509050109300_ocab204-B2","first-page":"189","author":"Evans"},{"key":"2022031509050109300_ocab204-B3"},{"issue":"4","key":"2022031509050109300_ocab204-B4","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1017\/cts.2017.301","article-title":"A survey of practices for the use of electronic health records to support research recruitment","volume":"1","author":"Obeid","year":"2017","journal-title":"J Clin Transl Sci"},{"issue":"27","key":"2022031509050109300_ocab204-B5","doi-asserted-by":"crossref","first-page":"7329","DOI":"10.1073\/pnas.1510502113","article-title":"Characterizing treatment pathways at scale using the OHDSI network","volume":"113","author":"Hripcsak","year":"2016","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"7","key":"2022031509050109300_ocab204-B6","doi-asserted-by":"crossref","first-page":"1229","DOI":"10.1377\/hlthaff.2014.0099","article-title":"A \u2018green button\u2019 for using aggregate patient data at the point of care","volume":"33","author":"Longhurst","year":"2014","journal-title":"Health Aff (Millwood)"},{"issue":"2","key":"2022031509050109300_ocab204-B7","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1038\/clpt.2012.54","article-title":"Detection of pharmacovigilance-related adverse events using electronic health records and automated methods","volume":"92","author":"Haerian","year":"2012","journal-title":"Clin Pharmacol Ther"},{"issue":"2","key":"2022031509050109300_ocab204-B8","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1093\/jamiaopen\/ooy033","article-title":"Accrual to clinical trials (ACT): a clinical and translational science award consortium network","volume":"1","author":"Visweswaran","year":"2018","journal-title":"JAMIA Open"},{"key":"2022031509050109300_ocab204-B9","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1146\/annurev-genom-090314-024956","article-title":"Phenome-wide association studies as a tool to advance precision medicine","volume":"17","author":"Denny","year":"2016","journal-title":"Annu Rev Genomics Hum Genet"},{"key":"2022031509050109300_ocab204-B10","first-page":"101","author":"Prather"},{"key":"2022031509050109300_ocab204-B11","first-page":"206","article-title":"Automated physician order recommendations and outcome predictions by data-mining electronic medical records","volume":"2014","author":"Chen","year":"2014","journal-title":"AMIA Jt Summits Transl Sci Proc"},{"key":"2022031509050109300_ocab204-B12","doi-asserted-by":"crossref","first-page":"103327","DOI":"10.1016\/j.jbi.2019.103327","article-title":"Using machine learning to selectively highlight patient information","volume":"100","author":"King","year":"2019","journal-title":"J Biomed Inform"},{"key":"2022031509050109300_ocab204-B13","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1016\/j.jbi.2016.10.002","article-title":"Outlier-based detection of unusual patient-management actions: an ICU study","volume":"64","author":"Hauskrecht","year":"2016","journal-title":"J Biomed Inform"},{"issue":"3","key":"2022031509050109300_ocab204-B14","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1002\/wsbm.1378","article-title":"Integrated precision medicine: the role of electronic health records in delivering personalized treatment","volume":"9","author":"Sitapati","year":"2017","journal-title":"Wiley Interdiscip Rev Syst Biol Med"},{"issue":"2","key":"2022031509050109300_ocab204-B15","doi-asserted-by":"crossref","first-page":"109","DOI":"10.4258\/hir.2014.20.2.109","article-title":"Characteristics desired in clinical data warehouse for biomedical research","volume":"20","author":"Shin","year":"2014","journal-title":"Healthc Inform Res"},{"key":"2022031509050109300_ocab204-B16","year":"2020"},{"issue":"4","key":"2022031509050109300_ocab204-B17","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1111\/cts.12294","article-title":"Enabling a learning health system through a unified enterprise data warehouse: the experience of the Northwestern University Clinical and Translational Sciences (NUCATS) Institute","volume":"8","author":"Starren","year":"2015","journal-title":"Clin Transl Sci"},{"key":"2022031509050109300_ocab204-B18","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/j.jbi.2014.07.006","article-title":"Modular design, application architecture, and usage of a self-service model for enterprise data delivery: the Duke Enterprise Data Unified Content Explorer (DEDUCE)","volume":"52","author":"Horvath","year":"2014","journal-title":"J Biomed Inform"},{"key":"2022031509050109300_ocab204-B19","first-page":"391","article-title":"STRIDE\u2014an integrated standards-based translational research informatics platform","volume":"2009","author":"Lowe","year":"2009","journal-title":"AMIA Annu Symp Proc"},{"key":"2022031509050109300_ocab204-B20","author":"Datta","year":"2020"},{"key":"2022031509050109300_ocab204-B21","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.jbi.2014.02.003","article-title":"Secondary use of clinical data: the Vanderbilt approach","volume":"52","author":"Danciu","year":"2014","journal-title":"J Biomed Inform"},{"issue":"2","key":"2022031509050109300_ocab204-B22","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1136\/jamia.2009.000893","article-title":"Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2)","volume":"17","author":"Murphy","year":"2010","journal-title":"J Am Med Inform Assoc"},{"issue":"574","key":"2022031509050109300_ocab204-B23","first-page":"574","article-title":"Observational Health Data Sciences and Informatics (OHDSI): opportunities for observational researchers","volume":"216","author":"Hripcsak","year":"2015","journal-title":"Stud Health Technol Inform"},{"issue":"4","key":"2022031509050109300_ocab204-B24","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1136\/amiajnl-2014-002747","article-title":"Launching PCORnet, a national patient-centered clinical research network","volume":"21","author":"Fleurence","year":"2014","journal-title":"J Am Med Inform Assoc"},{"key":"2022031509050109300_ocab204-B25","first-page":"80","author":"Haertzen","year":"2012"},{"issue":"4","key":"2022031509050109300_ocab204-B26","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1016\/0306-4573(91)90091-Y","article-title":"The Medical Archival System: an information retrieval system based on distributed parallel processing","volume":"27","author":"Yount","year":"1991","journal-title":"Inf Process Manag"},{"key":"2022031509050109300_ocab204-B27","year":"2020"},{"key":"2022031509050109300_ocab204-B28","year":"2020"},{"key":"2022031509050109300_ocab204-B29","volume-title":"An Easy-to-Use Clinical Text De-Identification Tool for Clinical Scientists: NLM Scrubber","author":"Kayaalp"},{"issue":"Database issue","key":"2022031509050109300_ocab204-B30","doi-asserted-by":"crossref","first-page":"D267","DOI":"10.1093\/nar\/gkh061","article-title":"The Unified Medical Language System (UMLS): integrating biomedical terminology","volume":"32","author":"Bodenreider","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2022031509050109300_ocab204-B31","first-page":"1224","article-title":"The NLM value set authority center","volume":"192","author":"Bodenreider","year":"2013","journal-title":"Stud Health Technol Inform"},{"issue":"2","key":"2022031509050109300_ocab204-B32","doi-asserted-by":"crossref","first-page":"290","DOI":"10.1097\/EDE.0000000000000788","article-title":"Machine learning for fetal growth prediction","volume":"29","author":"Naimi","year":"2018","journal-title":"Epidemiology"},{"key":"2022031509050109300_ocab204-B33","year":"2020"},{"key":"2022031509050109300_ocab204-B34","year":"2020"},{"issue":"5","key":"2022031509050109300_ocab204-B35","doi-asserted-by":"crossref","first-page":"624","DOI":"10.1197\/jamia.M3191","article-title":"The Shared Health Research Information Network (SHRINE): a prototype federated query tool for clinical data repositories","volume":"16","author":"Weber","year":"2009","journal-title":"J Am Med Inform Assoc"},{"issue":"7","key":"2022031509050109300_ocab204-B36","doi-asserted-by":"crossref","first-page":"668","DOI":"10.1056\/NEJMsr1809937","article-title":"The \u201cAll of Us\u201d research program","volume":"381","year":"2019","journal-title":"N Engl J Med"},{"key":"2022031509050109300_ocab204-B37","year":"2020"},{"issue":"2","key":"2022031509050109300_ocab204-B38","first-page":"1213","article-title":"Software-enabled distributed network governance: the PopMedNet experience","volume":"4","author":"Davies","year":"2016","journal-title":"EGEMS (Wash DC)"},{"key":"2022031509050109300_ocab204-B39","year":"2020"},{"issue":"2","key":"2022031509050109300_ocab204-B40","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1038\/s41436-019-0646-3","article-title":"The Genomics Research and Innovation Network: creating an interoperable, federated, genomics learning system","volume":"22","author":"Mandl","year":"2020","journal-title":"Genet Med"},{"key":"2022031509050109300_ocab204-B41","first-page":"96","article-title":"tranSMART: an open source knowledge management and high content data analytics platform","volume":"2014","author":"Scheufele","year":"2014","journal-title":"AMIA Jt Summits Transl Sci Proc"},{"key":"2022031509050109300_ocab204-B42","first-page":"648","author":"Huser"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/29\/4\/601\/42897572\/ocab204.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/29\/4\/601\/42897572\/ocab204.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,15]],"date-time":"2022-03-15T05:07:54Z","timestamp":1647320874000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/29\/4\/601\/6382237"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,6]]},"references-count":42,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2021,10,6]]},"published-print":{"date-parts":[[2022,3,15]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocab204","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.05.05.21256679","asserted-by":"object"}]},"ISSN":["1527-974X"],"issn-type":[{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,4,1]]},"published":{"date-parts":[[2021,10,6]]}}}