{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T23:42:44Z","timestamp":1776814964934,"version":"3.51.2"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2021,11,28]],"date-time":"2021-11-28T00:00:00Z","timestamp":1638057600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"Mass General Brigham institutional"},{"DOI":"10.13039\/100000051","name":"National Human Genome Research Institute","doi-asserted-by":"publisher","award":["R01-HG009174"],"award-info":[{"award-number":["R01-HG009174"]}],"id":[{"id":"10.13039\/100000051","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000051","name":"National Human Genome Research Institute","doi-asserted-by":"publisher","award":["U01HG008685-05S1"],"award-info":[{"award-number":["U01HG008685-05S1"]}],"id":[{"id":"10.13039\/100000051","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000050","name":"National Heart, Lung, and Blood Institute","doi-asserted-by":"publisher","award":["1OT2HL161841-01"],"award-info":[{"award-number":["1OT2HL161841-01"]}],"id":[{"id":"10.13039\/100000050","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,3,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Objective<\/jats:title><jats:p>Integrating and harmonizing disparate patient data sources into one consolidated data portal enables researchers to conduct analysis efficiently and effectively.<\/jats:p><\/jats:sec><jats:sec><jats:title>Materials and Methods<\/jats:title><jats:p>We describe an implementation of Informatics for Integrating Biology and the Bedside (i2b2) to create the Mass General Brigham (MGB) Biobank Portal data repository. The repository integrates data from primary and curated data sources and is updated weekly. The data are made readily available to investigators in a data portal where they can easily construct and export customized datasets for analysis.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>As of July 2021, there are 125\u00a0645 consented patients enrolled in the MGB Biobank. 88\u00a0527 (70.5%) have a biospecimen, 55\u00a0121 (43.9%) have completed the health information survey, 43\u00a0552 (34.7%) have genomic data and 124\u00a0760 (99.3%) have EHR data. Twenty machine learning computed phenotypes are calculated on a weekly basis. There are currently 1220 active investigators who have run 58\u00a0793 patient queries and exported 10\u00a0257 analysis files.<\/jats:p><\/jats:sec><jats:sec><jats:title>Discussion<\/jats:title><jats:p>The Biobank Portal allows noninformatics researchers to conduct study feasibility by querying across many data sources and then extract data that are most useful to them for clinical studies. While institutions require substantial informatics resources to establish and maintain integrated data repositories, they yield significant research value to a wide range of investigators.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>The Biobank Portal and other patient data portals that integrate complex and simple datasets enable diverse research use cases. i2b2 tools to implement these registries and make the data interoperable are open source and freely available.<\/jats:p><\/jats:sec>","DOI":"10.1093\/jamia\/ocab264","type":"journal-article","created":{"date-parts":[[2021,11,16]],"date-time":"2021-11-16T20:12:19Z","timestamp":1637093539000},"page":"643-651","source":"Crossref","is-referenced-by-count":57,"title":["The Mass General Brigham Biobank Portal: an i2b2-based data repository linking disparate and high-dimensional patient data to support multimodal analytics"],"prefix":"10.1093","volume":"29","author":[{"given":"Victor M","family":"Castro","sequence":"first","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Vivian","family":"Gainer","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Nich","family":"Wattanasin","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Barbara","family":"Benoit","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Andrew","family":"Cagan","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Bhaswati","family":"Ghosh","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Sergey","family":"Goryachev","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Reeta","family":"Metta","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9481-3288","authenticated-orcid":false,"given":"Heekyong","family":"Park","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"David","family":"Wang","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Michael","family":"Mendis","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Martin","family":"Rees","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Christopher","family":"Herrick","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"}]},{"given":"Shawn N","family":"Murphy","sequence":"additional","affiliation":[{"name":"Research Information Science and Computing, Mass General Brigham, Somerville, Massachusetts, USA"},{"name":"Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA"}]}],"member":"286","published-online":{"date-parts":[[2021,11,28]]},"reference":[{"issue":"2","key":"2022031509043729300_ocab264-B1","doi-asserted-by":"crossref","first-page":"199","DOI":"10.11613\/BM.2014.022","article-title":"Observational and interventional study design types; an overview","volume":"24","author":"Thiese","year":"2014","journal-title":"Biochem Med (Zagreb)"},{"key":"2022031509043729300_ocab264-B2","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1016\/j.jclinepi.2015.09.016","article-title":"Million Veteran Program: a mega-biobank to study genetic influences on health and disease","volume":"70","author":"Gaziano","year":"2016","journal-title":"J Clin Epidemiol"},{"key":"2022031509043729300_ocab264-B3","doi-asserted-by":"crossref","first-page":"668","DOI":"10.1056\/NEJMsr1809937","article-title":"The \u201cAll of Us\u201d Research Program","volume":"381","year":"2019","journal-title":"N Engl J Med"},{"issue":"7726","key":"2022031509043729300_ocab264-B4","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1038\/s41586-018-0579-z","article-title":"The UK Biobank resource with deep phenotyping and genomic data","volume":"562","author":"Bycroft","year":"2018","journal-title":"Nature"},{"key":"2022031509043729300_ocab264-B5","author":"Oelsner"},{"issue":"10","key":"2022031509043729300_ocab264-B6","doi-asserted-by":"crossref","first-page":"1313","DOI":"10.1161\/01.CIR.0000157730.94423.4B","article-title":"Ethnic differences in coronary calcification: the Multi-Ethnic Study of Atherosclerosis (MESA)","volume":"111","author":"Bild","year":"2005","journal-title":"Circulation"},{"issue":"1","key":"2022031509043729300_ocab264-B7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/14397595.2019.1660028","article-title":"A large observational cohort study of rheumatoid arthritis, IORRA: providing context for today\u2019s treatment options","volume":"30","author":"Yamanaka","year":"2020","journal-title":"Mod Rheumatol"},{"issue":"2","key":"2022031509043729300_ocab264-B8","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1136\/jamia.2009.000893","article-title":"Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2)","volume":"17","author":"Murphy","year":"2010","journal-title":"J Am Med Inform Assoc"},{"key":"2022031509043729300_ocab264-B9","doi-asserted-by":"crossref","first-page":"160018","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR Guiding Principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Sci Data"},{"issue":"1","key":"2022031509043729300_ocab264-B10","doi-asserted-by":"crossref","first-page":"2","DOI":"10.3390\/jpm6010002","article-title":"Building the partners healthcare biobank at partners personalized medicine: informed consent, return of research results, recruitment lessons and operational considerations","volume":"6","author":"Karlson","year":"2016","journal-title":"J Pers Med"},{"issue":"2","key":"2022031509043729300_ocab264-B11","doi-asserted-by":"crossref","first-page":"17","DOI":"10.3390\/jpm6020017","article-title":"Implementation of electronic consent at a Biobank: an opportunity for precision medicine research","volume":"6","author":"Boutin","year":"2016","journal-title":"J Pers Med"},{"key":"2022031509043729300_ocab264-B12","year":"2021"},{"key":"2022031509043729300_ocab264-B13","volume-title":"The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling","author":"Kimball","year":"2011"},{"issue":"1","key":"2022031509043729300_ocab264-B14","doi-asserted-by":"crossref","first-page":"6","DOI":"10.3390\/jpm6010006","article-title":"The information technology infrastructure for the translational genomics core and the Partners Biobank at Partners Personalized Medicine","volume":"6","author":"Boutin","year":"2016","journal-title":"J Pers Med"},{"key":"2022031509043729300_ocab264-B15","first-page":"574","article-title":"Observational health data sciences and informatics (OHDSI): opportunities for observational researchers","volume":"216","author":"Hripcsak","year":"2015","journal-title":"Stud Health Technol Inform"},{"issue":"2","key":"2022031509043729300_ocab264-B16","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1016\/j.jbi.2008.08.010","article-title":"Research electronic data capture (REDCap)\u2014a metadata-driven methodology and workflow process for providing translational research informatics support","volume":"42","author":"Harris","year":"2009","journal-title":"J Biomed Inform"},{"key":"2022031509043729300_ocab264-B17","year":"2021"},{"issue":"1","key":"2022031509043729300_ocab264-B18","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1136\/amiajnl-2012-001145","article-title":"Next-generation phenotyping of electronic health records","volume":"20","author":"Hripcsak","year":"2013","journal-title":"J Am Med Inform Assoc"},{"issue":"12","key":"2022031509043729300_ocab264-B19","doi-asserted-by":"crossref","first-page":"3426","DOI":"10.1038\/s41596-019-0227-6","article-title":"High-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP)","volume":"14","author":"Zhang","year":"2019","journal-title":"Nat Protoc"},{"key":"2022031509043729300_ocab264-B20","doi-asserted-by":"crossref","first-page":"204","DOI":"10.1007\/978-3-030-01201-4_22","volume-title":"Or 2.0 Context-Aware Operating Theaters, Computer Assisted Robotic Endoscopy, Clinical Image-Based Procedures, and Skin Image Analysis","author":"Bridge","year":"2018"},{"issue":"2","key":"2022031509043729300_ocab264-B21","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1148\/radiol.2020201640","article-title":"Population-scale CT-based body composition analysis of a large outpatient population using deep learning to derive age-, sex-, and race-specific reference curves","volume":"298","author":"Magudia","year":"2021","journal-title":"Radiology"},{"key":"2022031509043729300_ocab264-B22","year":"2021"},{"key":"2022031509043729300_ocab264-B23","year":"2021"},{"issue":"3","key":"2022031509043729300_ocab264-B24","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1136\/jamia.1998.0050276","article-title":"Development of the Logical Observation Identifier Names and Codes (LOINC) vocabulary","volume":"5","author":"Huff","year":"1998","journal-title":"J Am Med Inform Assoc"},{"key":"2022031509043729300_ocab264-B25","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1109\/MITP.2005.122","article-title":"RxNorm: prescription for electronic drug information exchange","volume":"7","author":"Liu","year":"2005","journal-title":"IT Prof"},{"key":"2022031509043729300_ocab264-B26","doi-asserted-by":"crossref","first-page":"92S","DOI":"10.1177\/1077558703256726","article-title":"Pharmacy data in the VA health care system","volume":"60 (3 Suppl","author":"Smith","year":"2003","journal-title":"Med Care Res Rev"},{"issue":"4","key":"2022031509043729300_ocab264-B27","doi-asserted-by":"crossref","first-page":"e14325","DOI":"10.2196\/14325","article-title":"Mapping ICD-10 and ICD-10-CM codes to phecodes: workflow development and initial evaluation","volume":"7","author":"Wu","year":"2019","journal-title":"JMIR Med Inform"},{"key":"2022031509043729300_ocab264-B28","year":"2021"},{"key":"2022031509043729300_ocab264-B29","first-page":"279","article-title":"SNOMED-CT: The advanced terminology and coding system for eHealth","volume":"121","author":"Donnelly","year":"2006","journal-title":"Stud Health Technol Inform"},{"key":"2022031509043729300_ocab264-B30","author":"Hong","year":"2021"},{"issue":"10","key":"2022031509043729300_ocab264-B31","doi-asserted-by":"crossref","first-page":"1593","DOI":"10.1093\/jamia\/ocaa180","article-title":"Representation of EHR data for predictive modeling: a comparison between UMLS and other terminologies","volume":"27","author":"Rasmy","year":"2020","journal-title":"J Am Med Inform Assoc"},{"key":"2022031509043729300_ocab264-B32","year":"2018"},{"issue":"1","key":"2022031509043729300_ocab264-B33","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1097\/TA.0000000000002647","article-title":"Identification of a new genetic variant associated with cholecystitis: a multicenter genome-wide association study","volume":"89","author":"Bonde","year":"2020","journal-title":"J Trauma Acute Care Surg"},{"issue":"12","key":"2022031509043729300_ocab264-B34","doi-asserted-by":"crossref","first-page":"1005","DOI":"10.1016\/j.biopsych.2017.12.004","article-title":"Genome-wide association study of dimensional psychopathology using electronic health records","volume":"83","author":"McCoy","year":"2018","journal-title":"Biol Psychiatry"},{"issue":"10","key":"2022031509043729300_ocab264-B35","doi-asserted-by":"crossref","first-page":"846","DOI":"10.1176\/appi.ajp.2019.18091085","article-title":"Penetrance and pleiotropy of polygenic risk scores for Schizophrenia in 106,160 patients across four health care systems","volume":"176","author":"Zheutlin","year":"2019","journal-title":"Am J Psychiatry"},{"issue":"1","key":"2022031509043729300_ocab264-B36","doi-asserted-by":"crossref","first-page":"19959","DOI":"10.1038\/s41598-021-98719-w","article-title":"An independently validated, portable algorithm for the rapid identification of COPD patients using electronic health records","volume":"11","author":"Chu","year":"2021","journal-title":"Sci Rep"},{"key":"2022031509043729300_ocab264-B37","doi-asserted-by":"crossref","DOI":"10.3899\/jrheum.210580","article-title":"Association of sinusitis and upper respiratory tract diseases with incident rheumatoid arthritis: a case-control study","author":"Kronzer","year":"2021","journal-title":"J Rheumatol"},{"key":"2022031509043729300_ocab264-B38","article-title":"Clinical validation, implementation, and reporting of polygenic risk scores for common diseases","author":"Vassy","year":"2021","journal-title":"Research Square Preprint"},{"key":"2022031509043729300_ocab264-B39","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1186\/1755-8794-4-13","article-title":"The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies","volume":"4","author":"McCarty","year":"2011","journal-title":"BMC Med Genomics"},{"issue":"24","key":"2022031509043729300_ocab264-B40","doi-asserted-by":"crossref","first-page":"2441","DOI":"10.1001\/jama.2021.7702","article-title":"Progress with the All of Us research program: opening access for researchers","volume":"325","author":"Ramirez","year":"2021","journal-title":"JAMA"},{"key":"2022031509043729300_ocab264-B41","volume-title":"The Book of OHDSI: Observational Health Data Sciences and Informatics","year":"2019"},{"issue":"4","key":"2022031509043729300_ocab264-B42","doi-asserted-by":"crossref","first-page":"562","DOI":"10.1093\/jamiaopen\/ooz050","article-title":"Implementing a hash-based privacy-preserving record linkage tool in the OneFlorida clinical research network","volume":"2","author":"Bian","year":"2019","journal-title":"JAMIA Open"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/29\/4\/643\/42897411\/ocab264.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/29\/4\/643\/42897411\/ocab264.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,12]],"date-time":"2023-11-12T18:52:39Z","timestamp":1699815159000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/29\/4\/643\/6445134"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,28]]},"references-count":42,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2021,11,28]]},"published-print":{"date-parts":[[2022,3,15]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocab264","relation":{},"ISSN":["1527-974X"],"issn-type":[{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,4,1]]},"published":{"date-parts":[[2021,11,28]]}}}