{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T18:23:20Z","timestamp":1778351000109,"version":"3.51.4"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2017,8,30]],"date-time":"2017-08-30T00:00:00Z","timestamp":1504051200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Objective<\/jats:title>\n                  <jats:p>Our objective is to create a source of synthetic electronic health records that is readily available; suited to industrial, innovation, research, and educational uses; and free of legal, privacy, security, and intellectual property restrictions.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Materials and Methods<\/jats:title>\n                  <jats:p>We developed Synthea, an open-source software package that simulates the lifespans of synthetic patients, modeling the 10 most frequent reasons for primary care encounters and the 10 chronic conditions with the highest morbidity in the United States.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Synthea adheres to a previously developed conceptual framework, scales via open-source deployment on the Internet, and may be extended with additional disease and treatment modules developed by its user community. 
One million synthetic patient records are now freely available online, encoded in standard formats (eg, Health Level-7 [HL7] Fast Healthcare Interoperability Resources [FHIR] and Consolidated-Clinical Document Architecture), and accessible through an HL7 FHIR application program interface.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Discussion<\/jats:title>\n                  <jats:p>Health care lags other industries in information technology, data exchange, and interoperability. The lack of freely distributable health records has long hindered innovation in health care. Approaches and tools are available to inexpensively generate synthetic health records at scale without accidental disclosure risk, lowering current barriers to entry for promising early-stage developments. By engaging a growing community of users, the synthetic data generated will become increasingly comprehensive, detailed, and realistic over time.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Conclusion<\/jats:title>\n                  <jats:p>Synthetic patients can be simulated with models of disease progression and corresponding standards of care to produce risk-free realistic synthetic health care records at scale.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/jamia\/ocx079","type":"journal-article","created":{"date-parts":[[2017,7,5]],"date-time":"2017-07-05T19:12:43Z","timestamp":1499281963000},"page":"230-238","source":"Crossref","is-referenced-by-count":310,"title":["Synthea: An approach, method, and software mechanism for generating synthetic patients and the synthetic electronic health care record"],"prefix":"10.1093","volume":"25","author":[{"given":"Jason","family":"Walonoski","sequence":"first","affiliation":[{"name":"The MITRE Corporation, Bedford, MA, USA"}]},{"given":"Mark","family":"Kramer","sequence":"additional","affiliation":[{"name":"The MITRE Corporation, Bedford, 
MA, USA"}]},{"given":"Joseph","family":"Nichols","sequence":"additional","affiliation":[{"name":"The MITRE Corporation, Bedford, MA, USA"}]},{"given":"Andre","family":"Quina","sequence":"additional","affiliation":[{"name":"The MITRE Corporation, Bedford, MA, USA"}]},{"given":"Chris","family":"Moesel","sequence":"additional","affiliation":[{"name":"The MITRE Corporation, Bedford, MA, USA"}]},{"given":"Dylan","family":"Hall","sequence":"additional","affiliation":[{"name":"The MITRE Corporation, Bedford, MA, USA"}]},{"given":"Carlton","family":"Duffett","sequence":"additional","affiliation":[{"name":"The MITRE Corporation, Bedford, MA, USA"}]},{"given":"Kudakwashe","family":"Dube","sequence":"additional","affiliation":[{"name":"HIKER Group, Massey University, Palmerston North, New Zealand"}]},{"given":"Thomas","family":"Gallagher","sequence":"additional","affiliation":[{"name":"Department of Applied Computing and Engineering Technology, University of Montana, Missoula, MT, USA"}]},{"given":"Scott","family":"McLachlan","sequence":"additional","affiliation":[{"name":"HIKER Group, Massey University, Palmerston North, New Zealand"}]}],"member":"286","published-online":{"date-parts":[[2017,8,30]]},"reference":[{"issue":"1","key":"2020110612383599300_ocx079-B1","doi-asserted-by":"crossref","DOI":"10.5210\/ojphi.v1i1.2720","article-title":"Construction and Validation of Synthetic Electronic Medical Records","volume":"1","author":"Moniz","year":"2009","journal-title":"Online J Public Health Inform."},{"key":"2020110612383599300_ocx079-B2","doi-asserted-by":"crossref","DOI":"10.1109\/ICDM.2013.89","volume-title":"Cox Regression with Correlation Based Regularization for Electronic Health Records","author":"Vinzamuri","year":"2013"},{"key":"2020110612383599300_ocx079-B3","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-40994-3_35","article-title":"Forest-based point process for event prediction from electronic health records","volume-title":"European Conference on 
Machine Learning and Principles and Practice of Knowledge Discovery in Databases","author":"Weiss","year":"2013"},{"key":"2020110612383599300_ocx079-B4","article-title":"From EHR to Healthcare App Platform","volume-title":"Information Week: Healthcare","author":"Braunstein","year":"2014"},{"key":"2020110612383599300_ocx079-B5","doi-asserted-by":"crossref","volume-title":"Identifying Participants in the Personal Genome Project by Name","author":"Sweeney","DOI":"10.2139\/ssrn.2257732"},{"key":"2020110612383599300_ocx079-B6","first-page":"348","article-title":"The NHS\u2019 care.data scheme: What are the risks to privacy?","author":"Hoeksma","year":"2014","journal-title":"Brit Med J."},{"key":"2020110612383599300_ocx079-B7","article-title":"This little-known firm is getting rich off your medical data","author":"Tanner","year":"2016","journal-title":"Fortune"},{"key":"2020110612383599300_ocx079-B8","article-title":"Doctors selling medical records","author":"Frenkel","journal-title":"Herald Sun"},{"key":"2020110612383599300_ocx079-B9","article-title":"Personal health data is for sale","author":"Peel","journal-title":"Health Privacy Summit."},{"issue":"2","key":"2020110612383599300_ocx079-B10","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1136\/amiajnl-2013-001847","article-title":"Exploiting the potential of large databases of electronic health records for research using rapid search algorithms and an intuitive query interface","volume":"21","author":"Tate","year":"2014","journal-title":"J Am Inform Assoc."},{"issue":"4","key":"2020110612383599300_ocx079-B11","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1108\/17459265200600060","article-title":"Under threat: patient confidentiality and NHS computing","volume":"6","author":"Ross","year":"2006","journal-title":"Drugs and Alcohol 
Today"},{"key":"2020110612383599300_ocx079-B12","year":"2010"},{"issue":"12","key":"2020110612383599300_ocx079-B13","doi-asserted-by":"crossref","first-page":"e28071","DOI":"10.1371\/journal.pone.0028071","article-title":"A systematic review of re-identification attacks on health data","volume":"6","author":"El Emam","year":"2011","journal-title":"PLOS One"},{"key":"2020110612383599300_ocx079-B14","first-page":"6117","article-title":"Identifying personal genomes by surname inference","volume":"339","author":"Gymrek","year":"2013","journal-title":"Science."},{"key":"2020110612383599300_ocx079-B15","article-title":"Using the CareMap with health incidents statistics for generating the realistic synthetic electronic healthcare record.","author":"McLachlan","journal-title":"2016 IEEE International Conference on Healthcare Informatics (ICHI)"},{"key":"2020110612383599300_ocx079-B16","article-title":"Encrypted private medical records released by Department of Health are vulnerable","author":"Dunlevy","year":"2016","journal-title":"News Limited"},{"key":"2020110612383599300_ocx079-B17","article-title":"Millions of Australians caught in data breach","author":"Middleton","year":"2016","journal-title":"The Saturday Paper: Monthly"},{"key":"2020110612383599300_ocx079-B18","article-title":"Medicare data breach prompts law change","author":"Rollins","year":"2016","journal-title":"Australian Med Online J."},{"key":"2020110612383599300_ocx079-B19","first-page":"5","article-title":"Policy by procrastination: Secondary use of electronic health records for health research purposes","volume":"2","author":"Kosseim","year":"2008","journal-title":"McGill JL Health."},{"key":"2020110612383599300_ocx079-B20","article-title":"Realism in Synthetic Data Generation","author":"McLachlan","year":"2016"},{"key":"2020110612383599300_ocx079-B21","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1186\/1471-2105-7-43","article-title":"SynTReN: a generator of synthetic gene expression data 
for design and analysis of structure learning algorithms","volume":"7","author":"Van den Bulcke","year":"2006","journal-title":"BMC Bioinformatics."},{"key":"2020110612383599300_ocx079-B22","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1007\/s004290100201","article-title":"Computer generation and quantitative morphometric analysis of virtual neurons","volume":"204","author":"Ascoli","year":"2001","journal-title":"Anat Embryol."},{"key":"2020110612383599300_ocx079-B23","first-page":"1855","article-title":"An evaluation of two methods for generating synthetic HL7 segments reflecting real-world health information exchange transactions","volume":"2014","author":"Mwogi","year":"2014","journal-title":"AMIA Annual Symp Proc."},{"key":"2020110612383599300_ocx079-B24","article-title":"A Methodology to Generate Virtual Patient Repositories","author":"Uri","year":"2016"},{"key":"2020110612383599300_ocx079-B25","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1186\/1472-6947-10-59","article-title":"Data-driven approach for creating synthetic electronic medical records","volume":"10","author":"Buczak","year":"2010","journal-title":"BMC Med Inform Decis Mak."},{"key":"2020110612383599300_ocx079-B26","article-title":"Introducing a New Clinical Data Paradigm","author":"MDClone","year":"2016"},{"key":"2020110612383599300_ocx079-B27","article-title":"Generating Multi-label Discrete Electronic Health Records using Generative Adversarial Networks","author":"Choi"},{"key":"2020110612383599300_ocx079-B28","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1002\/(SICI)1097-4571(199004)41:3<223::AID-ASI14>3.0.CO;2-3","article-title":"Peer review and the changing research record","volume":"41","author":"Crawford","year":"1990","journal-title":"J Am Soc Inf Sci."},{"key":"2020110612383599300_ocx079-B29","volume-title":"Research Design: Qualitative, Quantitative, and Mixed Method 
Approaches","author":"Creswell","year":"2003"},{"key":"2020110612383599300_ocx079-B30","author":"Birkin"},{"key":"2020110612383599300_ocx079-B31","volume-title":"Changing Order: Replication and Induction in Scientific Practice","author":"Collins","year":"1985"},{"key":"2020110612383599300_ocx079-B32","author":"Stodden"},{"issue":"1","key":"2020110612383599300_ocx079-B33","article-title":"A review of feature selection methods on synthetic data","volume":"34","author":"Bolon-Canedo","year":"2013","journal-title":"Knowledge Inform Syst."},{"key":"2020110612383599300_ocx079-B34"},{"key":"2020110612383599300_ocx079-B35","volume-title":"Foundations of Health Information Engineering and Systems","author":"Dube","year":"2014"},{"key":"2020110612383599300_ocx079-B36"},{"key":"2020110612383599300_ocx079-B37"},{"key":"2020110612383599300_ocx079-B38"},{"key":"2020110612383599300_ocx079-B39"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/25\/3\/230\/34150150\/ocx079.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/25\/3\/230\/34150150\/ocx079.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,6]],"date-time":"2020-11-06T18:00:50Z","timestamp":1604685650000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/25\/3\/230\/4098271"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,8,30]]},"references-count":39,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2017,8,30]]},"published-print":{"date-parts":[[2018,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocx079","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic
"}],"subject":[],"published-other":{"date-parts":[[2018,3]]},"published":{"date-parts":[[2017,8,30]]}}}