{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,3]],"date-time":"2026-05-03T23:47:59Z","timestamp":1777852079787,"version":"3.51.4"},"reference-count":31,"publisher":"SAGE Publications","issue":"4","license":[{"start":{"date-parts":[[2016,5,19]],"date-time":"2016-05-19T00:00:00Z","timestamp":1463616000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Health Informatics J"],"published-print":{"date-parts":[[2017,12]]},"abstract":"<jats:p>A health record database contains structured data fields that identify the patient, such as patient ID, patient name, e-mail and phone number. These data are fairly easy to de-identify, that is, replace with other identifiers. However, these data also occur in fields with doctors\u2019 free-text notes written in an abbreviated style that cannot be analyzed grammatically. If we replace a word that looks like a name, but isn\u2019t, we degrade readability and medical correctness. If we fail to replace it when we should, we degrade confidentiality. We de-identified an existing Danish electronic health record database, ending up with 323,122 patient health records. We had to invent many methods for de-identifying potential identifiers in the free-text notes. The de-identified health records should be used with caution for statistical purposes because we removed health records that were so special that they couldn\u2019t be de-identified. Furthermore, we distorted geography by replacing zip codes with random zip codes.<\/jats:p>","DOI":"10.1177\/1460458216647760","type":"journal-article","created":{"date-parts":[[2016,5,20]],"date-time":"2016-05-20T05:45:16Z","timestamp":1463723116000},"page":"291-303","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":11,"title":["Preserving medical correctness, readability and consistency in de-identified health records"],"prefix":"10.1177","volume":"23","author":[{"given":"Kostas","family":"Pantazos","sequence":"first","affiliation":[{"name":"IT-University of Copenhagen, Denmark"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Soren","family":"Lauesen","sequence":"additional","affiliation":[{"name":"IT-University of Copenhagen, Denmark"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Soren","family":"Lippert","sequence":"additional","affiliation":[{"name":"IT-University of Copenhagen, Denmark"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2016,5,19]]},"reference":[{"key":"bibr1-1460458216647760","unstructured":"ISO\/TR 20514:2005. Health informatics\u2014electronic health record definition, scope and context."},{"key":"bibr2-1460458216647760","unstructured":"Sweeney L. Replacing personally-identifying information in medical records, the scrub system. In: Proceedings of the AMIA annual fall symposium, Washington, DC, 1996, p. 333. Bethesda, MD: American Medical Informatics Association. Available at: http:\/\/dataprivacylab.org\/projects\/scrub\/paper1.pdf"},{"key":"bibr3-1460458216647760","first-page":"729","volume-title":"Proceedings of the AMIA symposium","author":"Ruch P"},{"key":"bibr4-1460458216647760","first-page":"777","volume-title":"Proceedings of the AMIA symposium","author":"Thomas SM"},{"issue":"6","key":"bibr5-1460458216647760","doi-asserted-by":"crossref","first-page":"680","DOI":"10.5858\/2003-127-680-CMDS","volume":"127","author":"Berman JJ","year":"2003","journal-title":"Arch Pathol Lab Med"},{"key":"bibr6-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1309\/E6K33GBPE5C27FYU"},{"key":"bibr7-1460458216647760","first-page":"329","volume-title":"Proceedings of the 16th Nordic conference of computational linguistics","author":"Kokkinakis D"},{"key":"bibr8-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1197\/jamia.M2441"},{"key":"bibr9-1460458216647760","first-page":"735","volume-title":"Medical Informatics in a United and Healthy Europe","volume":"150","author":"Grouin C","year":"2009"},{"key":"bibr10-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijmedinf.2009.04.005"},{"key":"bibr11-1460458216647760","volume-title":"AAAI fall symposium series","author":"Dalianis H"},{"key":"bibr12-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijmedinf.2013.03.005"},{"key":"bibr13-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1186\/1472-6947-11-53"},{"key":"bibr14-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-14-S17-A5"},{"key":"bibr15-1460458216647760","first-page":"132","volume":"2012","author":"Gordon JS","year":"2012","journal-title":"Nurs Inform"},{"key":"bibr16-1460458216647760","first-page":"862","volume":"169","author":"Pantazos K","year":"2011","journal-title":"Stud Health Technol Inform"},{"key":"bibr17-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1186\/1472-6947-6-12"},{"key":"bibr18-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1197\/jamia.M2444"},{"key":"bibr19-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijmedinf.2010.09.007"},{"key":"bibr20-1460458216647760","doi-asserted-by":"crossref","unstructured":"Susilo W, Win K. Security and access of health research data. J Med Syst 2007; 31(2): 103\u2013107, http:\/\/dx.doi.org\/10.1007\/s10916-006-9035-y (accessed August 2010).","DOI":"10.1007\/s10916-006-9035-y"},{"key":"bibr21-1460458216647760","doi-asserted-by":"crossref","unstructured":"Huang LC, Chu HC, Lien CY, et al. Embedding a hiding function in a portable electronic health record for privacy preservation. J Med Syst 2010; 34(3): 313\u2013320, http:\/\/dx.doi.org\/10.1007\/s10916-008-9243-8 (accessed August 2010).","DOI":"10.1007\/s10916-008-9243-8"},{"key":"bibr22-1460458216647760","volume-title":"Proceedings of the HelsIT\u201904 conference","author":"Tveit A"},{"key":"bibr23-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2288-10-70"},{"key":"bibr24-1460458216647760","doi-asserted-by":"publisher","DOI":"10.1186\/2041-1480-1-6"},{"key":"bibr25-1460458216647760","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.8.4.e28"},{"key":"bibr26-1460458216647760","unstructured":"Danmarks Statistik, http:\/\/www.dst.dk\/HomeUK\/Statistics\/Names.aspx (accessed August 2010)."},{"key":"bibr27-1460458216647760","unstructured":"Copenhagen University, http:\/\/danskernesnavne.navneforskning.ku.dk\/TopNavne.asp (accessed August 2010)."},{"key":"bibr28-1460458216647760","unstructured":"Post Denmark, http:\/\/www.postdanmark.dk\/da\/Privat\/Kundeservice\/postnummerkort\/Sider\/postnummerkort.aspx (accessed May 2016)."},{"key":"bibr29-1460458216647760","unstructured":"Sygehusvalg, http:\/\/www.sygehusvalg.dk\/ (accessed August 2010)."},{"key":"bibr30-1460458216647760","unstructured":"Who Named It, http:\/\/www.whonamedit.com\/azeponyms.cfm\/ (accessed August 2010)."},{"key":"bibr31-1460458216647760","unstructured":"HIPAA. HIPAA privacy rules and public health, http:\/\/www.cdc.gov\/mmwr\/preview\/mmwrhtml\/m2e411a1.htm (accessed August 2010)."}],"container-title":["Health Informatics Journal"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1460458216647760","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1460458216647760","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1460458216647760","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T22:30:10Z","timestamp":1777501810000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1460458216647760"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,5,19]]},"references-count":31,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2017,12]]}},"alternative-id":["10.1177\/1460458216647760"],"URL":"https:\/\/doi.org\/10.1177\/1460458216647760","relation":{},"ISSN":["1460-4582","1741-2811"],"issn-type":[{"value":"1460-4582","type":"print"},{"value":"1741-2811","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,5,19]]}}}