{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T13:37:51Z","timestamp":1776778671292,"version":"3.51.2"},"reference-count":46,"publisher":"Public Library of Science (PLoS)","issue":"9","license":[{"start":{"date-parts":[[2025,9,23]],"date-time":"2025-09-23T00:00:00Z","timestamp":1758585600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000865","name":"Bill and Melinda Gates Foundation","doi-asserted-by":"publisher","award":["INV-037558"],"award-info":[{"award-number":["INV-037558"]}],"id":[{"id":"10.13039\/100000865","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Structured patient data generated within the health data ecosystem are shared both internally for operational use and also externally for research and public health benefit. Protecting individual privacy and health data confidentiality in these contexts relies on data de-identification and anonymisation, although there are no universally accepted standards for these processes and the techniques involved can be technically complex. We present practical recommendations grounded in the principle of data minimisation\u2014avoiding unnecessary granularity and identifying variables that could lead to re-identification when combined with other datasets. We provide practical guidance for anonymising and perturbing structured health data in ways that support compliance with data protection laws, describing technical and operational methods for reducing re-identification risk that include rounding numerical values, replacing precise values with ranges, adding jitter to numeric fields, aggregating data, management of date values and separating sensitive fields from identifying data to prevent linkage leading to re-identification. While some methods require advanced technical knowledge, we focus here on accessible strategies that can be implemented without specialist expertise, recognising the importance of the legal and governance frameworks in which anonymisation occurs. These guidelines support researchers, data managers and institutions in sharing health data responsibly, maintaining data utility while upholding privacy and promoting ethical and legal data stewardship for data-driven health research.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1013507","type":"journal-article","created":{"date-parts":[[2025,9,23]],"date-time":"2025-09-23T17:31:17Z","timestamp":1758648677000},"page":"e1013507","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":2,"title":["Ten quick tips for protecting health data using de-identification and perturbation of structured datasets"],"prefix":"10.1371","volume":"21","author":[{"given":"Tshikala Eddie","family":"Lulamba","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2857-7197","authenticated-orcid":true,"given":"Themba","family":"Mutemaringa","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5083-2735","authenticated-orcid":true,"given":"Nicki","family":"Tiffin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"340","published-online":{"date-parts":[[2025,9,23]]},"reference":[{"issue":"7","key":"pcbi.1013507.ref001","doi-asserted-by":"crossref","first-page":"646","DOI":"10.1038\/s41588-020-0651-0","article-title":"Privacy challenges and research opportunities for genomic data sharing","volume":"52","author":"L Bonomi","year":"2020","journal-title":"Nat Genet"},{"key":"pcbi.1013507.ref002","unstructured":"World Health Organisation. Sharing and reuse of health-related data for research purposes: WHO policy and implementation guidance. 2022. Available from: https:\/\/iris.who.int\/bitstream\/handle\/10665\/352859\/9789240044968-eng.pdf?sequence=1"},{"key":"pcbi.1013507.ref003","article-title":"Identifying participants in the personal genome project by name (A re-identification experiment)","author":"L Sweeney","year":"2013","journal-title":"arXiv"},{"key":"pcbi.1013507.ref004","volume-title":"Simple demographics often identify people uniquely","author":"L Sweeney","year":"2000"},{"key":"pcbi.1013507.ref005","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1016\/j.ins.2022.05.040","article-title":"Data anonymization evaluation for big data and IoT environment","volume":"605","author":"C Ni","year":"2022","journal-title":"Inf Sci"},{"key":"pcbi.1013507.ref006","article-title":"UW medicine faces class action lawsuit over 974,000-record data breach. In:","author":"S Alder","year":"2020","journal-title":"The HIPAA Journal [Internet]"},{"issue":"1","key":"pcbi.1013507.ref007","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1136\/jamia.2010.004622","article-title":"Never too old for anonymity: a statistical standard for demographic data sharing via the HIPAA Privacy Rule","volume":"18","author":"B Malin","year":"2011","journal-title":"J Am Med Inform Assoc"},{"key":"pcbi.1013507.ref008","unstructured":"Standards for privacy of individually identifiable health information. In: Federal Register [Internet]. 14 Aug 2002 [cited 24 Mar 2025]. Available from: https:\/\/www.federalregister.gov\/documents\/2002\/08\/14\/02-20554\/standards-for-privacy-of-individually-identifiable-health-information"},{"key":"pcbi.1013507.ref009","first-page":"593","article-title":"Shades of gray: seeing the full spectrum of practical data de-identification","volume":"56","author":"K Finch","year":"2016","journal-title":"Santa Clara Law Rev"},{"key":"pcbi.1013507.ref010","article-title":"Written informed consent and selection bias in observational studies using medical records: systematic review","volume":"338","author":"ME Kho","year":"2009","journal-title":"BMJ"},{"key":"pcbi.1013507.ref011","unstructured":"UK Health Data Research Alliance. Trusted Research Environments (TRE): a strategy to build public trust and meet changing health data science needs. 2023. Available from: https:\/\/ukhealthdata.org\/wp-content\/uploads\/2020\/07\/200723-Alliance-Board_Paper-E_TRE-Green-Paper.pdf"},{"key":"pcbi.1013507.ref012","unstructured":"About Health Level Seven International | HL7 International. [cited 24 Mar 2025]. Available from: https:\/\/www.hl7.org\/about\/index.cfm?ref=nav"},{"issue":"2","key":"pcbi.1013507.ref013","first-page":"1143","article-title":"Data centre profile: the provincial health data centre of the Western Cape province, South Africa","volume":"4","author":"A Boulle","year":"2019","journal-title":"Int J Popul Data Sci"},{"key":"pcbi.1013507.ref014","author":"E McCallister","year":"2010","journal-title":"Guide to protecting the confidentiality of Personally Identifiable Information (PII)"},{"key":"pcbi.1013507.ref015","author":"A Burt","year":"2021"},{"key":"pcbi.1013507.ref016","unstructured":"European Medicines Agency, London. Data anonymisation: a key enabler for clinical data sharing\u2014Workshop report. 2021. Available from: https:\/\/www.ema.europa.eu\/en\/documents\/report\/report-data-anonymisation-key-enabler-clinical-data-sharing_en.pdf?utm_source=chatgpt.com"},{"key":"pcbi.1013507.ref017","author":"European Data Protection Board"},{"key":"pcbi.1013507.ref018","author":"S Garfinkel","year":"2023"},{"key":"pcbi.1013507.ref019","author":"Information Commissioner\u2019s Office","year":"2012"},{"key":"pcbi.1013507.ref020","year":"2016"},{"key":"pcbi.1013507.ref021","doi-asserted-by":"crossref","DOI":"10.1201\/b14764","author":"KE Emam","year":"2013","journal-title":"Guide to the de-identification of personal health information"},{"key":"pcbi.1013507.ref022","article-title":"Concepts and methods for de-identifying clinical trial data.","volume-title":"Sharing clinical trial data: maximizing benefits, minimizing risk","author":"K Emam","year":"2015"},{"key":"pcbi.1013507.ref023","author":"PDPC Singapore, SG Digital","year":"2024"},{"key":"pcbi.1013507.ref024","unstructured":"Personal Data Protection Commission of Singapore. PDPC | Basic anonymisation. [cited 3 Apr 2025]. Available from: https:\/\/www.pdpc.gov.sg\/help-and-resources\/2018\/01\/basic-anonymisation"},{"key":"pcbi.1013507.ref025","author":"SL Garfinkel","year":"2015"},{"issue":"4","key":"pcbi.1013507.ref026","first-page":"307","article-title":"Evaluating the risk of re-identification of patients from hospital prescription records","volume":"62","author":"KE Emam","year":"2009","journal-title":"Can J Hosp Pharm"},{"key":"pcbi.1013507.ref027","volume-title":"Global tables of data privacy laws and bills","author":"G Greenleaf","year":"2017","edition":"5"},{"issue":"1","key":"pcbi.1013507.ref028","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1186\/s12961-024-01230-7","article-title":"Data protection legislation in Africa and pathways for enhancing compliance in big data health research","volume":"22","author":"NS Munung","year":"2024","journal-title":"Health Res Policy Syst"},{"key":"pcbi.1013507.ref029","unstructured":"General Data Protection Regulation (GDPR). Official legal text. In: General Data Protection Regulation (GDPR) [Internet]. [cited 19 Nov 2023]. Available from: https:\/\/gdpr-info.eu\/"},{"key":"pcbi.1013507.ref030","author":"Information Regulator South Africa","year":"2013"},{"key":"pcbi.1013507.ref031","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1007\/978-3-030-15745-6_7","article-title":"The san code of research ethics.","author":"D Schroeder","year":"2019","journal-title":"Equitable research partnerships: a global code of conduct to counter ethics dumping"},{"key":"pcbi.1013507.ref032","unstructured":"South African San Institute. San code of research; 2019. Available from: http:\/\/trust-project.eu\/wp-content\/uploads\/2017\/03\/San-Code-of-RESEARCH-Ethics-Booklet-final.pdf"},{"key":"pcbi.1013507.ref033","author":"M Hudson","year":"2010"},{"issue":"10","key":"pcbi.1013507.ref034","doi-asserted-by":"crossref","DOI":"10.1136\/bmjgh-2023-013092","article-title":"Multiple modes of data sharing can facilitate secondary use of sensitive health data for research","volume":"8","author":"T Tamuhla","year":"2023","journal-title":"BMJ Glob Health"},{"issue":"5","key":"pcbi.1013507.ref035","doi-asserted-by":"crossref","first-page":"670","DOI":"10.1197\/jamia.M3144","article-title":"A globally optimal k-anonymity method for the de-identification of health data","volume":"16","author":"K El Emam","year":"2009","journal-title":"J Am Med Inform Assoc"},{"issue":"10","key":"pcbi.1013507.ref036","doi-asserted-by":"crossref","DOI":"10.1136\/bmjgh-2024-016474","article-title":"The PHA4GE Microbial Data-Sharing Accord: establishing baseline consensus microbial data-sharing norms to facilitate cross-sectoral collaboration","volume":"9","author":"EJ Griffiths","year":"2024","journal-title":"BMJ Glob Health"},{"key":"pcbi.1013507.ref037","article-title":"The separation principle.","author":"Australian Government National Statistical Service","year":"2013"},{"key":"pcbi.1013507.ref038","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1186\/s12967-015-0545-6","article-title":"A workflow-driven approach to integrate generic software modules in a Trusted Third Party","volume":"13","author":"M Bialke","year":"2015","journal-title":"J Transl Med"},{"key":"pcbi.1013507.ref039","unstructured":"Data Linkage Services Western Australia. In: Data linkage services WA [Internet]. [cited 3 Apr 2025]. Available from: https:\/\/www.datalinkageservices.health.wa.gov.au\/"},{"key":"pcbi.1013507.ref040","unstructured":"Government of Western Australia Department of Health. WA health data linkage strategy 2022\u20132024. Available from: https:\/\/www.datalinkageservices.health.wa.gov.au\/wp-content\/uploads\/2023\/05\/Data-Linkage-Strategy-2022-2024.pd"},{"key":"pcbi.1013507.ref041","doi-asserted-by":"crossref","first-page":"984807","DOI":"10.3389\/fbinf.2022.984807","article-title":"Algorithms to anonymize structured medical and healthcare data: a systematic review","volume":"2","author":"A Sepas","year":"2022","journal-title":"Front Bioinform"},{"issue":"7","key":"pcbi.1013507.ref042","doi-asserted-by":"crossref","first-page":"1277","DOI":"10.1002\/spe.2812","article-title":"Flexible data anonymization using ARX\u2014current status and challenges ahead","volume":"50","author":"F Prasser","year":"2020","journal-title":"Softw Pract Exp"},{"issue":"4","key":"pcbi.1013507.ref043","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v067.i04","article-title":"Statistical disclosure control for micro-data using the R Package sdcMicro","volume":"67","author":"M Templ","year":"2015","journal-title":"J Stat Soft"},{"key":"pcbi.1013507.ref044","doi-asserted-by":"crossref","first-page":"160018","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR Guiding Principles for scientific data management and stewardship","volume":"3","author":"MD Wilkinson","year":"2016","journal-title":"Sci Data"},{"key":"pcbi.1013507.ref045","doi-asserted-by":"crossref","DOI":"10.5334\/dsj-2020-043","article-title":"The CARE principles for indigenous data governance","volume":"19","author":"SR Carroll","year":"2020","journal-title":"Data Sci J"},{"issue":"1","key":"pcbi.1013507.ref046","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1186\/s11568-014-0003-1","article-title":"Framework for responsible sharing of genomic and health-related data","volume":"8","author":"BM Knoppers","year":"2014","journal-title":"Hugo J"}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1013507","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,23]],"date-time":"2025-09-23T17:31:23Z","timestamp":1758648683000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1013507"}},"subtitle":[],"editor":[{"given":"Patricia M.","family":"Palagi","sequence":"first","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2025,9,23]]},"references-count":46,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2025,9,23]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1013507","relation":{},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,23]]}}}