{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,23]],"date-time":"2025-09-23T14:16:23Z","timestamp":1758636983645,"version":"3.44.0"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2025,9,11]],"date-time":"2025-09-11T00:00:00Z","timestamp":1757548800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Objectives<\/jats:title>\n                  <jats:p>Demonstrate the ability to encapsulate clinical-grade genomics data normalization algorithms within a FHIR Genomics reference implementation.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Background<\/jats:title>\n                  <jats:p>Variability in genomics data representation is a significant impediment to precise search, clinical decision support rule writing, variant annotation, and more. Such variability is problematic not just for genetic variants, but also applies to HLA alleles, phenotype codes, and more. Here, we provide an overview of genomic data variability and normalization algorithms, focusing on three key areas: genetic variants, HLA alleles, condition and medication variant annotations. We describe and demonstrate the strategies used in a public open source FHIR Genomics reference implementation.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Materials and Methods<\/jats:title>\n                  <jats:p>We developed a set of design considerations, which we used to weigh different normalization approaches. All data (ingested patient data, ingested knowledge, query parameters) are subjected to normalization. Variant normalization leverages the biocommons\/hgvs python package. HLA allele normalization leverages the py-ard python package. For variant annotation terminology variability (for conditions and medications), we leveraged FHIR-based ConceptMaps.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Algorithms for normalization of genetic variants and HLA alleles, and terminology translations, have been implemented and deployed in a public open source FHIR Genomics Operations reference implementation. All data and source code described in this report are located at https:\/\/github.com\/FHIR\/genomics-operations, and deployed at https:\/\/fhir-gen-ops.herokuapp.com\/. Every normalization strategy examined to date has known limitations.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Conclusion<\/jats:title>\n                  <jats:p>While we report on our experience successfully encapsulating genomic data normalization in FHIR Genomics Operations, the challenges and solutions identified are broadly applicable to many other contexts.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/jamia\/ocaf136","type":"journal-article","created":{"date-parts":[[2025,8,7]],"date-time":"2025-08-07T11:47:20Z","timestamp":1754567240000},"page":"1598-1608","source":"Crossref","is-referenced-by-count":1,"title":["Genetic data normalization for genomic medicine: a Fast Healthcare Interoperability Resources Genomics reference implementation"],"prefix":"10.1093","volume":"32","author":[{"given":"Robert H","family":"Dolin","sequence":"first","affiliation":[{"name":"Elimu Informatics , El Cerrito, CA 94530,","place":["United States"]}]},{"given":"Nicolae-Mihai","family":"Todor","sequence":"additional","affiliation":[{"name":"Independent Data Lab , Dublin D15 CD79,","place":["Ireland"]}]},{"given":"James","family":"Shalaby","sequence":"additional","affiliation":[{"name":"Elimu Informatics , El Cerrito, CA 94530,","place":["United States"]}]},{"given":"Huda","family":"Arsalan","sequence":"additional","affiliation":[{"name":"Elimu Informatics , El Cerrito, CA 94530,","place":["United States"]}]},{"given":"Eshani","family":"Shah","sequence":"additional","affiliation":[{"name":"Luddy School of Informatics, Computing, and Engineering, Indiana University , Indianapolis, IN 46202,","place":["United States"]}]},{"given":"Nedah","family":"Basravi","sequence":"additional","affiliation":[{"name":"College of Pharmacy, Touro University California , Vallejo, CA 94589,","place":["United States"]}]},{"given":"Ammar","family":"Husami","sequence":"additional","affiliation":[{"name":"Division of Human Genetics, Cincinnati Children\u2019s Hospital Medical Center , Cincinnati, OH 45229,","place":["United States"]},{"name":"Department of Pediatrics, University of Cincinnati College of Medicine , Cincinnati, OH 45219,","place":["United States"]}]},{"given":"Akash","family":"Rampersad","sequence":"additional","affiliation":[{"name":"Oak Bioinformatics , Fairfax, VA 22030,","place":["United States"]}]},{"given":"Bret S E","family":"Heale","sequence":"additional","affiliation":[{"name":"Humanized Health Consulting , Salt Lake City, UT 84102,","place":["United States"]}]},{"given":"Srikar","family":"Chamala","sequence":"additional","affiliation":[{"name":"Keck School of Medicine, Department of Pathology, University of Southern California , Los Angeles, CA 90033,","place":["United States"]},{"name":"Department of Pathology and Laboratory Medicine, Children\u2019s Hospital Los Angeles , Los Angeles, CA 90027,","place":["United States"]}]}],"member":"286","published-online":{"date-parts":[[2025,9,11]]},"reference":[{"key":"2025092208423535400_ocaf136-B1","doi-asserted-by":"publisher","first-page":"485","DOI":"10.1093\/jamia\/ocac246","article-title":"Introducing HL7 FHIR Genomics Operations: a developer-friendly approach to genomics-EHR integration","volume":"30","author":"Dolin","year":"2023","journal-title":"J Am Med Inform Assoc"},{"author":"HL7 FHIR Genomics Reporting Implementation Guide v3.0.0","key":"2025092208423535400_ocaf136-B2"},{"key":"2025092208423535400_ocaf136-B3","doi-asserted-by":"crossref","first-page":"1902","DOI":"10.1093\/bioinformatics\/btz856","article-title":"SPDI: data model for variants and applications at NCBI","volume":"36","author":"Holmes","year":"2020","journal-title":"Bioinformatics"},{"author":"The Variant Call Format (VCF) Specification","key":"2025092208423535400_ocaf136-B4"},{"author":"HGVS Sequence Variant Nomenclature","key":"2025092208423535400_ocaf136-B5"},{"key":"2025092208423535400_ocaf136-B6","doi-asserted-by":"publisher","first-page":"100027","DOI":"10.1016\/j.xgen.2021.100027","article-title":"The GA4GH variation representation specification: a computational framework for variation representation and federated identification","volume":"1","author":"Wagner","year":"2021","journal-title":"Cell Genom"},{"key":"2025092208423535400_ocaf136-B7","doi-asserted-by":"publisher","first-page":"104","DOI":"10.1186\/s12859-021-04039-1","article-title":"vcf2fhir: a utility to convert VCF files into HL7 FHIR format for genomics-EHR integration","volume":"22","author":"Dolin","year":"2021","journal-title":"BMC Bioinformatics"},{"key":"2025092208423535400_ocaf136-B8","doi-asserted-by":"publisher","first-page":"1803","DOI":"10.1002\/humu.23615","article-title":"hgvs: a Python package for manipulating sequence variants using HGVS nomenclature: 2018 Update","volume":"39","author":"Wang","year":"2018","journal-title":"Hum Mutat"},{"author":"Skidmore","key":"2025092208423535400_ocaf136-B9"},{"year":"2024","author":"Tretyakov","key":"2025092208423535400_ocaf136-B10"},{"key":"2025092208423535400_ocaf136-B11","doi-asserted-by":"crossref","first-page":"2235","DOI":"10.1056\/NEJMsr1406261","article-title":"ClinGen\u2014the clinical genome resource","volume":"372","author":"Rehm","year":"2015","journal-title":"N Engl J Med"},{"key":"2025092208423535400_ocaf136-B12","doi-asserted-by":"publisher","first-page":"68","DOI":"10.3892\/ijo.2023.5516","article-title":"Human leukocyte antigen and tumor immunotherapy (Review)","volume":"62","author":"Liu","year":"2023","journal-title":"Int J Oncol"},{"author":"HLA Nomenclature @ hla.alleles.org","key":"2025092208423535400_ocaf136-B13"},{"author":"Genotype List String 1.1: Extending the Genotype List String grammar for describing HLA and Killer-cell Immunoglobulin-like Receptor genotypes\u2014Mack\u20142023\u2014HLA\u2014Wiley Online Library","key":"2025092208423535400_ocaf136-B14"},{"author":"MAC UI","key":"2025092208423535400_ocaf136-B15"},{"key":"2025092208423535400_ocaf136-B16","doi-asserted-by":"publisher","first-page":"3406230","DOI":"10.1155\/2017\/3406230","article-title":"HLA epitopes: the targets of monoclonal and alloantibodies defined","volume":"2017","author":"El-Awar","year":"2017","journal-title":"J Immunol Res"},{"key":"2025092208423535400_ocaf136-B17","doi-asserted-by":"publisher","first-page":"e15549","DOI":"10.1111\/tan.15549","article-title":"25 years of the IPD-IMGT\/HLA database","volume":"103","author":"Robinson","year":"2024","journal-title":"HLA"},{"key":"2025092208423535400_ocaf136-B18"},{"key":"2025092208423535400_ocaf136-B19","doi-asserted-by":"publisher","first-page":"132","DOI":"10.1016\/j.humimm.2023.08.075","article-title":"P508 PY-ARD\u2014a Swiss army knife of HLA assignments","volume":"84","author":"Maiers","year":"2023","journal-title":"Hum Immunol"},{"year":"2024","author":"nmdp-bioinformatics\/py-ard","key":"2025092208423535400_ocaf136-B20"},{"author":"SnpEff and SnpSift","key":"2025092208423535400_ocaf136-B21"},{"key":"2025092208423535400_ocaf136-B22","doi-asserted-by":"publisher","first-page":"80","DOI":"10.4161\/fly.19695","article-title":"A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3","volume":"6","author":"Cingolani","year":"2012","journal-title":"Fly (Austin)"},{"key":"2025092208423535400_ocaf136-B23","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1186\/s13059-016-0974-4","article-title":"The Ensembl Variant Effect Predictor","volume":"17","author":"McLaren","year":"2016","journal-title":"Genome Biol"},{"key":"2025092208423535400_ocaf136-B24","doi-asserted-by":"publisher","first-page":"e164","DOI":"10.1093\/nar\/gkq603","article-title":"ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data","volume":"38","author":"Wang","year":"2010","journal-title":"Nucleic Acids Res"},{"author":"GA4GH Variant Annotation Specification\u2013GA4GH Variant Annotation Specification HEAD documentation","key":"2025092208423535400_ocaf136-B25"},{"key":"2025092208423535400_ocaf136-B26","doi-asserted-by":"crossref","first-page":"D862","DOI":"10.1093\/nar\/gkv1222","article-title":"ClinVar: public archive of interpretations of clinically relevant variants","volume":"44","author":"Landrum","year":"2016","journal-title":"Nucleic Acids Res."},{"key":"2025092208423535400_ocaf136-B27","doi-asserted-by":"publisher","first-page":"D1230","DOI":"10.1093\/nar\/gkac979","article-title":"CIViCdb 2022: evolution of an open-access cancer variant interpretation knowledgebase","volume":"51","author":"Krysiak","year":"2023","journal-title":"Nucleic Acids Res"},{"key":"2025092208423535400_ocaf136-B28","doi-asserted-by":"publisher","first-page":"e226","DOI":"10.1002\/cpz1.226","article-title":"PharmGKB, an integrated resource of pharmacogenomic knowledge","volume":"1","author":"Gong","year":"2021","journal-title":"Curr Protoc"},{"key":"2025092208423535400_ocaf136-B29","first-page":"346","article-title":"Kaiser permanente\u2019s convergent medical terminology","volume":"107","author":"Dolin","year":"2004","journal-title":"Stud Health Technol Inform"},{"author":"Terminology-module\u2014FHIR v5.0.0","key":"2025092208423535400_ocaf136-B30"},{"key":"2025092208423535400_ocaf136-B31","doi-asserted-by":"publisher","first-page":"e10385","DOI":"10.1002\/lrh2.10385","article-title":"Sync for genes phase 5: computable artifacts for sharing dynamically annotated FHIR-formatted genomic variants","volume":"7","author":"Dolin","year":"2023","journal-title":"Learn Health Syst"},{"first-page":"113","author":"Dolin","key":"2025092208423535400_ocaf136-B32","doi-asserted-by":"publisher","DOI":"10.1016\/j.yamp.2024.07.006"},{"year":"2024","author":"FHIR\/genomics-operations","key":"2025092208423535400_ocaf136-B33"},{"key":"2025092208423535400_ocaf136-B34","doi-asserted-by":"publisher","first-page":"e115","DOI":"10.1055\/s-0038-1676466","article-title":"A pharmacogenomics clinical decision support service based on FHIR and CDS hooks","volume":"57","author":"Dolin","year":"2018","journal-title":"Methods Inf Med"},{"key":"2025092208423535400_ocaf136-B35","doi-asserted-by":"publisher","first-page":"310","DOI":"10.1038\/s41586-022-04558-8","article-title":"A joint NCBI and EMBL-EBI transcript set for clinical genomics and research","volume":"604","author":"Morales","year":"2022","journal-title":"Nature"},{"key":"2025092208423535400_ocaf136-B36","first-page":"210","article-title":"Selective retrieval of pre- and post-coordinated SNOMED concepts","author":"Dolin","year":"2002","journal-title":"Proc AMIA Symp"},{"key":"2025092208423535400_ocaf136-B37","first-page":"627","article-title":"Normal forms for description logic expressions of clinical concepts in SNOMED RT","author":"Spackman","year":"2001","journal-title":"Proc AMIA Symp"},{"key":"2025092208423535400_ocaf136-B38","doi-asserted-by":"publisher","first-page":"103585","DOI":"10.1016\/j.jbi.2020.103585","article-title":"Automatic full conversion of clinical terms into SNOMED CT concepts","volume":"111","author":"Kate","year":"2020","journal-title":"J Biomed Inform"},{"key":"2025092208423535400_ocaf136-B39","doi-asserted-by":"publisher","first-page":"203","DOI":"10.1136\/jamia.1998.0050203","article-title":"Evaluation of a \u2018lexically assign, logically refine\u2019 strategy for semi-automated integration of overlapping terminologies","volume":"5","author":"Dolin","year":"1998","journal-title":"J Am Med Inform Assoc"},{"key":"2025092208423535400_ocaf136-B40","doi-asserted-by":"publisher","first-page":"S21","DOI":"10.1016\/j.cancergen.2024.08.067","article-title":"Creating a common language for categorical variants","volume":"286-287","author":"Puthawala","year":"2024","journal-title":"Cancer Genet."},{"author":"GA4GH Categorical Variation (CatVar)","key":"2025092208423535400_ocaf136-B41"},{"key":"2025092208423535400_ocaf136-B42","first-page":"359","article-title":"Molecularly-guided cancer clinical trial matching using FHIR and HL7 clinical quality language: a proof of concept","volume":"2024","author":"Dolin","year":"2024","journal-title":"AMIA Annu Symp Proc AMIA Symp"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/32\/10\/1598\/64248009\/ocaf136.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/32\/10\/1598\/64248009\/ocaf136.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,22]],"date-time":"2025-09-22T12:42:43Z","timestamp":1758544963000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/32\/10\/1598\/8251823"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,11]]},"references-count":42,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2025,9,11]]},"published-print":{"date-parts":[[2025,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocaf136","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"type":"print","value":"1067-5027"},{"type":"electronic","value":"1527-974X"}],"subject":[],"published-other":{"date-parts":[[2025,10]]},"published":{"date-parts":[[2025,9,11]]}}}