{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T08:06:39Z","timestamp":1773129999698,"version":"3.50.1"},"reference-count":0,"publisher":"IOS Press","isbn-type":[{"value":"9781643686158","type":"electronic"}],"license":[{"start":{"date-parts":[[2025,9,3]],"date-time":"2025-09-03T00:00:00Z","timestamp":1756857600000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,9,3]]},"abstract":"<jats:p>Introduction: Mapping local medical data assets to international data standards such as medical ontology SNOMED CT fosters data harmonization and, thereby, global progress in medical research. Since its intense resource requirements often hinder manual SNOMED CT mapping, automated mapping tools such as MedCAT have been developed. We investigated how the formulation of study variable names (VNs) influences the efficacy and accuracy of the SNOMED CT concepts identified by MedCAT. Methods: We extracted 763 VNs from the GEPESTIM database hosted locally in REDCap and created three VNs using different REDCap metadata items for MedCAT-based SNOMED CT mapping. A fourth VN version was created manually. The mapping was evaluated based on the number and quality of identified SNOMED CT concepts, using manual scoring to assess concept accuracy while ensuring a blind evaluation process. Results: Increasing the expressiveness of VNs by adding more metadata items led to more SNOMED CT concepts being mapped, but also introduced mismatches, particularly when additionally included metadata contained misleading terms. The best overall mapping performance was achieved on the manually specified VNs while a basic VN version with minimal extra information from the metadata resulted in similarly good results. Conclusion: Our study identified key challenges in using MedCAT for automatically mapping study variables to SNOMED CT concepts. To improve accuracy, we recommend refining VNs reducing misleading terms and iteratively improving VN phrasing for optimal mapping outcome. Furthermore, it appears reasonable to always conduct a final manual review of the mapping outcome especially for critical variables and for those VNs containing negations or abbreviations.<\/jats:p>","DOI":"10.3233\/shti251390","type":"book-chapter","created":{"date-parts":[[2025,9,3]],"date-time":"2025-09-03T10:24:33Z","timestamp":1756895073000},"source":"Crossref","is-referenced-by-count":1,"title":["Catnip for MedCAT: Optimizing the Input for Automated SNOMED CT Mapping of Clinical Variables"],"prefix":"10.3233","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4101-5458","authenticated-orcid":false,"given":"Julia","family":"Gehrmann","sequence":"first","affiliation":[{"name":"Institute for Biomedical Informatics, University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany"}]},{"given":"Asme","family":"Dogan","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany"}]},{"given":"Lea","family":"Hagelschuer","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany"},{"name":"Center for Rare Diseases, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany"}]},{"given":"Lars","family":"Quakulinski","sequence":"additional","affiliation":[{"name":"Institute for Biomedical Informatics, University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany"}]},{"given":"Anne","family":"Koy","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany"},{"name":"Center for Rare Diseases, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany"}]},{"given":"Oya","family":"Beyan","sequence":"additional","affiliation":[{"name":"Institute for Biomedical Informatics, University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany"},{"name":"Department of Data Science and Artificial Intelligence, Fraunhofer FIT, Sankt Augustin, Germany"}]}],"member":"7437","container-title":["Studies in Health Technology and Informatics","German Medical Data Sciences 2025: GMDS Illuminates Health"],"original-title":[],"link":[{"URL":"https:\/\/ebooks.iospress.nl\/pdf\/doi\/10.3233\/SHTI251390","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,3]],"date-time":"2025-09-03T10:24:33Z","timestamp":1756895073000},"score":1,"resource":{"primary":{"URL":"https:\/\/ebooks.iospress.nl\/doi\/10.3233\/SHTI251390"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,3]]},"ISBN":["9781643686158"],"references-count":0,"URL":"https:\/\/doi.org\/10.3233\/shti251390","relation":{},"ISSN":["0926-9630","1879-8365"],"issn-type":[{"value":"0926-9630","type":"print"},{"value":"1879-8365","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,3]]}}}