{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,5]],"date-time":"2026-05-05T12:31:52Z","timestamp":1777984312254,"version":"3.51.4"},"reference-count":16,"publisher":"Springer Science and Business Media LLC","issue":"S3","license":[{"start":{"date-parts":[[2024,4,19]],"date-time":"2024-04-19T00:00:00Z","timestamp":1713484800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,4,19]],"date-time":"2024-04-19T00:00:00Z","timestamp":1713484800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000049","name":"National Institute on Aging","doi-asserted-by":"publisher","award":["R21AG068994"],"award-info":[{"award-number":["R21AG068994"]}],"id":[{"id":"10.13039\/100000049","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000065","name":"National Institute of Neurological Disorders and Stroke","doi-asserted-by":"publisher","award":["R01NS116287"],"award-info":[{"award-number":["R01NS116287"]}],"id":[{"id":"10.13039\/100000065","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008982","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1931134"],"award-info":[{"award-number":["1931134"]}],"id":[{"id":"10.13039\/501100008982","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000092","name":"U.S. 
National Library of Medicine","doi-asserted-by":"publisher","award":["R01LM013335"],"award-info":[{"award-number":["R01LM013335"]}],"id":[{"id":"10.13039\/100000092","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Inform Decis Mak"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>Alzheimer\u2019s Disease (AD) is a devastating disease that destroys memory and other cognitive functions. There has been an increasing research effort to prevent and treat AD. In the US, two major data sharing resources for AD research are the National Alzheimer\u2019s Coordinating Center (NACC) and the Alzheimer\u2019s Disease Neuroimaging Initiative (ADNI); Additionally, the National Institutes of Health (NIH) Common Data Elements (CDE) Repository has been developed to facilitate data sharing and improve the interoperability among data sets in various disease research areas.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Method<\/jats:title>\n                    <jats:p>To better understand how AD-related data elements in these resources are interoperable with each other, we leverage different representation models to map data elements from different resources: NACC to ADNI, NACC to NIH CDE, and ADNI to NIH CDE. We explore bag-of-words based and word embeddings based models (Word2Vec and BioWordVec) to perform the data element mappings in these resources.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>The data dictionaries downloaded on November 23, 2021 contain 1,195 data elements in NACC, 13,918 in ADNI, and 27,213 in NIH CDE Repository. 
Data element preprocessing reduced the numbers of NACC and ADNI data elements for mapping to 1,099 and 7,584 respectively. Manual evaluation of the mapping results showed that the bag-of-words based approach achieved the best precision, while the BioWordVec based approach attained the best recall. In total, the three approaches mapped 175 out of 1,099 (15.92%) NACC data elements to ADNI; 107 out of 1,099 (9.74%) NACC data elements to NIH CDE; and 171 out of 7,584 (2.25%) ADNI data elements to NIH CDE.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>The bag-of-words based and word embeddings based approaches showed promise in mapping AD-related data elements between different resources. Although the mapping approaches need further improvement, our result indicates that there is a critical need to standardize CDEs across these valuable AD research resources in order to maximize the discoveries regarding AD pathophysiology, diagnosis, and treatment that can be gleaned from them.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12911-024-02500-8","type":"journal-article","created":{"date-parts":[[2024,4,19]],"date-time":"2024-04-19T04:01:51Z","timestamp":1713499311000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Mapping of Alzheimer\u2019s disease related data elements and the NIH Common Data Elements"],"prefix":"10.1186","volume":"24","author":[{"given":"Xubing","family":"Hao","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rashmie","family":"Abeysinghe","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fengbo","family":"Zheng","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Paul 
E.","family":"Schulz","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"name":"The Alzheimer\u2019s Disease Neuroimaging Initiative","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5549-8780","authenticated-orcid":false,"given":"Licong","family":"Cui","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,4,19]]},"reference":[{"issue":"8 Suppl","key":"2500_CR1","first-page":"S177","volume":"26","author":"W Wong","year":"2020","unstructured":"Wong W. Economic burden of Alzheimer disease and managed care considerations. Am J Manag Care. 2020;26(8 Suppl):S177\u201383.","journal-title":"Am J Manag Care."},{"issue":"4","key":"2500_CR2","first-page":"270","volume":"18","author":"DL Beekly","year":"2004","unstructured":"Beekly DL, Ramos EM, van Belle G, Deitrich W, Clark AD, Jacka ME, et al. The national Alzheimer\u2019s coordinating center (NACC) database: an Alzheimer disease database. Alzheimer Dis Assoc Disord. 2004;18(4):270\u20137.","journal-title":"Alzheimer Dis Assoc Disord."},{"issue":"4","key":"2500_CR3","doi-asserted-by":"publisher","first-page":"869","DOI":"10.1016\/j.nic.2005.09.008","volume":"15","author":"SG Mueller","year":"2005","unstructured":"Mueller SG, Weiner MW, Thal LJ, Petersen RC, Jack C, Jagust W, et al. The Alzheimer\u2019s disease neuroimaging initiative. Neuroimaging Clin N Am. 2005;15(4):869.","journal-title":"Neuroimaging Clin N Am."},{"issue":"3","key":"2500_CR4","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1097\/WAD.0b013e318142774e","volume":"21","author":"DL Beekly","year":"2007","unstructured":"Beekly DL, Ramos EM, Lee WW, Deitrich WD, Jacka ME, Wu J, et al. The National Alzheimer\u2019s Coordinating Center (NACC) database: the uniform data set. Alzheimer Dis Assoc Disord. 
2007;21(3):249\u201358.","journal-title":"Alzheimer Dis Assoc Disord."},{"issue":"6","key":"2500_CR5","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1177\/1740774516653238","volume":"13","author":"J Sheehan","year":"2016","unstructured":"Sheehan J, Hirschfeld S, Foster E, Ghitza U, Goetz K, Karpinski J, et al. Improving the value of clinical research through the use of common data elements. Clin Trials. 2016;13(6):671\u20136.","journal-title":"Clin Trials."},{"key":"2500_CR6","doi-asserted-by":"crossref","unstructured":"Mougin F, Burgun A, Bodenreider O. Mapping data elements to terminological resources for integrating biomedical data sources. In: BMC bioinformatics, vol.\u00a07. BioMed Central; 2006. p. 1\u201310.","DOI":"10.1186\/1471-2105-7-S3-S6"},{"issue":"4","key":"2500_CR7","doi-asserted-by":"publisher","first-page":"376","DOI":"10.1136\/amiajnl-2010-000061","volume":"18","author":"J Pathak","year":"2011","unstructured":"Pathak J, Wang J, Kashyap S, Basford M, Li R, Masys DR, et al. Mapping clinical phenotype data elements to standardized metadata repositories and controlled terminologies: the eMERGE Network experience. J Am Med Inform Assoc. 2011;18(4):376\u201386.","journal-title":"J Am Med Inform Assoc."},{"key":"2500_CR8","doi-asserted-by":"crossref","unstructured":"Liu K, Acharya A, Alai S, Schleyer T. Using electronic dental record data for research: a data-mapping study. J Dent Res. 2013;92(7_suppl):S90\u20136.","DOI":"10.1177\/0022034513487560"},{"key":"2500_CR9","unstructured":"Glyph L Cog. XPDFReader. 2023. https:\/\/www.xpdfreader.com\/about.html. Accessed 12 May 2023."},{"key":"2500_CR10","unstructured":"Davydova O. Text preprocessing in Python: Steps, Tools, and Examples. 2018. https:\/\/medium.com\/product-ai\/text-preprocessing-in-python-steps-tools-and-examples-bf025f872908. Accessed 12 May 2023."},{"key":"2500_CR11","unstructured":"Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. 
2013. arXiv preprint arXiv:1301.3781."},{"key":"2500_CR12","first-page":"3111","volume-title":"In: Advances in neural information processing systems","author":"T Mikolov","year":"2013","unstructured":"Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J. Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems. Red Hook: Curran Associates Inc.; 2013. p. 3111\u20139."},{"key":"2500_CR13","unstructured":"Google. word2vec. 2013. https:\/\/code.google.com\/archive\/p\/word2vec\/. Accessed 9 Sep 2021."},{"issue":"1","key":"2500_CR14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41597-019-0055-0","volume":"6","author":"Y Zhang","year":"2019","unstructured":"Zhang Y, Chen Q, Yang Z, Lin H, Lu Z. BioWordVec, improving biomedical word embeddings with subword information and MeSH. Sci Data. 2019;6(1):1\u20139.","journal-title":"Sci Data."},{"key":"2500_CR15","unstructured":"Kusner M, Sun Y, Kolkin N, Weinberger K. From word embeddings to document distances. In: International conference on machine learning. PMLR; 2015. p. 957\u2013966."},{"key":"2500_CR16","unstructured":"Rehurek R, Sojka P. Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 workshop on new challenges for NLP frameworks. 
Citeseer; 2010."}],"container-title":["BMC Medical Informatics and Decision Making"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-024-02500-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12911-024-02500-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-024-02500-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,19]],"date-time":"2024-04-19T04:04:46Z","timestamp":1713499486000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcmedinformdecismak.biomedcentral.com\/articles\/10.1186\/s12911-024-02500-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,19]]},"references-count":16,"journal-issue":{"issue":"S3","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["2500"],"URL":"https:\/\/doi.org\/10.1186\/s12911-024-02500-8","relation":{"is-referenced-by":[{"id-type":"doi","id":"10.1186\/s12955-026-02512-0","asserted-by":"object"}]},"ISSN":["1472-6947"],"issn-type":[{"value":"1472-6947","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,19]]},"assertion":[{"value":"24 December 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 April 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 April 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not 
applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"103"}}