{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T04:51:17Z","timestamp":1760244677884,"version":"build-2065373602"},"reference-count":55,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2022,11,15]],"date-time":"2022-11-15T00:00:00Z","timestamp":1668470400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001871","name":"Funda\u00e7\u00e3o para a Ci\u00eancia e Tecnologia (FCT)","doi-asserted-by":"publisher","award":["DSAIPA\/AI\/0088\/2020","PD\/BD\/142877\/2018","SFRH\/BD\/147837\/2019"],"award-info":[{"award-number":["DSAIPA\/AI\/0088\/2020","PD\/BD\/142877\/2018","SFRH\/BD\/147837\/2019"]}],"id":[{"id":"10.13039\/501100001871","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001871","name":"Funda\u00e7\u00e3o para a Ci\u00eancia e a Tecnologia (FCT; national funds)","doi-asserted-by":"publisher","award":["DSAIPA\/AI\/0088\/2020","PD\/BD\/142877\/2018","SFRH\/BD\/147837\/2019"],"award-info":[{"award-number":["DSAIPA\/AI\/0088\/2020","PD\/BD\/142877\/2018","SFRH\/BD\/147837\/2019"]}],"id":[{"id":"10.13039\/501100001871","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Healthcare"],"abstract":"<jats:p>Biomedical databases often have restricted access policies and governance rules. Thus, an adequate description of their content is essential for researchers who wish to use them for medical research. A strategy for publishing information without disclosing patient-level data is through database fingerprinting and aggregate characterisations. However, this information is still presented in a format that makes it challenging to search, analyse, and decide on the best databases for a domain of study. Several strategies allow one to visualise and compare the characteristics of multiple biomedical databases. Our study focused on a European platform for sharing and disseminating biomedical data. We use semantic data visualisation techniques to assist in comparing descriptive metadata from several databases. The great advantage lies in streamlining the database selection process, ensuring that sensitive details are not shared. To address this goal, we have considered two levels of data visualisation, one characterising a single database and the other involving multiple databases in network-level visualisations. This study revealed the impact of the proposed visualisations and some open challenges in representing semantically annotated biomedical datasets. Identifying future directions in this scope was one of the outcomes of this work.<\/jats:p>","DOI":"10.3390\/healthcare10112287","type":"journal-article","created":{"date-parts":[[2022,11,15]],"date-time":"2022-11-15T02:31:15Z","timestamp":1668479475000},"page":"2287","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Semantic Data Visualisation for Biomedical Database Catalogues"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3361-6269","authenticated-orcid":false,"given":"Arnaldo","family":"Pereira","sequence":"first","affiliation":[{"name":"Department of Electronics, Telecommunications and Informatics\/Institute of Electronics and Informatics Engineering of Aveiro, Intelligent Systems Associate Laboratory, University of Aveiro, 3810-193 Aveiro, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0729-2264","authenticated-orcid":false,"given":"Jo\u00e3o Rafael","family":"Almeida","sequence":"additional","affiliation":[{"name":"Department of Computation, University of A Coru\u00f1a, 15071 A Coru\u00f1a, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9170-5078","authenticated-orcid":false,"given":"Rui Pedro","family":"Lopes","sequence":"additional","affiliation":[{"name":"Research Centre in Digitalization and Intelligent Robotics, Polytechnic Institute of Bragan\u00e7a, 5300-253 Bragan\u00e7a, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6672-6176","authenticated-orcid":false,"given":"Jos\u00e9 Lu\u00eds","family":"Oliveira","sequence":"additional","affiliation":[{"name":"Department of Electronics, Telecommunications and Informatics\/Institute of Electronics and Informatics Engineering of Aveiro, Intelligent Systems Associate Laboratory, University of Aveiro, 3810-193 Aveiro, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2022,11,15]]},"reference":[{"key":"ref_1","first-page":"574","article-title":"Observational Health Data Sciences and Informatics (OHDSI): Opportunities for observational researchers","volume":"216","author":"Hripcsak","year":"2015","journal-title":"Stud. Health Technol. Inform."},{"key":"ref_2","first-page":"371","article-title":"Secondary analysis of existing data: Opportunities and implementation","volume":"26","author":"Cheng","year":"2014","journal-title":"Shanghai Arch. Psychiatry"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"7329","DOI":"10.1073\/pnas.1510502113","article-title":"Characterizing treatment pathways at scale using the OHDSI network","volume":"113","author":"Hripcsak","year":"2016","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1089\/omi.2011.0152","article-title":"Opportunities and challenges for the life sciences community","volume":"16","author":"Kolker","year":"2012","journal-title":"OMICS J. Integr. Biol."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Brickley, D., Burgess, M., and Noy, N. (2019, January 13\u201317). Google Dataset Search: Building a search engine for datasets in an open Web ecosystem. Proceedings of the The World Wide Web Conference (WWW), San Francisco, CA, USA.","DOI":"10.1145\/3308558.3313685"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/j.cmpb.2018.03.024","article-title":"MONTRA: An agile architecture for data publishing and discovery","volume":"160","author":"Silva","year":"2018","journal-title":"Comput. Methods Programs Biomed."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"957","DOI":"10.1002\/humu.22841","article-title":"Cafe Variome: General-purpose software for making genotype-phenotype data discoverable in restricted or open access contexts","volume":"36","author":"Lancaster","year":"2015","journal-title":"Hum. Mutat."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/database\/bay022","article-title":"YummyData: Providing high-quality open life science data","volume":"2018","author":"Yamamoto","year":"2018","journal-title":"Database"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"358","DOI":"10.1038\/s41587-019-0080-8","article-title":"FAIRsharing as a community approach to standards, repositories and policies","volume":"37","author":"Sansone","year":"2019","journal-title":"Nat. Biotechnol."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1186\/s13195-018-0396-5","article-title":"The EMIF-AD Multimodal Biomarker Discovery study: Design, methods and cohort characteristics","volume":"10","author":"Bos","year":"2018","journal-title":"Alzheimer\u2019S Res. Ther."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"969","DOI":"10.1093\/jamia\/ocy032","article-title":"Design and implementation of a standardized framework to generate and evaluate patient-level prediction models using observational healthcare data","volume":"25","author":"Reps","year":"2018","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1049\/iet-sen:20070109","article-title":"Semantic software metrics computed from natural language design specifications","volume":"2","author":"Gall","year":"2008","journal-title":"IET Softw."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Almeida, J.R., Monteiro, E., Silva, L.B., Sierra, A.P., and Oliveira, J.L. (2020, January 28\u201330). A recommender system to help discovering cohorts in rare diseases. Proceedings of the 33rd International Symposium on Computer-Based Medical Systems (CBMS), Rochester, MN, USA.","DOI":"10.1109\/CBMS49503.2020.00012"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1049\/sfw2.12028","article-title":"Systematic review of question answering over knowledge bases","volume":"16","author":"Pereira","year":"2022","journal-title":"IET Softw."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1108","DOI":"10.1016\/j.jbi.2013.08.006","article-title":"An innovative portal for rare genetic diseases research: The semantic Diseasecard","volume":"46","author":"Lopes","year":"2013","journal-title":"J. Biomed. Inform."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Pereira, A., Almeida, J.R., Lopes, R.P., and Oliveira, J.L. (2022, January 21\u201322). Visualising time-evolving semantic biomedical data. Proceedings of the 35th International Symposium on Computer-Based Medical Systems (CBMS), Shenzhen, China.","DOI":"10.1109\/CBMS55023.2022.00053"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Rietveld, L., and Hoekstra, R. (2013, January 27). YASGUI: Not just another SPARQL client. Proceedings of the ESWC2013 Workshop on Services and Applications over Linked APIs and Data, Montpellier, France.","DOI":"10.1007\/978-3-642-41242-4_7"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Schweiger, D., Trajanoski, Z., and Pabinger, S. (2014). SPARQLGraph: A web-based platform for graphically querying biological Semantic Web databases. BMC Bioinform., 15.","DOI":"10.1186\/1471-2105-15-279"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1186\/s13326-017-0151-z","article-title":"PIBAS FedSPARQL: A web-based platform for integration and exploration of bioinformatics datasets","volume":"8","author":"Cvjetkovic","year":"2017","journal-title":"J. Biomed. Semant."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Callahan, A., Cruz-Toledo, J., Ansell, P., and Dumontier, M. (2013, January 26\u201330). Bio2RDF Release 2: Improved coverage, interoperability and provenance of life science linked data. Proceedings of the The Semantic Web: Semantics and Big Data, ESWC 2013, Montpellier, France.","DOI":"10.1007\/978-3-642-38288-8_14"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Chen, B., Dong, X., Jiao, D., Wang, H., Zhu, Q., Ding, Y., and Wild, D. (2010). Chem2Bio2RDF: A semantic framework for linking and data mining chemogenomic and systems chemical biology data. BMC Bioinform., 11.","DOI":"10.1186\/1471-2105-11-255"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"W580","DOI":"10.1093\/nar\/gkv279","article-title":"The EMBL-EBI bioinformatics web and programmatic tools framework","volume":"43","author":"Li","year":"2015","journal-title":"Nucleic Acids Res."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1200","DOI":"10.1093\/bioinformatics\/btx739","article-title":"SATORI: A system for ontology-guided visual exploration of biomedical data repositories","volume":"34","author":"Lekschas","year":"2017","journal-title":"Bioinformatics"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1109\/TVCG.2021.3114863","article-title":"KG4Vis: A knowledge graph-based approach for visualization recommendation","volume":"28","author":"Li","year":"2022","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1006\/jvlc.1997.0037","article-title":"Visual query systems for databases: A survey","volume":"8","author":"Catarci","year":"1997","journal-title":"J. Vis. Lang. Comput."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Lloret-Gazo, J. (2016, January 5\u20138). A survey on visual query systems in the Web era. Proceedings of the 27th International Conference on Database and Expert Systems Applications (DEXA), Porto, Portugal.","DOI":"10.1007\/978-3-319-44406-2_28"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1145\/1121949.1121979","article-title":"Exploratory search: From finding to understanding","volume":"49","author":"Marchionini","year":"2006","journal-title":"Commun. ACM"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1456650.1456652","article-title":"A review of overview+detail, zooming, and focus+context interfaces","volume":"41","author":"Cockburn","year":"2009","journal-title":"ACM Comput. Surv."},{"key":"ref_29","unstructured":"Lima, M. (2011). Visual Complexity: Mapping Patterns of Information, Princeton Architectural Press."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1224","DOI":"10.1109\/TVCG.2007.70515","article-title":"Toward a deeper understanding of the role of interaction in information visualization","volume":"13","author":"Yi","year":"2007","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1145\/2133416.2146416","article-title":"Interactive dynamics for visual analysis: A taxonomy of tools that support the fluent and flexible use of visualizations","volume":"10","author":"Heer","year":"2012","journal-title":"Queue"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Knight, S.A., and Spink, A. (2008). Toward a Web search information behavior model. Web Search: Multidisciplinary Perspectives, Springer. Chapter 12.","DOI":"10.1007\/978-3-540-75829-7_12"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1016\/j.websem.2014.10.001","article-title":"An overview of semantic search evaluation initiatives","volume":"30","author":"Elbedweihy","year":"2015","journal-title":"J. Web Semant."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"384","DOI":"10.1145\/371578.371593","article-title":"Extracting usability information from user interface events","volume":"32","author":"Hilbert","year":"2000","journal-title":"ACM Comput. Surv."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1016\/j.ijmedinf.2019.02.006","article-title":"EMIF Catalogue: A collaborative platform for sharing and reusing biomedical data","volume":"126","author":"Oliveira","year":"2019","journal-title":"Int. J. Med. Inform."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Trifan, A., and Oliveira, J.L. (2018, January 18\u201321). A FAIR marketplace for biomedical data custodians and clinical researchers. Proceedings of the 31st International Symposium on Computer-Based Medical Systems (CBMS), Karlstad, Sweden.","DOI":"10.1109\/CBMS.2018.00040"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Agnihotri, M., and Chug, A. (2021, January 5\u20137). Analyzing the Relationship between Software Metrics and Bad Smells Using Critical Metric Value (CMV). Proceedings of the 2021 13th International Conference on Contemporary Computing (IC3-2021), Noida, India.","DOI":"10.1145\/3474124.3474193"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3233\/SW-160249","article-title":"Visualisation of Linked Data\u2014Reprise","volume":"8","author":"Dadzie","year":"2016","journal-title":"Semant. Web"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"2201","DOI":"10.1056\/NEJMp1704482","article-title":"Bridging the data-sharing divide\u2014Seeing the devil in the details, not the other camp","volume":"376","author":"Rosenbaum","year":"2017","journal-title":"N. Engl. J. Med."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"827","DOI":"10.1093\/ije\/dyv098","article-title":"Data resource profile: Clinical Practice Research Datalink (CPRD)","volume":"44","author":"Herrett","year":"2015","journal-title":"Int. J. Epidemiol."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"497","DOI":"10.1016\/S0140-6736(20)30183-5","article-title":"Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China","volume":"395","author":"Huang","year":"2020","journal-title":"Lancet"},{"key":"ref_42","first-page":"1","article-title":"Analysis and visualization of disease courses in a semantic enabled cancer registry","volume":"8","author":"Boeker","year":"2017","journal-title":"J. Biomed. Semant."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"100760","DOI":"10.1016\/j.imu.2021.100760","article-title":"A methodology for cohort harmonisation in multicentre clinical research","volume":"27","author":"Almeida","year":"2021","journal-title":"Inform. Med. Unlocked"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1186\/s40537-021-00501-2","article-title":"Design matters in patient-level prediction: Evaluation of a cohort vs. case-control design when developing predictive models in observational healthcare datasets","volume":"8","author":"Reps","year":"2021","journal-title":"J. Big Data"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Sequeira, M., Almeida, J.R., and Oliveira, J.L. (2021, January 7\u20139). A comparative analysis of data platforms for rare diseases. Proceedings of the 34th International Symposium on Computer-Based Medical Systems (CBMS), Aveiro, Portugal.","DOI":"10.1109\/CBMS52027.2021.00041"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"e98","DOI":"10.1016\/S2589-7500(20)30289-2","article-title":"Renin\u2013angiotensin system blockers and susceptibility to COVID-19: An international, open science, cohort analysis","volume":"3","author":"Morales","year":"2021","journal-title":"Lancet Digit. Health"},{"key":"ref_47","first-page":"D865","article-title":"The Human Phenotype Ontology in 2017","volume":"45","author":"Vasilevsky","year":"2016","journal-title":"Nucleic Acids Res."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"The Gene Ontology Consortium (2016). Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Res., 45, D331\u2013D338.","DOI":"10.1093\/nar\/gkw1108"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"S7","DOI":"10.1186\/2041-1480-1-S1-S7","article-title":"Modeling biomedical experimental processes with OBI","volume":"1","author":"Brinkman","year":"2010","journal-title":"J. Biomed. Semant."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1049\/iet-sen.2016.0048","article-title":"Ontology-based service discovery framework for dynamic environments","volume":"11","author":"Zeshan","year":"2017","journal-title":"IET Softw."},{"key":"ref_51","first-page":"1","article-title":"Semantic similarity and machine learning with ontologies","volume":"22","author":"Kulmanov","year":"2020","journal-title":"Briefings Bioinform."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3177850","article-title":"RDF data storage and query processing schemes: A survey","volume":"51","author":"Wylot","year":"2018","journal-title":"ACM Comput. Surv."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1142\/S0218488502001648","article-title":"k-anonymity: A model for protecting privacy","volume":"10","author":"Sweeney","year":"2002","journal-title":"Int. J. Uncertain. Fuzziness Knowl. Based Syst."},{"key":"ref_54","first-page":"1","article-title":"L-diversity: Privacy beyond k-anonymity","volume":"1","author":"Machanavajjhala","year":"2007","journal-title":"ACM Trans. Knowl. Discov. Data (TKDD)"},{"key":"ref_55","first-page":"1866","article-title":"Balanced k-Anonymization","volume":"1","year":"2007","journal-title":"Int. J. Comput. Inf. Eng."}],"container-title":["Healthcare"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2227-9032\/10\/11\/2287\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:18:13Z","timestamp":1760145493000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2227-9032\/10\/11\/2287"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,15]]},"references-count":55,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2022,11]]}},"alternative-id":["healthcare10112287"],"URL":"https:\/\/doi.org\/10.3390\/healthcare10112287","relation":{},"ISSN":["2227-9032"],"issn-type":[{"type":"electronic","value":"2227-9032"}],"subject":[],"published":{"date-parts":[[2022,11,15]]}}}