{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T16:15:40Z","timestamp":1759335340745,"version":"3.37.3"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,6,20]],"date-time":"2023-06-20T00:00:00Z","timestamp":1687219200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,6,20]],"date-time":"2023-06-20T00:00:00Z","timestamp":1687219200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001823","name":"Ministerstvo \u0160kolstv\u00ed, Ml\u00e1de\u017ee a T\u011blov\u00fdchovy","doi-asserted-by":"publisher","award":["LM2018131"],"award-info":[{"award-number":["LM2018131"]}],"id":[{"id":"10.13039\/501100001823","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Institute of Organic Chemistry and Biochemistry, Czech Republic","award":["RVO:61388963"],"award-info":[{"award-number":["RVO:61388963"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Current biological and chemical research is increasingly dependent on the reusability of previously acquired data, which typically come from various sources. Consequently, there is a growing need for database systems and databases stored in them to be interoperable with each other. One of the possible solutions to address this issue is to use systems based on Semantic Web technologies, namely on the Resource Description Framework (RDF) to express data and on the SPARQL query language to retrieve the data. Many existing biological and chemical databases are stored in the form of a relational database (RDB). Converting a relational database into the RDF form and storing it in a native RDF database system may not be desirable in many cases. It may be necessary to preserve the original database form, and having two versions of the same data may not be convenient. A solution may be to use a system mapping the relational database to the RDF form. Such a system keeps data in their original relational form and translates incoming SPARQL queries to equivalent SQL queries, which are evaluated by a relational-database system. This review compares different RDB-to-RDF mapping systems with a primary focus on those that can be used free of charge. In addition, it compares different approaches to expressing RDB-to-RDF mappings. The review shows that these systems represent a viable method providing sufficient performance. Their real-life performance is demonstrated on data and queries coming from the neXtProt project.<\/jats:p>","DOI":"10.1186\/s13321-023-00729-5","type":"journal-article","created":{"date-parts":[[2023,6,20]],"date-time":"2023-06-20T06:02:02Z","timestamp":1687240922000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL"],"prefix":"10.1186","volume":"15","author":[{"given":"Jakub","family":"Galgonek","sequence":"first","affiliation":[]},{"given":"Ji\u0159\u00ed","family":"Vondr\u00e1\u0161ek","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,6,20]]},"reference":[{"key":"729_CR1","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2016.18","volume":"3","author":"MD Wilkinson","year":"2016","unstructured":"Wilkinson MD et al (2016) The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3:160018","journal-title":"Sci Data"},{"key":"729_CR2","doi-asserted-by":"publisher","first-page":"D1075","DOI":"10.1093\/nar\/gkt978","volume":"42","author":"Y Wang","year":"2014","unstructured":"Wang Y et al (2014) PubChem BioAssay: 2014 update. Nucleic Acids Res 42:D1075\u2013D1082","journal-title":"Nucleic Acids Res"},{"key":"729_CR3","doi-asserted-by":"publisher","first-page":"D930","DOI":"10.1093\/nar\/gky1075","volume":"47","author":"D Mendez","year":"2019","unstructured":"Mendez D et al (2019) ChEMBL: towards direct deposition of bioassay data. Nucleic Acids Res 47:D930\u2013D940","journal-title":"Nucleic Acids Res"},{"key":"729_CR4","doi-asserted-by":"publisher","first-page":"D754","DOI":"10.1093\/nar\/gkr1126","volume":"40","author":"R Alcantara","year":"2012","unstructured":"Alcantara R et al (2012) Rhea\u2014a manually curated resource of biochemical reactions. Nucleic Acids Res 40:D754\u2013D760","journal-title":"Nucleic Acids Res"},{"key":"729_CR5","doi-asserted-by":"publisher","first-page":"baz078","DOI":"10.1093\/database\/baz078","volume":"2019","author":"J Juracka","year":"2019","unstructured":"Juracka J, Srejber M, Melikova M, Bazgier V, Berka K (2019) MolMeDB: molecules on membranes database. Database 2019:baz078","journal-title":"Database"},{"unstructured":"W3C (2014) RDF 1.1 primer . https:\/\/www.w3.org\/TR\/rdf11-primer\/. Accessed 15 Sep 2022","key":"729_CR6"},{"unstructured":"W3C (2013) SPARQL 1.1 query language . https:\/\/www.w3.org\/TR\/sparql11-query\/. Accessed 15 Sep 2022","key":"729_CR7"},{"key":"729_CR8","doi-asserted-by":"publisher","first-page":"D480","DOI":"10.1093\/nar\/gkaa1100","volume":"49","author":"C UniProt","year":"2021","unstructured":"UniProt C (2021) UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res 49:D480\u2013D489","journal-title":"Nucleic Acids Res"},{"key":"729_CR9","doi-asserted-by":"publisher","first-page":"D693","DOI":"10.1093\/nar\/gkab1016","volume":"50","author":"P Bansal","year":"2022","unstructured":"Bansal P et al (2022) Rhea, the reaction knowledgebase in 2022. Nucleic Acids Res 50:D693\u2013D700","journal-title":"Nucleic Acids Res"},{"key":"729_CR10","first-page":"D328","volume":"48","author":"M Zahn-Zabal","year":"2020","unstructured":"Zahn-Zabal M et al (2020) The neXtProt knowledgebase in 2020: data, tools and usability improvements. Nucleic Acids Res 48:D328\u2013D334","journal-title":"Nucleic Acids Res"},{"key":"729_CR11","first-page":"D845","volume":"48","author":"J Pinero","year":"2020","unstructured":"Pinero J et al (2020) The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res 48:D845\u2013D855","journal-title":"Nucleic Acids Res"},{"unstructured":"Wikidata. https:\/\/www.wikidata.org. Accessed 15 Sep 2022","key":"729_CR12"},{"key":"729_CR13","doi-asserted-by":"publisher","first-page":"e70780","DOI":"10.7554\/eLife.70780","volume":"11","author":"A Rutz","year":"2022","unstructured":"Rutz A et al (2022) The LOTUS initiative for open knowledge management in natural products research. Elife 11:e70780","journal-title":"Elife"},{"unstructured":"W3C (2014) RDF 1.1 concepts and abstract syntax . https:\/\/www.w3.org\/TR\/rdf11-concepts\/. Accessed 15 Sep 2022","key":"729_CR14"},{"key":"729_CR15","doi-asserted-by":"publisher","first-page":"377","DOI":"10.1145\/362384.362685","volume":"13","author":"EF Codd","year":"1970","unstructured":"Codd EF (1970) A relational model of data for large shared data banks. Commun ACM 13:377\u2013387. https:\/\/doi.org\/10.1145\/362384.362685","journal-title":"Commun ACM"},{"unstructured":"W3C (2012) R2RML: RDB to RDF mapping language . https:\/\/www.w3.org\/TR\/r2rml\/. Accessed 15 Sep 2022","key":"729_CR16"},{"unstructured":"OpenLink Software: Virtuoso. https:\/\/virtuoso.openlinksw.com. Accessed 15 Sep 2022","key":"729_CR17"},{"unstructured":"Blazegraph. https:\/\/blazegraph.com. Accessed 15 Sep 2022","key":"729_CR18"},{"key":"729_CR19","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbac199","author":"N Zong","year":"2022","unstructured":"Zong N et al (2022) BETA: a comprehensive benchmark for computational drug-target prediction. Brief Bioinform. https:\/\/doi.org\/10.1093\/bib\/bbac199","journal-title":"Brief Bioinform"},{"unstructured":"Ontotext GraphDB. https:\/\/graphdb.ontotext.com. Accessed 15 Sep 2022","key":"729_CR20"},{"key":"729_CR21","doi-asserted-by":"publisher","first-page":"3231","DOI":"10.1093\/bioinformatics\/bty319","volume":"34","author":"S Zhao","year":"2018","unstructured":"Zhao S et al (2018) GlycoStore: a database of retention properties for glycan analysis. Bioinformatics 34:3231\u20133232","journal-title":"Bioinformatics"},{"key":"729_CR22","doi-asserted-by":"publisher","first-page":"435","DOI":"10.1186\/s12859-017-1840-4","volume":"18","author":"N Zaki","year":"2017","unstructured":"Zaki N, Tennakoon C (2017) BioCarian: search engine for exploratory searches in heterogeneous biological databases. BMC Bioinf 18:435","journal-title":"BMC Bioinf"},{"unstructured":"Apache Jena. https:\/\/jena.apache.org. Accessed 15 Sep 2022","key":"729_CR23"},{"unstructured":"Linked data views over RDBMS data source. http:\/\/docs.openlinksw.com\/virtuoso\/rdfviewsrdbms\/. Accessed 15 Sep 2022","key":"729_CR24"},{"unstructured":"Ontop. https:\/\/ontop-vkg.org. Accessed 15 Sep 2022","key":"729_CR25"},{"key":"729_CR26","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1186\/s13321-021-00515-1","volume":"13","author":"J Galgonek","year":"2021","unstructured":"Galgonek J, Vondrasek J (2021) IDSM ChemWebRDF: SPARQLing small-molecule datasets. J Cheminform 13:38","journal-title":"J Cheminform"},{"unstructured":"neXtProt SNORQL. https:\/\/snorql.nextprot.org. Accessed 15 Sep 2022","key":"729_CR27"},{"key":"729_CR28","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1186\/s13321-018-0282-y","volume":"10","author":"M Kratochvil","year":"2018","unstructured":"Kratochvil M, Vondrasek J, Galgonek J (2018) Sachem: a chemical cartridge for high-performance substructure search. J Cheminform 10:27","journal-title":"J Cheminform"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00729-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-023-00729-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00729-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,20]],"date-time":"2023-06-20T06:04:43Z","timestamp":1687241083000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-023-00729-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,20]]},"references-count":28,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["729"],"URL":"https:\/\/doi.org\/10.1186\/s13321-023-00729-5","relation":{},"ISSN":["1758-2946"],"issn-type":[{"type":"electronic","value":"1758-2946"}],"subject":[],"published":{"date-parts":[[2023,6,20]]},"assertion":[{"value":"15 September 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 May 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 June 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interests except that they develop the IDSM SPARQL engine, one of the RDB-to-RDF mapping systems compared in the review.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"61"}}