{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,6]],"date-time":"2026-01-06T05:07:40Z","timestamp":1767676060726,"version":"build-2065373602"},"reference-count":52,"publisher":"SAGE Publications","issue":"6","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["SW"],"published-print":{"date-parts":[[2021,10,4]]},"abstract":"<jats:p>Ontology-Based Data Access (OBDA) has traditionally focused on providing a unified view of heterogeneous datasets (e.g., relational databases, CSV and JSON files), either by materializing integrated data into RDF or by performing on-the-fly querying via SPARQL query translation. In the specific case of tabular datasets represented as several CSV or Excel files, query translation approaches have been applied by considering each source as a single table that can be loaded into a relational database management system (RDBMS). Nevertheless, constraints over these tables are not represented (e.g., referential integrity among sources, datatypes, or data integrity); thus, neither consistency among attributes nor indexes over tables are enforced. As a consequence, efficiency of the SPARQL-to-SQL translation process may be affected, as well as the completeness of the answers produced during the evaluation of the generated SQL query. Our work is focused on applying implicit constraints on the OBDA query translation process over tabular data. We propose Morph-CSV, a framework for querying tabular data that exploits information from typical OBDA inputs (e.g., mappings, queries) to enforce constraints that can be used together with any SPARQL-to-SQL OBDA engine. Morph-CSV relies on both a constraint component and a set of constraint operators. For a given set of constraints, the operators are applied to each type of constraint with the aim of enhancing query completeness and performance. We evaluate Morph-CSV in several domains: e-commerce with the BSBM benchmark; transportation with the GTFS-Madrid benchmark; and biology with a use case extracted from the Bio2RDF project. We compare and report the performance of two SPARQL-to-SQL OBDA engines, without and with the incorporation of Morph-CSV. The observed results suggest that Morph-CSV is able to speed up the total query execution time by up to two orders of magnitude, while it is able to produce all the query answers.<\/jats:p>","DOI":"10.3233\/sw-210432","type":"journal-article","created":{"date-parts":[[2021,4,9]],"date-time":"2021-04-09T12:22:49Z","timestamp":1617970969000},"page":"869-902","source":"Crossref","is-referenced-by-count":12,"title":["Enhancing virtual ontology based access over tabular data with Morph-CSV"],"prefix":"10.1177","volume":"12","author":[{"given":"David","family":"Chaves-Fraga","sequence":"first","affiliation":[{"name":"Ontology Engineering Group, Universidad Polit\u00e9cnica de Madrid, Spain. E-mails:\u00a0dchaves@fi.upm.es,\u00a0eruckhaus@fi.upm.es,\u00a0fpriyatna@fi.upm.es,\u00a0ocorcho@fi.upm.es"}]},{"given":"Edna","family":"Ruckhaus","sequence":"additional","affiliation":[{"name":"Ontology Engineering Group, Universidad Polit\u00e9cnica de Madrid, Spain. E-mails:\u00a0dchaves@fi.upm.es,\u00a0eruckhaus@fi.upm.es,\u00a0fpriyatna@fi.upm.es,\u00a0ocorcho@fi.upm.es"}]},{"given":"Freddy","family":"Priyatna","sequence":"additional","affiliation":[{"name":"Ontology Engineering Group, Universidad Polit\u00e9cnica de Madrid, Spain. E-mails:\u00a0dchaves@fi.upm.es,\u00a0eruckhaus@fi.upm.es,\u00a0fpriyatna@fi.upm.es,\u00a0ocorcho@fi.upm.es"}]},{"given":"Maria-Esther","family":"Vidal","sequence":"additional","affiliation":[{"name":"TIB - Leibniz Information Centre for Science and Technology and L3S Leibniz University of Hannover, Germany. E-mail:\u00a0maria.vidal@tib.eu"}]},{"given":"Oscar","family":"Corcho","sequence":"additional","affiliation":[{"name":"Ontology Engineering Group, Universidad Polit\u00e9cnica de Madrid, Spain. E-mails:\u00a0dchaves@fi.upm.es,\u00a0eruckhaus@fi.upm.es,\u00a0fpriyatna@fi.upm.es,\u00a0ocorcho@fi.upm.es"}]}],"member":"179","reference":[{"key":"10.3233\/SW-210432_ref1","doi-asserted-by":"publisher","DOI":"10.1145\/2213836.2213864"},{"key":"10.3233\/SW-210432_ref2","doi-asserted-by":"publisher","DOI":"10.1016\/B978-0-934613-53-8.50035-2"},{"issue":"5","key":"10.3233\/SW-210432_ref3","doi-asserted-by":"publisher","first-page":"706","DOI":"10.1016\/j.jbi.2008.03.004","article-title":"Bio2RDF: Towards a mashup to build bioinformatics knowledge systems","volume":"41","author":"Belleau","year":"2008","journal-title":"Journal of biomedical informatics"},{"key":"10.3233\/SW-210432_ref4","doi-asserted-by":"publisher","DOI":"10.4018\/jswis.2009081901"},{"issue":"2","key":"10.3233\/SW-210432_ref5","doi-asserted-by":"publisher","first-page":"1","DOI":"10.4018\/jswis.2009040101","article-title":"The Berlin SPARQL benchmark","volume":"5","author":"Bizer","year":"2009","journal-title":"International Journal on Semantic Web and Information Systems (IJSWIS)"},{"issue":"1","key":"10.3233\/SW-210432_ref6","doi-asserted-by":"publisher","first-page":"21","DOI":"10.3233\/IA-190023","article-title":"Ontology-based data access\u2013Beyond relational sources","volume":"13","author":"Botoeva","year":"2019","journal-title":"Intelligenza Artificiale"},{"issue":"3","key":"10.3233\/SW-210432_ref7","doi-asserted-by":"publisher","first-page":"471","DOI":"10.3233\/SW-160217","article-title":"Ontop: Answering SPARQL queries over relational databases","volume":"8","author":"Calvanese","year":"2017","journal-title":"Semantic Web"},{"key":"10.3233\/SW-210432_ref8","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45632-5_2"},{"key":"10.3233\/SW-210432_ref9","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2020.100596"},{"key":"10.3233\/SW-210432_ref10","unstructured":"S. Chawathe, H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman and J. Widom, The TSIMMIS project: Integration of heterogenous information sources, in: Information Processing Society of Japan (IPSJ 1994), 1994."},{"issue":"4","key":"10.3233\/SW-210432_ref11","doi-asserted-by":"publisher","first-page":"397","DOI":"10.1145\/320107.320109","article-title":"Extending the database relational model to capture more meaning","volume":"4","author":"Codd","year":"1979","journal-title":"ACM Transactions on Database Systems (TODS)"},{"issue":"1","key":"10.3233\/SW-210432_ref12","doi-asserted-by":"publisher","first-page":"153","DOI":"10.3233\/SW-190384","article-title":"Towards a new generation of ontology based data access","volume":"11","author":"Corcho","year":"2020","journal-title":"Semantic Web Journal"},{"key":"10.3233\/SW-210432_ref14","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-47602-5_10"},{"key":"10.3233\/SW-210432_ref15","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-58451-5_3"},{"key":"10.3233\/SW-210432_ref16","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2019.10.006"},{"key":"10.3233\/SW-210432_ref17","unstructured":"C. Debruyne and D. O\u2019Sullivan, R2RML-F: Towards sharing and executing domain logic in R2RML mappings, in: LDOW@ WWW, 2016."},{"key":"10.3233\/SW-210432_ref18","unstructured":"A. Dimou, M. Vander Sande, P. Colpaert, R. Verborgh, E. Mannens and R. Van de Walle, RML: A generic language for integrated RDF mappings of heterogeneous data, in: LDOW, 2014."},{"key":"10.3233\/SW-210432_ref19","doi-asserted-by":"publisher","DOI":"10.1016\/C2011-0-06130-6"},{"key":"10.3233\/SW-210432_ref20","unstructured":"M. Dumontier, A. Callahan, J. Cruz-Toledo, P. Ansell, V. Emonet, F. Belleau and A. Droit, Bio2RDF release 3: A larger connected network of linked data for the life sciences, in: Proceedings of the 2014 International Conference on Posters & Demonstrations Track, Vol. 1272, 2014, pp. 401\u2013404."},{"key":"10.3233\/SW-210432_ref21","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-58415-6_1"},{"key":"10.3233\/SW-210432_ref22","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-27615-7_29"},{"key":"10.3233\/SW-210432_ref23","doi-asserted-by":"publisher","DOI":"10.1145\/3034786.3056124"},{"key":"10.3233\/SW-210432_ref25","unstructured":"A. Halevy, A. Rajaraman and J. Ordille, Data integration: The teenage years, in: Proceedings of the 32nd International Conference on Very Large Data Bases, 2006, pp. 9\u201316."},{"key":"10.3233\/SW-210432_ref26","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-98192-5_40"},{"key":"10.3233\/SW-210432_ref27","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-42019-6_18"},{"key":"10.3233\/SW-210432_ref28","unstructured":"S. Idreos, I. Alagiannis, R. Johnson and A. Ailamaki, Here are my data files. Here are my queries. Where are my results? in: Proceedings of 5th Biennial Conference on Innovative Data Systems Research, 2011."},{"key":"10.3233\/SW-210432_ref29","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3412881"},{"key":"10.3233\/SW-210432_ref30","unstructured":"A. Iglesias-Molina, D. Chaves-Fraga, F. Priyatna and O. Corcho, Enhancing the maintainability of the Bio2RDF project using declarative mappings, in: Proceedings of the 12th International Conference on Semantic Web Applications and Tools for Healthcare and Life Sciences, 2019."},{"key":"10.3233\/SW-210432_ref31","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-62419-4_16"},{"key":"10.3233\/SW-210432_ref32","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-33246-4_4"},{"key":"10.3233\/SW-210432_ref33","doi-asserted-by":"publisher","DOI":"10.1145\/3011141.3011152"},{"issue":"2","key":"10.3233\/SW-210432_ref34","doi-asserted-by":"publisher","first-page":"413","DOI":"10.3233\/SW-180336","article-title":"VIG: Data scaling for OBDA benchmarks","volume":"10","author":"Lanti","year":"2019","journal-title":"Semantic Web"},{"key":"10.3233\/SW-210432_ref35","doi-asserted-by":"publisher","DOI":"10.1145\/543613.543644"},{"key":"10.3233\/SW-210432_ref36","unstructured":"S.M.H. Mahmud, M. Hossin, H. Jahan, S. Noori and M. Hossain, CSV2RDF: Generating RDF data from CSV file using semantic web technologies, Journal of Theoretical and Applied Information Technology 96 (2018)."},{"key":"10.3233\/SW-210432_ref37","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-30796-7_15"},{"key":"10.3233\/SW-210432_ref38","doi-asserted-by":"publisher","DOI":"10.5220\/0005448304430454"},{"key":"10.3233\/SW-210432_ref39","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-11964-9_36"},{"key":"10.3233\/SW-210432_ref40","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-77688-8_5"},{"key":"10.3233\/SW-210432_ref41","doi-asserted-by":"publisher","DOI":"10.1145\/2566486.2567981"},{"key":"10.3233\/SW-210432_ref42","unstructured":"A.L.A. Rajaraman, J. Ordille et al., Querying heterogeneous information sources using source descriptions, in: Proc. of VLDB, 1996."},{"key":"10.3233\/SW-210432_ref43","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1016\/j.websem.2015.03.001","article-title":"Efficient SPARQL-to-SQL with R2RML mappings","volume":"33","author":"Rodriguez-Muro","year":"2015","journal-title":"Web Semantics"},{"key":"10.3233\/SW-210432_ref45","unstructured":"M.T. Roth and P.M. Schwarz, Don\u2019t scrap it, wrap it! A wrapper architecture for legacy data sources, in: VLDB, Vol. 97, 1997, pp. 25\u201329."},{"key":"10.3233\/SW-210432_ref46","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1016\/j.websem.2013.08.002","article-title":"Ultrawrap: SPARQL execution on relational data","volume":"22","author":"Sequeda","year":"2013","journal-title":"Journal of Web Semantics"},{"key":"10.3233\/SW-210432_ref47","unstructured":"U. \u015eim\u015fek, E. K\u00e4rle and D. Fensel, RocketRML-A NodeJS implementation of a use-case specific RML mapper, in: Proceeding of the First International Workshop on Knowledge Graph Building, 2019."},{"key":"10.3233\/SW-210432_ref48","unstructured":"J. Slepicka, C. Yin, P.A. Szekely and C.A. Knoblock, KR2RML: An alternative interpretation of R2RML for heterogenous sources, in: COLD, 2015."},{"key":"10.3233\/SW-210432_ref49","unstructured":"J. Tennison, G. Kellogg and I. Herman, Model for tabular data and metadata on the web. W3C recommendation, in: World Wide Web Consortium (W3C), 2015."},{"key":"10.3233\/SW-210432_ref50","doi-asserted-by":"publisher","DOI":"10.1109\/COOPIS.1998.706180"},{"key":"10.3233\/SW-210432_ref51","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-13486-9_16"},{"issue":"3","key":"10.3233\/SW-210432_ref52","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1109\/2.121508","article-title":"Mediators in the architecture of future information systems","volume":"25","author":"Wiederhold","year":"1992","journal-title":"Computer"},{"key":"10.3233\/SW-210432_ref53","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2016.18"},{"key":"10.3233\/SW-210432_ref54","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/777"},{"key":"10.3233\/SW-210432_ref55","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-00671-6_21"}],"container-title":["Semantic Web"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/SW-210432","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,26]],"date-time":"2025-08-26T11:01:06Z","timestamp":1756206066000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/SW-210432"}},"subtitle":[],"editor":[{"given":"Axel-Cyrille Ngonga","family":"Ngomo","sequence":"additional","affiliation":[{"name":"University of Paderborn, Germany"}]},{"given":"Muhammad","family":"Saleem","sequence":"additional","affiliation":[{"name":"University of Leipzig, Germany"}]},{"given":"Ruben","family":"Verborgh","sequence":"additional","affiliation":[{"name":"Ghent University \u2013 IMEC, Germany"}]},{"given":"Muhammad","family":"Saleem","sequence":"additional","affiliation":[]},{"given":"Ruben","family":"Verborgh","sequence":"additional","affiliation":[]},{"given":"Muhammad Intizar","family":"Ali","sequence":"additional","affiliation":[]},{"given":"Olaf","family":"Hartig","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,10,4]]},"references-count":52,"journal-issue":{"issue":"6"},"URL":"https:\/\/doi.org\/10.3233\/sw-210432","relation":{},"ISSN":["2210-4968","1570-0844"],"issn-type":[{"type":"electronic","value":"2210-4968"},{"type":"print","value":"1570-0844"}],"subject":[],"published":{"date-parts":[[2021,10,4]]}}}