{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T23:16:36Z","timestamp":1780442196750,"version":"3.54.1"},"reference-count":28,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2022,1,31]],"date-time":"2022-01-31T00:00:00Z","timestamp":1643587200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGMOD Rec."],"published-print":{"date-parts":[[2022,1,31]]},"abstract":"<jats:p>A full-fledged data exploration system must combine different access modalities with a powerful concept of guiding the user in the exploration process, by being reactive and anticipative both for data discovery and for data linking. Such systems are a real opportunity for our community to cater to users with different domain and data science expertise.<\/jats:p>\n          <jats:p>We introduce INODE - an end-to-end data exploration system - that leverages, on the one hand, Machine Learning and, on the other hand, semantics for the purpose of Data Management (DM). Our vision is to develop a classic unified, comprehensive platform that provides extensive access to open datasets, and we demonstrate it in three significant use cases in the fields of Cancer Biomarker Research, Research and Innovation Policy Making, and Astrophysics. INODE offers sustainable services in (a) data modeling and linking, (b) integrated query processing using natural language, (c) guidance, and (d) data exploration through visualization, thus facilitating the user in discovering new insights. We demonstrate that our system is uniquely accessible to a wide range of users from larger scientific communities to the public. Finally, we briefly illustrate how this work paves the way for new research opportunities in DM.<\/jats:p>","DOI":"10.1145\/3516431.3516436","type":"journal-article","created":{"date-parts":[[2022,1,31]],"date-time":"2022-01-31T23:31:58Z","timestamp":1643671918000},"page":"23-29","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["INODE"],"prefix":"10.1145","volume":"50","author":[{"given":"Sihem","family":"Amer-Yahia","sequence":"first","affiliation":[{"name":"University Grenoble Alpes, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Georgia","family":"Koutrika","sequence":"additional","affiliation":[{"name":"Athena Research Center, Greece"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Martin","family":"Braschler","sequence":"additional","affiliation":[{"name":"Zurich University of Applied Sciences, Switzerland"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Diego","family":"Calvanese","sequence":"additional","affiliation":[{"name":"Free University of Bozen-Bolzano, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Davide","family":"Lanti","sequence":"additional","affiliation":[{"name":"Free University of Bozen-Bolzano, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hendrik","family":"L\u00fccke-Tieke","sequence":"additional","affiliation":[{"name":"Fraunhofer IGD, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Alessandro","family":"Mosca","sequence":"additional","affiliation":[{"name":"Free University of Bozen-Bolzano, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tarcisio","family":"Mendes de Farias","sequence":"additional","affiliation":[{"name":"Swiss Institute of Bioinformatics, Switzerland"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dimitris","family":"Papadopoulos","sequence":"additional","affiliation":[{"name":"Infili, Greece"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yogendra","family":"Patil","sequence":"additional","affiliation":[{"name":"University Grenoble Alpes, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Guillem","family":"Rull","sequence":"additional","affiliation":[{"name":"SIRIS Academic, Spain"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ellery","family":"Smith","sequence":"additional","affiliation":[{"name":"Zurich University of Applied Sciences, Switzerland"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dimitrios","family":"Skoutas","sequence":"additional","affiliation":[{"name":"Athena Research Center, Greece"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Srividya","family":"Subramanian","sequence":"additional","affiliation":[{"name":"Max Planck Institute for Extraterrestrial Physics, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kurt","family":"Stockinger","sequence":"additional","affiliation":[{"name":"Zurich University of Applied Sciences, Switzerland"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2022,1,31]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-019-00567-8"},{"key":"e_1_2_1_2_1","volume-title":"INODE: building an end-to-end data exploration system in practice [extended vision]. CoRR, abs\/2104.04194","author":"Amer-Yahia S.","year":"2021","unstructured":"S. Amer-Yahia , G. Koutrika , F. Bastian , T. Belmpas , M. Braschler , U. Brunner , D. Calvanese , M. Fabricius , O. Gkini , C. Kosten , D. Lanti , A. Litke , H. L\u00a8ucke-Tieke , F. A. Massucci , T. M. de Farias , A. Mosca , F. Multari , N. Papadakis , D. Papadopoulos , Y. Patil , A. Personnaz , G. Rull , A. C. Sima , E. Smith , D. Skoutas , S. Subramanian , G. Xiao , and K. Stockinger . INODE: building an end-to-end data exploration system in practice [extended vision]. CoRR, abs\/2104.04194 , 2021 . S. Amer-Yahia, G. Koutrika, F. Bastian, T. Belmpas, M. Braschler, U. Brunner, D. Calvanese, M. Fabricius, O. Gkini, C. Kosten, D. Lanti, A. Litke, H. L\u00a8ucke-Tieke, F. A. Massucci, T. M. de Farias, A. Mosca, F. Multari, N. Papadakis, D. Papadopoulos, Y. Patil, A. Personnaz, G. Rull, A. C. Sima, E. Smith, D. Skoutas, S. Subramanian, G. Xiao, and K. Stockinger. INODE: building an end-to-end data exploration system in practice [extended vision]. CoRR, abs\/2104.04194, 2021."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.14778\/2336664.2336667"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE51399.2021.00220"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871501"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2016.2599168"},{"key":"e_1_2_1_7_1","volume-title":"Pyexplore: Clustering-based sql query recommendations. In under submission","author":"Glenis A.","year":"2020","unstructured":"A. Glenis , Y. Stavrakas , and G. Koutrika . Pyexplore: Clustering-based sql query recommendations. In under submission , 2020 . A. Glenis, Y. Stavrakas, and G. Koutrika. Pyexplore: Clustering-based sql query recommendations. In under submission, 2020."},{"key":"e_1_2_1_8_1","volume-title":"SQL query completion for data exploration. CoRR, abs\/1802.02872","author":"Guilly M. L.","year":"2018","unstructured":"M. L. Guilly , J. Petit , and V. Scuturici . SQL query completion for data exploration. CoRR, abs\/1802.02872 , 2018 . M. L. Guilly, J. Petit, and V. Scuturici. SQL query completion for data exploration. CoRR, abs\/1802.02872, 2018."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989487"},{"key":"e_1_2_1_10_1","volume-title":"ISO 9241--210: 2019 - Ergonomics of Human-System Interaction - Part 210: Human-Centred Design for Interactive Systems","author":"International Organization for Standardization.","year":"2019","unstructured":"International Organization for Standardization. ISO 9241--210: 2019 - Ergonomics of Human-System Interaction - Part 210: Human-Centred Design for Interactive Systems , 2019 . International Organization for Standardization. ISO 9241--210:2019 - Ergonomics of Human-System Interaction - Part 210: Human-Centred Design for Interactive Systems, 2019."},{"key":"e_1_2_1_11_1","volume-title":"CIDR","author":"John R. J. L.","year":"2017","unstructured":"R. J. L. John , N. Potti , and J. M. Patel . Ava: From data to insights through conversations . In CIDR , 2017 . R. J. L. John, N. Potti, and J. M. Patel. Ava: From data to insights through conversations. In CIDR, 2017."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2213836.2213929"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3196909"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389752"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/543613.543644"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(02)00372-7"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3335783.3335800"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/1394399"},{"key":"e_1_2_1_19_1","volume-title":"A methodology for open information extraction and representation from large scientific corpora: The cord-19 data exploration use case. Applied Sciences, 10(16)","author":"Papadopoulos D.","year":"2020","unstructured":"D. Papadopoulos , N. Papadakis , and A. Litke . A methodology for open information extraction and representation from large scientific corpora: The cord-19 data exploration use case. Applied Sciences, 10(16) , 2020 . D. Papadopoulos, N. Papadakis, and A. Litke. A methodology for open information extraction and representation from large scientific corpora: The cord-19 data exploration use case. Applied Sciences, 10(16), 2020."},{"key":"e_1_2_1_20_1","volume-title":"Proc. of the 14th Int. Semantic Web Conf., Posters & Demonstrations Track (ISWC)","author":"Sequeda J. F.","year":"2015","unstructured":"J. F. Sequeda and D. P. Miranker . Ultrawrap Mapper: A semi-automatic relational database to RDF (RDB2RDF) mapping tool . In Proc. of the 14th Int. Semantic Web Conf., Posters & Demonstrations Track (ISWC) , 2015 . J. F. Sequeda and D. P. Miranker. Ultrawrap Mapper: A semi-automatic relational database to RDF (RDB2RDF) mapping tool. In Proc. of the 14th Int. Semantic Web Conf., Posters & Demonstrations Track (ISWC), 2015."},{"key":"e_1_2_1_21_1","first-page":"2019","article-title":"Enabling semantic queries across federated bioinformatics databases","author":"Sima A. C.","year":"2019","unstructured":"A. C. Sima , T. Mendes de Farias , E. Zbinden , M. Anisimova , M. Gil , H. Stockinger , K. Stockinger , M. Robinson-Rechavi , and C. Dessimoz . Enabling semantic queries across federated bioinformatics databases . Database , 2019 , 2019 . A. C. Sima, T. Mendes de Farias, E. Zbinden, M. Anisimova, M. Gil, H. Stockinger, K. Stockinger, M. Robinson-Rechavi, and C. Dessimoz. Enabling semantic queries across federated bioinformatics databases. Database, 2019, 2019.","journal-title":"Database"},{"key":"e_1_2_1_22_1","volume-title":"Information Systems","author":"Smith E.","year":"2021","unstructured":"E. Smith , D. Papadopoulos , M. Braschler , and K. Stockinger . Lillie: Information extraction and database integration using linguistics and learning-based algorithms . Information Systems , 2021 . E. Smith, D. Papadopoulos, M. Braschler, and K. Stockinger. Lillie: Information extraction and database integration using linguistics and learning-based algorithms. Information Systems, 2021."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2017.109"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/2854210.2854265"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3393817"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/3304652.3304791"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-00671-6_21"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-62466-8_17"}],"container-title":["ACM SIGMOD Record"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3516431.3516436","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3516431.3516436","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:21Z","timestamp":1750188621000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3516431.3516436"}},"subtitle":["Building an End-to-End Data Exploration System in Practice"],"short-title":[],"issued":{"date-parts":[[2022,1,31]]},"references-count":28,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2022,1,31]]}},"alternative-id":["10.1145\/3516431.3516436"],"URL":"https:\/\/doi.org\/10.1145\/3516431.3516436","relation":{},"ISSN":["0163-5808"],"issn-type":[{"value":"0163-5808","type":"print"}],"subject":[],"published":{"date-parts":[[2022,1,31]]},"assertion":[{"value":"2022-01-31","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}