{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T08:55:57Z","timestamp":1775638557616,"version":"3.50.1"},"reference-count":13,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2021,7]]},"abstract":"<jats:p>Data science growing success relies on knowing where a relevant dataset exists, understanding its impact on a specific task, finding ways to enrich a dataset, and leveraging insights derived from it. With the growth of open data initiatives, data scientists need an extensible set of effective discovery operations to find relevant data from their enterprise datasets accessible via data discovery systems or open datasets accessible via data portals. Existing portals and systems suffer from limited discovery support and do not track the use of a dataset and insights derived from it. We will demonstrate KGLac, a system that captures metadata and semantics of datasets to construct a knowledge graph (GLac) interconnecting data items, e.g., tables and columns. KGLac supports various data discovery operations via SPARQL queries for table discovery, unionable and joinable tables, plus annotation with related derived insights. We harness a broad range of Machine Learning (ML) approaches with GLac to enable automatic graph learning for advanced and semantic data discovery. The demo will showcase how KGLac facilitates data discovery and enrichment while developing an ML pipeline to evaluate potential gender salary bias in IT jobs.<\/jats:p>","DOI":"10.14778\/3476311.3476317","type":"journal-article","created":{"date-parts":[[2021,10,28]],"date-time":"2021-10-28T22:48:43Z","timestamp":1635461323000},"page":"2675-2678","source":"Crossref","is-referenced-by-count":19,"title":["A demonstration of KGLac"],"prefix":"10.14778","volume":"14","author":[{"given":"Ahmed","family":"Helal","sequence":"first","affiliation":[{"name":"Concordia University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mossad","family":"Helali","sequence":"additional","affiliation":[{"name":"Concordia University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Khaled","family":"Ammar","sequence":"additional","affiliation":[{"name":"Borealis AI"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Essam","family":"Mansour","sequence":"additional","affiliation":[{"name":"Concordia University"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,10,28]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/3312004"},{"key":"e_1_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Alex Bogatu Alvaro A. A. Fernandes Norman W. Paton and Nikolaos Konstantinou. 2020. Dataset Discovery in Data Lakes. In ICDE.  Alex Bogatu Alvaro A. A. Fernandes Norman W. Paton and Nikolaos Konstantinou. 2020. Dataset Discovery in Data Lakes. In ICDE .","DOI":"10.1109\/ICDE48307.2020.00067"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.14778\/3407790.3407810"},{"key":"e_1_2_1_4_1","volume-title":"Aurum: A Data Discovery System. In ICDE.","author":"Fernandez Raul Castro","year":"2018","unstructured":"Raul Castro Fernandez , Ziawasch Abedjan , Famien Koko , Gina Yuan , Samuel Madden , and Michael Stonebraker . 2018 . Aurum: A Data Discovery System. In ICDE. Raul Castro Fernandez, Ziawasch Abedjan, Famien Koko, Gina Yuan, Samuel Madden, and Michael Stonebraker. 2018. Aurum: A Data Discovery System. In ICDE."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.5555\/3016100.3016266"},{"key":"e_1_2_1_6_1","volume-title":"SEM4TRA-AMAR@SEMANTICS","author":"Hartig Olaf","unstructured":"Olaf Hartig . 2019. Foundations to Query Labeled Property Graphs using SPARQL . In SEM4TRA-AMAR@SEMANTICS , Vol. 2447 . Olaf Hartig. 2019. Foundations to Query Labeled Property Graphs using SPARQL. In SEM4TRA-AMAR@SEMANTICS, Vol. 2447."},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Jonas Mueller and Alex Smola. 2019. Recognizing Variables from Their Data via Deep Embeddings of Distributions. In ICDM.  Jonas Mueller and Alex Smola. 2019. Recognizing Variables from Their Data via Deep Embeddings of Distributions. In ICDM .","DOI":"10.1109\/ICDM.2019.00158"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3380605"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.14778\/3192965.3192973"},{"key":"e_1_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Z. Wu S. Pan F. Chen G. Long C. Zhang and P. S. Yu. 2020. A Comprehensive Survey on Graph Neural Networks. TNNLS.  Z. Wu S. Pan F. Chen G. Long C. Zhang and P. S. Yu. 2020. A Comprehensive Survey on Graph Neural Networks. TNNLS .","DOI":"10.1109\/TNNLS.2020.2978386"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389726"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2020.2981333"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3300065"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3476311.3476317","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T11:26:59Z","timestamp":1672226819000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3476311.3476317"}},"subtitle":["a data discovery and enrichment platform for data science"],"short-title":[],"issued":{"date-parts":[[2021,7]]},"references-count":13,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2021,7]]}},"alternative-id":["10.14778\/3476311.3476317"],"URL":"https:\/\/doi.org\/10.14778\/3476311.3476317","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2021,7]]}}}