{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,6]],"date-time":"2025-12-06T04:48:09Z","timestamp":1764996489908,"version":"3.44.0"},"reference-count":14,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2023,8]]},"abstract":"<jats:p>The task of entity resolution (ER) aims to detect multiple records describing the same real-world entity in datasets and to consolidate them into a single consistent record. ER plays a fundamental role in guaranteeing good data quality, e.g., as input for data science pipelines. Yet, the traditional approach to ER requires cleaning the entire data before being able to run consistent queries on it; hence, users struggle to tackle common scenarios with limited time or resources (e.g., when the data changes frequently or the user is only interested in a portion of the dataset for the task).<\/jats:p>\n          <jats:p>We previously introduced BrewER, a framework to evaluate SQL SP queries on dirty data while progressively returning results as if they were issued on cleaned data, according to a priority defined by the user. In this demonstration, we show how BrewER can be exploited to ease the burden of ER, allowing data scientists to save a significant amount of resources for their tasks.<\/jats:p>","DOI":"10.14778\/3611540.3611612","type":"journal-article","created":{"date-parts":[[2023,9,15]],"date-time":"2023-09-15T11:32:37Z","timestamp":1694777557000},"page":"4026-4029","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["BrewER: Entity Resolution On-Demand"],"prefix":"10.14778","volume":"16","author":[{"given":"Luca","family":"Zecchini","sequence":"first","affiliation":[{"name":"University of Modena and Reggio Emilia, Modena, Italy"}]},{"given":"Giovanni","family":"Simonini","sequence":"additional","affiliation":[{"name":"University of Modena and Reggio Emilia, Modena, Italy"}]},{"given":"Sonia","family":"Bergamaschi","sequence":"additional","affiliation":[{"name":"University of Modena and Reggio Emilia, Modena, Italy"}]},{"given":"Felix","family":"Naumann","sequence":"additional","affiliation":[{"name":"Hasso Plattner Institute, University of Potsdam, Potsdam, Germany"}]}],"member":"320","published-online":{"date-parts":[[2023,8]]},"reference":[{"key":"e_1_2_1_1_1","first-page":"1846","article-title":"Query-Driven Approach to Entity Resolution","volume":"6","author":"Hotham Altwaijry","year":"2013","unstructured":"Hotham Altwaijry et al. 2013. Query-Driven Approach to Entity Resolution. PVLDB 6, 14 (2013), 1846--1857.","journal-title":"PVLDB"},{"key":"e_1_2_1_2_1","first-page":"120","article-title":"QuERy: A Framework for Integrating Entity Resolution with Query Processing","volume":"9","author":"Hotham Altwaijry","year":"2015","unstructured":"Hotham Altwaijry et al. 2015. QuERy: A Framework for Integrating Entity Resolution with Query Processing. PVLDB 9, 3 (2015), 120--131.","journal-title":"PVLDB"},{"key":"e_1_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Vassilis Christophides et al. 2021. An Overview of End-to-End Entity Resolution for Big Data. CSUR 53 6 (2021) 127:1--127:42.","DOI":"10.1145\/3418896"},{"key":"e_1_2_1_4_1","volume-title":"Alaska: A Flexible Benchmark for Data Integration Tasks. arXiv preprint arXiv:2101.11259.","author":"Valter Crescenzi","year":"2021","unstructured":"Valter Crescenzi et al. 2021. Alaska: A Flexible Benchmark for Data Integration Tasks. arXiv preprint arXiv:2101.11259."},{"volume-title":"Big Data Integration","author":"Dong Xin Luna","key":"e_1_2_1_5_1","unstructured":"Xin Luna Dong and Divesh Srivastava. 2015. Big Data Integration. Morgan & Claypool Publishers."},{"key":"e_1_2_1_6_1","unstructured":"Luca Gagliardelli et al. 2019. SparkER: Scaling Entity Resolution in Spark. In EDBT. OpenProceedings.org 602--605."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3444831.3444835"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.14778\/2994509.2994535"},{"key":"e_1_2_1_9_1","first-page":"50","article-title":"Deep Entity Matching with Pre-Trained Language Models","volume":"14","author":"Yuliang Li","year":"2020","unstructured":"Yuliang Li et al. 2020. Deep Entity Matching with Pre-Trained Language Models. PVLDB 14, 1 (2020), 50--60.","journal-title":"PVLDB"},{"key":"e_1_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Sidharth Mudgal et al. 2018. Deep Learning for Entity Matching: A Design Space Exploration. In SIGMOD. ACM 19--34.","DOI":"10.1145\/3183713.3196926"},{"key":"e_1_2_1_11_1","first-page":"1316","article-title":"Progressive Duplicate Detection","volume":"27","author":"Thorsten Papenbrock","year":"2015","unstructured":"Thorsten Papenbrock et al. 2015. Progressive Duplicate Detection. TKDE 27, 5 (2015), 1316--1329.","journal-title":"TKDE"},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Giovanni Simonini et al. 2018. Schema-agnostic Progressive Entity Resolution. In ICDE. IEEE Computer Society 53--64.","DOI":"10.1109\/ICDE.2018.00015"},{"key":"e_1_2_1_13_1","first-page":"1506","article-title":"Entity Resolution On-Demand","volume":"15","author":"Giovanni Simonini","year":"2022","unstructured":"Giovanni Simonini et al. 2022. Entity Resolution On-Demand. PVLDB 15, 7 (2022), 1506--1518.","journal-title":"PVLDB"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2012.43"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3611540.3611612","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,10]],"date-time":"2025-09-10T22:34:59Z","timestamp":1757543699000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3611540.3611612"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8]]},"references-count":14,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2023,8]]}},"alternative-id":["10.14778\/3611540.3611612"],"URL":"https:\/\/doi.org\/10.14778\/3611540.3611612","relation":{},"ISSN":["2150-8097"],"issn-type":[{"type":"print","value":"2150-8097"}],"subject":[],"published":{"date-parts":[[2023,8]]},"assertion":[{"value":"2023-08-01","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}