{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,8]],"date-time":"2026-02-08T15:33:55Z","timestamp":1770564835437,"version":"3.49.0"},"reference-count":30,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2016,2,1]],"date-time":"2016-02-01T00:00:00Z","timestamp":1454284800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Internet Technol."],"published-print":{"date-parts":[[2016,2,24]]},"abstract":"<jats:p>\n            One of the main challenges in data matching and data cleaning, in highly integrated systems, is\n            <jats:italic>duplicates detection<\/jats:italic>\n            . While the literature abounds of approaches detecting duplicates corresponding to the same real-world entity, most of these approaches tend to eliminate duplicates (wrong information) from the sources, hence leading to what is called\n            <jats:italic>data repair.<\/jats:italic>\n            In this article, we propose a framework that automatically detects duplicates at query time and effectively identifies the consistent version of the data, while keeping inconsistent data in the sources. Our framework uses matching dependencies (MDs) to detect duplicates through the concept of data reconciliation rules (DRR) and conditional function dependencies (CFDs) to assess the quality of different attribute values. We also build a duplicate reconciliation index (\n            <jats:italic>DRI<\/jats:italic>\n            ), based on clusters of duplicates detected by a set of DRRs to speed up the online data reconciliation process. Our experiments of a real-world data collection show the efficiency and effectiveness of our framework.\n          <\/jats:p>","DOI":"10.1145\/2806888","type":"journal-article","created":{"date-parts":[[2016,2,1]],"date-time":"2016-02-01T20:37:54Z","timestamp":1454359074000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Quality-Based Online Data Reconciliation"],"prefix":"10.1145","volume":"16","author":[{"given":"Asma","family":"Abboura","sequence":"first","affiliation":[{"name":"University of Oran1, Algeria"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Soror","family":"Sahri","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris Descartes Sorbonnes Paris Cit\u00e9, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Latifa","family":"Baba-Hamed","sequence":"additional","affiliation":[{"name":"University of Oran1, Algeria"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mourad","family":"Ouziri","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris Descartes Sorbonnes Paris Cit\u00e9, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Salima","family":"Benbernou","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris Descartes Sorbonnes Paris Cit\u00e9, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2016,2]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148177"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2006.35"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-008-0098-x"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1147376.1147391"},{"key":"e_1_2_1_5_1","volume-title":"Tractable vs. intractable cases of matching dependencies for query answering under entity resolution. arXiv preprint arXiv:1309.1884","author":"Bertossi Leopoldo","year":"2013","unstructured":"Leopoldo Bertossi and Jaffer Gardezi . 2013. Tractable vs. intractable cases of matching dependencies for query answering under entity resolution. arXiv preprint arXiv:1309.1884 ( 2013 ). Leopoldo Bertossi and Jaffer Gardezi. 2013. Tractable vs. intractable cases of matching dependencies for query answering under entity resolution. arXiv preprint arXiv:1309.1884 (2013)."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00224-012-9402-7"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2007.367920"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2008.4497460"},{"key":"e_1_2_1_9_1","volume-title":"Proc. of VLDB (VLDB\u201907)","author":"Bravo Loreto","year":"2007","unstructured":"Loreto Bravo , Wenfei Fan , and Shuai Ma . 2007 . Extending dependencies with conditions . In Proc. of VLDB (VLDB\u201907) . VLDB Endowment, 243--254. Loreto Bravo, Wenfei Fan, and Shuai Ma. 2007. Extending dependencies with conditions. In Proc. of VLDB (VLDB\u201907). VLDB Endowment, 243--254."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/1453856.1453980"},{"key":"e_1_2_1_11_1","volume-title":"Proc. of VLDB (VLDB\u201907)","author":"Cong Gao","year":"2007","unstructured":"Gao Cong , Wenfei Fan , Floris Geerts , Xibei Jia , and Shuai Ma . 2007 . Improving data quality: Consistency and accuracy . In Proc. of VLDB (VLDB\u201907) . VLDB Endowment, 315--326. Gao Cong, Wenfei Fan, Floris Geerts, Xibei Jia, and Shuai Ma. 2007. Improving data quality: Consistency and accuracy. In Proc. of VLDB (VLDB\u201907). VLDB Endowment, 315--326."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1559845.1559895"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2007.9"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687627.1687674"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989373"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-32925-8_10"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536360.2536363"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.14778\/1453856.1453900"},{"key":"e_1_2_1_19_1","volume-title":"Proc. of CIDR.","author":"Jeffery Shawn R","year":"2013","unstructured":"Shawn R Jeffery , Liwen Sun , Matt DeLand , Nick Pendar , Rick Barber , and Andrew Galdi . 2013 . Arnold: Declarative crowd-machine data integration . In Proc. of CIDR. Shawn R Jeffery, Liwen Sun, Matt DeLand, Nick Pendar, Rick Barber, and Andrew Galdi. 2013. Arnold: Declarative crowd-machine data integration. In Proc. of CIDR."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2009.10.003"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807202"},{"key":"e_1_2_1_22_1","first-page":"11","volume-title":"Proc. VLDB Endow. 4","author":"Liu Xuan","year":"2011","unstructured":"Xuan Liu , Xin Luna Dong , Beng Chin Ooi , and Divesh Srivastava . 2011 . Online data fusion . Proc. VLDB Endow. 4 , 11 (2011). Xuan Liu, Xin Luna Dong, Beng Chin Ooi, and Divesh Srivastava. 2011. Online data fusion. Proc. VLDB Endow. 4, 11 (2011)."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2593674"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-03599-4_20"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1076034.1076047"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1645953.1646135"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277788"},{"key":"e_1_2_1_28_1","volume-title":"Anish Das Sarma, Michael J. Franklin, and Alon Y. Halevy.","author":"Wang Daisy Zhe","year":"2009","unstructured":"Daisy Zhe Wang , Xin Luna Dong , Anish Das Sarma, Michael J. Franklin, and Alon Y. Halevy. 2009 . Functional dependency generation and applications in Pay-As-You-Go data integration systems. In Proc. of WebDB. Daisy Zhe Wang, Xin Luna Dong, Anish Das Sarma, Michael J. Franklin, and Alon Y. Halevy. 2009. Functional dependency generation and applications in Pay-As-You-Go data integration systems. In Proc. of WebDB."},{"key":"e_1_2_1_29_1","unstructured":"We Wayne. 2004. Data quality and the bottom line: Achieving business success through a commitment to high quality data. TDWI Report.  We Wayne. 2004. Data quality and the bottom line: Achieving business success through a commitment to high quality data. TDWI Report."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.14778\/1952376.1952378"}],"container-title":["ACM Transactions on Internet Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2806888","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2806888","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:07:22Z","timestamp":1750223242000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2806888"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,2]]},"references-count":30,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2016,2,24]]}},"alternative-id":["10.1145\/2806888"],"URL":"https:\/\/doi.org\/10.1145\/2806888","relation":{},"ISSN":["1533-5399","1557-6051"],"issn-type":[{"value":"1533-5399","type":"print"},{"value":"1557-6051","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,2]]},"assertion":[{"value":"2014-11-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2016-02-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}