{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,19]],"date-time":"2026-05-19T07:17:32Z","timestamp":1779175052612,"version":"3.51.4"},"reference-count":83,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2021,1,6]],"date-time":"2021-01-06T00:00:00Z","timestamp":1609891200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Data and Information Quality"],"published-print":{"date-parts":[[2021,3,31]]},"abstract":"<jats:p>Entity matching refers to the task of determining whether two different representations refer to the same real-world entity. It continues to be a prevalent problem for many organizations where data resides in different sources and duplicates need to be identified and managed. The term \u201centity matching\u201d also loosely refers to the broader problem of determining whether two heterogeneous representations of <jats:italic>different entities<\/jats:italic> should be associated together. This problem has an even wider scope of applications, from determining the subsidiaries of companies to matching jobs to job seekers, which has impactful consequences.<\/jats:p><jats:p>In this article, we first report on our recent system D<jats:sc>ITTO<\/jats:sc>, which is an example of a modern entity matching system based on pre-trained language models. Then we summarize recent solutions in applying deep learning and pre-trained language models for solving the entity matching task. 
Finally, we discuss research directions beyond entity matching, including the promise of synergistically integrating blocking and entity matching steps together, the need to examine methods to alleviate steep training data requirements that are typical of deep learning or pre-trained language models, and the importance of generalizing entity matching solutions to handle the broader entity matching problem, which leads to an even more pressing need to explain matching outcomes.<\/jats:p>","DOI":"10.1145\/3431816","type":"journal-article","created":{"date-parts":[[2021,1,6]],"date-time":"2021-01-06T13:22:41Z","timestamp":1609939361000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":49,"title":["Deep Entity Matching"],"prefix":"10.1145","volume":"13","author":[{"given":"Yuliang","family":"Li","sequence":"first","affiliation":[{"name":"Megagon Labs, Mountain View, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jinfeng","family":"Li","sequence":"additional","affiliation":[{"name":"Megagon Labs, Mountain View, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yoshihiko","family":"Suhara","sequence":"additional","affiliation":[{"name":"Megagon Labs, Mountain View, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jin","family":"Wang","sequence":"additional","affiliation":[{"name":"Megagon Labs, Mountain View, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wataru","family":"Hirota","sequence":"additional","affiliation":[{"name":"Megagon Labs, Mountain View, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wang-Chiew","family":"Tan","sequence":"additional","affiliation":[{"name":"Megagon Labs, Mountain View, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,1,6]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"To index or not to index: 
Optimizing exact maximum inner product search","author":"Abuzaid Firas","unstructured":"Firas Abuzaid , Geet Sethi , Peter Bailis , and Matei Zaharia . 2019. To index or not to index: Optimizing exact maximum inner product search . In ICDE. IEEE , 1250--1261. Firas Abuzaid, Geet Sethi, Peter Bailis, and Matei Zaharia. 2019. To index or not to index: Optimizing exact maximum inner product search. In ICDE. IEEE, 1250--1261."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3306618.3314243"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3209581"},{"key":"e_1_2_1_4_1","volume-title":"Fairness in machine learning. NIPS Tutorial 1","author":"Barocas Solon","year":"2017","unstructured":"Solon Barocas , Moritz Hardt , and Arvind Narayanan . 2017. Fairness in machine learning. NIPS Tutorial 1 ( 2017 ). Solon Barocas, Moritz Hardt, and Arvind Narayanan. 2017. Fairness in machine learning. NIPS Tutorial 1 (2017)."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00051"},{"key":"e_1_2_1_6_1","unstructured":"Ursin Brunner and Kurt Stockinger. 2020. Entity matching with transformer architectures-a step forward in data integration. In EDBT. 463--473. Ursin Brunner and Kurt Stockinger. 2020. Entity matching with transformer architectures-a step forward in data integration. In EDBT. 463--473."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2894748"},{"key":"e_1_2_1_8_1","doi-asserted-by":"crossref","unstructured":"Riccardo Cappuzzo Paolo Papotti and Saravanan Thirumuruganathan. 2020. Creating embeddings of heterogeneous relational datasets for data integration tasks. In SIGMOD. 1335--1349. Riccardo Cappuzzo Paolo Papotti and Saravanan Thirumuruganathan. 2020. Creating embeddings of heterogeneous relational datasets for data integration tasks. In SIGMOD. 
1335--1349.","DOI":"10.1145\/3318464.3389742"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3376898"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2011.127"},{"key":"e_1_2_1_11_1","volume-title":"ACL Workshop BlackboxNLP. 276--286","author":"Clark Kevin","unstructured":"Kevin Clark , Urvashi Khandelwal , Omer Levy , and Christopher D. Manning . 2019. What does BERT look at? An analysis of BERT\u2019s attention . In ACL Workshop BlackboxNLP. 276--286 . Kevin Clark, Urvashi Khandelwal, Omer Levy, and Christopher D. Manning. 2019. What does BERT look at? An analysis of BERT\u2019s attention. In ACL Workshop BlackboxNLP. 276--286."},{"key":"e_1_2_1_12_1","unstructured":"Chris DeBrusk. 2018. The risk of machine-learning bias (and how to prevent it). MIT Sloan Management Review. Chris DeBrusk. 2018. The risk of machine-learning bias (and how to prevent it). MIT Sloan Management Review."},{"key":"e_1_2_1_13_1","volume-title":"BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT. 4171--4186.","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT. 4171--4186. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT. 4171--4186."},{"key":"e_1_2_1_14_1","first-page":"1454","article-title":"Distributed representations of tuples for entity resolution","volume":"11","author":"Ebraheem Muhammad","year":"2018","unstructured":"Muhammad Ebraheem , Saravanan Thirumuruganathan , Shafiq Joty , Mourad Ouzzani , and Nan Tang . 2018 . Distributed representations of tuples for entity resolution . PVLDB 11 , 11 (2018), 1454 -- 1467 . Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq Joty, Mourad Ouzzani, and Nan Tang. 
2018. Distributed representations of tuples for entity resolution. PVLDB 11, 11 (2018), 1454--1467.","journal-title":"PVLDB"},{"key":"e_1_2_1_15_1","unstructured":"Vasilis Efthymiou George Papadakis Kostas Stefanidis and Vassilis Christophides. 2019. MinoanER: Schema-agnostic non-iterative massively parallel resolution of web entities. In EDBT Melanie Herschel Helena Galhardas Berthold Reinwald Irini Fundulaki Carsten Binnig and Zoi Kaoudi (Eds.). 373--384. Vasilis Efthymiou George Papadakis Kostas Stefanidis and Vassilis Christophides. 2019. MinoanER: Schema-agnostic non-iterative massively parallel resolution of web entities. In EDBT Melanie Herschel Helena Galhardas Berthold Reinwald Irini Fundulaki Carsten Binnig and Zoi Kaoudi (Eds.). 373--384."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1969.10501049"},{"key":"e_1_2_1_17_1","volume-title":"Fong and Andrea Vedaldi","author":"Ruth","year":"2017","unstructured":"Ruth C. Fong and Andrea Vedaldi . 2017 . Interpretable explanations of black boxes by meaningful perturbation. In ICCV. 3429--3437. Ruth C. Fong and Andrea Vedaldi. 2017. Interpretable explanations of black boxes by meaningful perturbation. In ICCV. 3429--3437."},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Cheng Fu Xianpei Han Jiaming He and Le Sun. 2020. Hierarchical matching network for heterogeneous entity resolution. In IJCAI. 3665--3671. Cheng Fu Xianpei Han Jiaming He and Le Sun. 2020. Hierarchical matching network for heterogeneous entity resolution. In IJCAI. 3665--3671.","DOI":"10.24963\/ijcai.2020\/507"},{"key":"e_1_2_1_19_1","volume-title":"End-to-end multi-perspective matching for entity resolution","author":"Fu Cheng","unstructured":"Cheng Fu , Xianpei Han , Le Sun , Bo Chen , Wei Zhang , Suhui Wu , and Hao Kong . 2019. End-to-end multi-perspective matching for entity resolution . In IJCAI. AAAI Press , 4961--4967. 
Cheng Fu, Xianpei Han, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, and Hao Kong. 2019. End-to-end multi-perspective matching for entity resolution. In IJCAI. AAAI Press, 4961--4967."},{"key":"e_1_2_1_20_1","first-page":"1","article-title":"Domain-adversarial training of neural networks","volume":"17","author":"Ganin Yaroslav","year":"2016","unstructured":"Yaroslav Ganin , Evgeniya Ustinova , Hana Ajakan , Pascal Germain , Hugo Larochelle , Fran\u00e7ois Laviolette , Mario Marchand , and Victor Lempitsky . 2016 . Domain-adversarial training of neural networks . J. Mach. Learn. Res. 17 , 59 (2016), 1 -- 35 . Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, Fran\u00e7ois Laviolette, Mario Marchand, and Victor Lempitsky. 2016. Domain-adversarial training of neural networks. J. Mach. Learn. Res. 17, 59 (2016), 1--35.","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_2_1_21_1","doi-asserted-by":"crossref","unstructured":"Sudipto Guha H. V. Jagadish Nick Koudas Divesh Srivastava and Ting Yu. 2002. Approximate XML joins. In SIGMOD. 287--298. Sudipto Guha H. V. Jagadish Nick Koudas Divesh Srivastava and Ting Yu. 2002. Approximate XML joins. In SIGMOD. 287--298.","DOI":"10.1145\/564691.564725"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3236009"},{"key":"e_1_2_1_23_1","doi-asserted-by":"crossref","unstructured":"Sairam Gurajada Lucian Popa Kun Qian and Prithviraj Sen. 2019. Learning-based methods with human-in-the-loop for entity resolution. In CIKM. Sairam Gurajada Lucian Popa Kun Qian and Prithviraj Sen. 2019. Learning-based methods with human-in-the-loop for entity resolution. In CIKM.","DOI":"10.1145\/3357384.3360316"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/358916.358995"},{"key":"e_1_2_1_25_1","volume-title":"Stolfo","author":"Hern\u00e1ndez Mauricio A.","year":"1995","unstructured":"Mauricio A. Hern\u00e1ndez and Salvatore J . Stolfo . 1995 . The merge\/purge problem for large databases. 
In SIGMOD. 127--138. Mauricio A. Hern\u00e1ndez and Salvatore J. Stolfo. 1995. The merge\/purge problem for large databases. In SIGMOD. 127--138."},{"key":"e_1_2_1_26_1","volume-title":"Wallace","author":"Jain Sarthak","year":"2019","unstructured":"Sarthak Jain and Byron C . Wallace . 2019 . Attention is not explanation. In NAACL-HLT\u2019 19. 3543--3556. Sarthak Jain and Byron C. Wallace. 2019. Attention is not explanation. In NAACL-HLT\u201919. 3543--3556."},{"key":"e_1_2_1_27_1","unstructured":"Heng Ji and Ralph Grishman. 2011. Knowledge base population: Successful approaches and challenges. In ACL: Human Language Technologies. 1148--1158. Heng Ji and Ralph Grishman. 2011. Knowledge base population: Successful approaches and challenges. In ACL: Human Language Technologies. 1148--1158."},{"key":"e_1_2_1_28_1","doi-asserted-by":"crossref","unstructured":"Jungo Kasai Kun Qian Sairam Gurajada Yunyao Li and Lucian Popa. 2019. Low-resource deep entity resolution with transfer and active learning. In ACL. 5851--5861. Jungo Kasai Kun Qian Sairam Gurajada Yunyao Li and Lucian Popa. 2019. Low-resource deep entity resolution with transfer and active learning. In ACL. 5851--5861.","DOI":"10.18653\/v1\/P19-1586"},{"key":"e_1_2_1_29_1","unstructured":"Pang Wei Koh and Percy Liang. 2017. Understanding black-box predictions via influence functions. In ICML. 1885--1894. Pang Wei Koh and Percy Liang. 2017. Understanding black-box predictions via influence functions. In ICML. 1885--1894."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.14778\/2994509.2994535"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2009.10.003"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920904"},{"key":"e_1_2_1_33_1","doi-asserted-by":"crossref","unstructured":"Nick Koudas Sunita Sarawagi and Divesh Srivastava. 2006. Record linkage: Similarity measures and algorithms. In SIGMOD. 802--803. 
Nick Koudas Sunita Sarawagi and Divesh Srivastava. 2006. Record linkage: Similarity measures and algorithms. In SIGMOD. 802--803.","DOI":"10.1145\/1142473.1142599"},{"key":"e_1_2_1_34_1","volume-title":"Efficient merging and filtering algorithms for approximate string searches","author":"Li Chen","unstructured":"Chen Li , Jiaheng Lu , and Yiming Lu. 2008. Efficient merging and filtering algorithms for approximate string searches . In ICDE. IEEE , 257--266. Chen Li, Jiaheng Lu, and Yiming Lu. 2008. Efficient merging and filtering algorithms for approximate string searches. In ICDE. IEEE, 257--266."},{"key":"e_1_2_1_35_1","volume-title":"Deep entity matching with pre-trained language models. PVLDB 14, 1","author":"Li Yuliang","year":"2021","unstructured":"Yuliang Li , Jinfeng Li , Yoshihiko Suhara , AnHai Doan , and Wang-Chiew Tan . 2021. Deep entity matching with pre-trained language models. PVLDB 14, 1 ( 2021 ). The full version is available at https:\/\/arxiv.org\/abs\/2004.00584. Yuliang Li, Jinfeng Li, Yoshihiko Suhara, AnHai Doan, and Wang-Chiew Tan. 2021. Deep entity matching with pre-trained language models. PVLDB 14, 1 (2021). The full version is available at https:\/\/arxiv.org\/abs\/2004.00584."},{"key":"e_1_2_1_36_1","volume-title":"RoBERTa: A robustly optimized bert pretraining approach. arXiv","author":"Liu Yinhan","year":"2019","unstructured":"Yinhan Liu , Myle Ott , Naman Goyal , Jingfei Du , Mandar Joshi , Danqi Chen , Omer Levy , Mike Lewis , Luke Zettlemoyer , and Veselin Stoyanov . 2019. RoBERTa: A robustly optimized bert pretraining approach. arXiv ( 2019 ). Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A robustly optimized bert pretraining approach. arXiv (2019)."},{"key":"e_1_2_1_37_1","volume-title":"Zemel","author":"Louizos Christos","year":"2016","unstructured":"Christos Louizos , Kevin Swersky , Yujia Li , Max Welling , and Richard S . 
Zemel . 2016 . The variational fair autoencoder. In ICLR, Yoshua Bengio and Yann LeCun (Eds .). Christos Louizos, Kevin Swersky, Yujia Li, Max Welling, and Richard S. Zemel. 2016. The variational fair autoencoder. In ICLR, Yoshua Bengio and Yann LeCun (Eds.)."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1038\/s42256-019-0138-9"},{"key":"e_1_2_1_39_1","volume-title":"Lundberg and Su-In Lee","author":"Scott","year":"2017","unstructured":"Scott M. Lundberg and Su-In Lee . 2017 . A unified approach to interpreting model predictions. In NeurIPS. 4765--4774. Scott M. Lundberg and Su-In Lee. 2017. A unified approach to interpreting model predictions. In NeurIPS. 4765--4774."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2507157.2507163"},{"key":"e_1_2_1_41_1","unstructured":"Venkata Vamsikrishna Meduri Lucian Popa Prithviraj Sen and Mohamed Sarwat. 2020. A comprehensive benchmark framework for active learning methods in entity matching. In SIGMOD. 1133--1147. Venkata Vamsikrishna Meduri Lucian Popa Prithviraj Sen and Mohamed Sarwat. 2020. A comprehensive benchmark framework for active learning methods in entity matching. In SIGMOD. 1133--1147."},{"key":"e_1_2_1_42_1","volume-title":"A survey on bias and fairness in machine learning. arXiv preprint arXiv:1908.09635","author":"Mehrabi Ninareh","year":"2019","unstructured":"Ninareh Mehrabi , Fred Morstatter , Nripsuta Saxena , Kristina Lerman , and Aram Galstyan . 2019. A survey on bias and fairness in machine learning. arXiv preprint arXiv:1908.09635 ( 2019 ). Ninareh Mehrabi, Fred Morstatter, Nripsuta Saxena, Kristina Lerman, and Aram Galstyan. 2019. A survey on bias and fairness in machine learning. arXiv preprint arXiv:1908.09635 (2019)."},{"key":"e_1_2_1_43_1","volume-title":"Snippext: Semi-supervised opinion mining with augmented data. In WWW. 617--628.","author":"Miao Zhengjie","year":"2020","unstructured":"Zhengjie Miao , Yuliang Li , Xiaolan Wang , and Wang-Chiew Tan . 
2020 . Snippext: Semi-supervised opinion mining with augmented data. In WWW. 617--628. Zhengjie Miao, Yuliang Li, Xiaolan Wang, and Wang-Chiew Tan. 2020. Snippext: Semi-supervised opinion mining with augmented data. In WWW. 617--628."},{"key":"e_1_2_1_44_1","unstructured":"Rada Mihalcea and Paul Tarau. 2004. TextRank: Bringing order into text. In EMNLP. 404--411. Rada Mihalcea and Paul Tarau. 2004. TextRank: Bringing order into text. In EMNLP. 404--411."},{"key":"e_1_2_1_45_1","unstructured":"Tomas Mikolov Ilya Sutskever Kai Chen Greg S. Corrado and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In NeurIPS. 3111--3119. Tomas Mikolov Ilya Sutskever Kai Chen Greg S. Corrado and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In NeurIPS. 3111--3119."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3276742"},{"key":"e_1_2_1_47_1","doi-asserted-by":"crossref","unstructured":"Sidharth Mudgal Han Li Theodoros Rekatsinas AnHai Doan Youngchoon Park Ganesh Krishnan Rohit Deep Esteban Arcaute and Vijay Raghavendra. 2018. Deep learning for entity matching: A design space exploration. In SIGMOD. 19--34. Sidharth Mudgal Han Li Theodoros Rekatsinas AnHai Doan Youngchoon Park Ganesh Krishnan Rohit Deep Esteban Arcaute and Vijay Raghavendra. 2018. Deep learning for entity matching: A design space exploration. In SIGMOD. 19--34.","DOI":"10.1145\/3183713.3196926"},{"key":"e_1_2_1_48_1","doi-asserted-by":"crossref","unstructured":"Jonas Mueller and Aditya Thyagarajan. 2016. Siamese recurrent architectures for learning sentence similarity. In AAAI. 2786--2792. Jonas Mueller and Aditya Thyagarajan. 2016. Siamese recurrent architectures for learning sentence similarity. In AAAI. 
2786--2792.","DOI":"10.1609\/aaai.v30i1.10350"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.2200\/S00262ED1V01Y201003DTM003"},{"key":"e_1_2_1_50_1","doi-asserted-by":"crossref","unstructured":"Hao Nie Xianpei Han Ben He Le Sun Bo Chen Wei Zhang Suhui Wu and Hao Kong. 2019. Deep sequence-to-sequence entity matching for heterogeneous entity resolution. In CIKM. 629--638. Hao Nie Xianpei Han Ben He Le Sun Bo Chen Wei Zhang Suhui Wu and Hao Kong. 2019. Deep sequence-to-sequence entity matching for heterogeneous entity resolution. In CIKM. 629--638.","DOI":"10.1145\/3357384.3358018"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2009.191"},{"key":"e_1_2_1_52_1","unstructured":"Ralph Peeters Christian Bizer and Goran Glava\u0161. [n.d.]. Intermediate training of BERT for product matching. In DI2KG Workshop at VLDB\u201920. Ralph Peeters Christian Bizer and Goran Glava\u0161. [n.d.]. Intermediate training of BERT for product matching. In DI2KG Workshop at VLDB\u201920."},{"key":"e_1_2_1_53_1","volume-title":"Manning","author":"Pennington Jeffrey","year":"2014","unstructured":"Jeffrey Pennington , Richard Socher , and Christopher D . Manning . 2014 . GloVe: Global vectors for word representation. In EMNLP. 1532--1543. Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. GloVe: Global vectors for word representation. In EMNLP. 1532--1543."},{"key":"e_1_2_1_54_1","doi-asserted-by":"crossref","unstructured":"Matthew Peters Mark Neumann Mohit Iyyer Matt Gardner Christopher Clark Kenton Lee and Luke Zettlemoyer. 2018. Deep contextualized word representations. In NAACL-HLT. 2227--2237. Matthew Peters Mark Neumann Mohit Iyyer Matt Gardner Christopher Clark Kenton Lee and Luke Zettlemoyer. 2018. Deep contextualized word representations. In NAACL-HLT. 
2227--2237.","DOI":"10.18653\/v1\/N18-1202"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/3308560.3316609"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/3336499.3338010"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.14778\/3352063.3352068"},{"key":"e_1_2_1_58_1","doi-asserted-by":"crossref","unstructured":"Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In EMNLP-IJCNLP. 3982--3992. Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In EMNLP-IJCNLP. 3982--3992.","DOI":"10.18653\/v1\/D19-1410"},{"key":"e_1_2_1_59_1","doi-asserted-by":"crossref","unstructured":"Marco Tulio Ribeiro Sameer Singh and Carlos Guestrin. 2016. \u201cWhy should I trust you?\u201d Explaining the predictions of any classifier. In KDD. 1135--1144. Marco Tulio Ribeiro Sameer Singh and Carlos Guestrin. 2016. \u201cWhy should I trust you?\u201d Explaining the predictions of any classifier. In KDD. 1135--1144.","DOI":"10.18653\/v1\/N16-3020"},{"key":"e_1_2_1_60_1","unstructured":"Alexander M. Rush Sumit Chopra and Jason Weston. 2015. A neural attention model for abstractive sentence summarization. In EMNLP. Alexander M. Rush Sumit Chopra and Jason Weston. 2015. A neural attention model for abstractive sentence summarization. In EMNLP."},{"key":"e_1_2_1_61_1","doi-asserted-by":"crossref","unstructured":"Babak Salimi Luke Rodriguez Bill Howe and Dan Suciu. 2019. Interventional fairness: Causal database repair for algorithmic fairness. In SIGMOD. 793--810. Babak Salimi Luke Rodriguez Bill Howe and Dan Suciu. 2019. Interventional fairness: Causal database repair for algorithmic fairness. In SIGMOD. 793--810.","DOI":"10.1145\/3299869.3319901"},{"key":"e_1_2_1_62_1","unstructured":"Victor Sanh Lysandre Debut Julien Chaumond and Thomas Wolf. 2019. DistilBERT a distilled version of BERT: Smaller faster cheaper and lighter. In EMC2. 
Victor Sanh Lysandre Debut Julien Chaumond and Thomas Wolf. 2019. DistilBERT a distilled version of BERT: Smaller faster cheaper and lighter. In EMC 2 ."},{"key":"e_1_2_1_63_1","doi-asserted-by":"crossref","unstructured":"Amit Sharma and Dan Cosley. 2013. Do social explanations work? Studying and modeling the effects of social explanations in recommender systems. In WWW. 1133--1144. Amit Sharma and Dan Cosley. 2013. Do social explanations work? Studying and modeling the effects of social explanations in recommender systems. In WWW. 1133--1144.","DOI":"10.1145\/2488388.2488487"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2014.2327028"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.14778\/3415478.3415570"},{"key":"e_1_2_1_66_1","volume-title":"Talburt and Yinle Zhou","author":"John","year":"2013","unstructured":"John R. Talburt and Yinle Zhou . 2013 . A practical guide to entity resolution with OYSTER. In Handbook of Data Quality. Springer , 235--270. John R. Talburt and Yinle Zhou. 2013. A practical guide to entity resolution with OYSTER. In Handbook of Data Quality. Springer, 235--270."},{"key":"e_1_2_1_67_1","volume-title":"Talburt and Yinle Zhou","author":"John","year":"2015","unstructured":"John R. Talburt and Yinle Zhou . 2015 . Entity Information Life Cycle for Big Data: Master Data Management and Information Integration. Morgan Kaufmann . John R. Talburt and Yinle Zhou. 2015. Entity Information Life Cycle for Big Data: Master Data Management and Information Integration. Morgan Kaufmann."},{"key":"e_1_2_1_68_1","first-page":"1","article-title":"Explaining entity resolution predictions: Where are we and what needs to be done? In HILDA@SIGMOD","volume":"10","author":"Thirumuruganathan Saravanan","year":"2019","unstructured":"Saravanan Thirumuruganathan , Mourad Ouzzani , and Nan Tang . 2019 . Explaining entity resolution predictions: Where are we and what needs to be done? In HILDA@SIGMOD . ACM , 10 : 1 -- 10 :6. 
Saravanan Thirumuruganathan, Mourad Ouzzani, and Nan Tang. 2019. Explaining entity resolution predictions: Where are we and what needs to be done? In HILDA@SIGMOD. ACM, 10:1--10:6.","journal-title":"ACM"},{"key":"e_1_2_1_69_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N. Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In NeurIPS. 5998--6008. Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N. Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In NeurIPS. 5998--6008."},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.14778\/2021017.2021020"},{"key":"e_1_2_1_71_1","doi-asserted-by":"crossref","unstructured":"Jin Wang Chunbin Lin and Carlo Zaniolo. 2019. MF-join: Efficient fuzzy string similarity join with multi-level filtering. In ICDE. 386--397. Jin Wang Chunbin Lin and Carlo Zaniolo. 2019. MF-join: Efficient fuzzy string similarity join with multi-level filtering. In ICDE. 386--397.","DOI":"10.1109\/ICDE.2019.00042"},{"key":"e_1_2_1_72_1","volume-title":"A reinforcement learning framework for explainable recommendation","author":"Wang Xiting","unstructured":"Xiting Wang , Yiru Chen , Jie Yang , Le Wu , Zhengtao Wu , and Xing Xie . 2018. A reinforcement learning framework for explainable recommendation . In ICDM. IEEE , 587--596. Xiting Wang, Yiru Chen, Jie Yang, Le Wu, Zhengtao Wu, and Xing Xie. 2018. A reinforcement learning framework for explainable recommendation. In ICDM. IEEE, 587--596."},{"key":"e_1_2_1_73_1","volume-title":"Xin Luna Dong, and Shuiwang Ji","author":"Wang Zhengyang","year":"2020","unstructured":"Zhengyang Wang , Bunyamin Sisman , Hao Wei , Xin Luna Dong, and Shuiwang Ji . 2020 . CorDEL: A contrastive deep learning approach for entity linkage. In ICDM. Zhengyang Wang, Bunyamin Sisman, Hao Wei, Xin Luna Dong, and Shuiwang Ji. 2020. CorDEL: A contrastive deep learning approach for entity linkage. 
In ICDM."},{"key":"e_1_2_1_74_1","doi-asserted-by":"crossref","unstructured":"Michael J. Welch Aamod Sane and Chris Drome. 2012. Fast and accurate incremental entity resolution relative to an entity knowledge base. In CIKM. 2667--2670. Michael J. Welch Aamod Sane and Chris Drome. 2012. Fast and accurate incremental entity resolution relative to an entity knowledge base. In CIKM. 2667--2670.","DOI":"10.1145\/2396761.2398719"},{"key":"e_1_2_1_75_1","doi-asserted-by":"crossref","unstructured":"Sarah Wiegreffe and Yuval Pinter. 2019. Attention is not not explanation. In EMNLP\/IJCNLP. 11--20. Sarah Wiegreffe and Yuval Pinter. 2019. Attention is not not explanation. In EMNLP\/IJCNLP. 11--20.","DOI":"10.18653\/v1\/D19-1002"},{"key":"e_1_2_1_76_1","volume-title":"et\u00a0al","author":"Wolf Thomas","year":"2019","unstructured":"Thomas Wolf , Lysandre Debut , Victor Sanh , Julien Chaumond , Clement Delangue , Anthony Moi , Pierric Cistac , Tim Rault , R\u00e9mi Louf , Morgan Funtowicz , et\u00a0al . 2019 . HuggingFace\u2019s transformers: State-of-the-art natural language processing. arXiv (2019). Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, R\u00e9mi Louf, Morgan Funtowicz, et\u00a0al. 2019. HuggingFace\u2019s transformers: State-of-the-art natural language processing. arXiv (2019)."},{"key":"e_1_2_1_77_1","unstructured":"Renzhi Wu Sanya Chaba Saurabh Sawlani Xu Chu and Saravanan Thirumuruganathan. 2020. ZeroER: Entity resolution using zero labeled examples. In SIGMOD. 1149--1164. Renzhi Wu Sanya Chaba Saurabh Sawlani Xu Chu and Saravanan Thirumuruganathan. 2020. ZeroER: Entity resolution using zero labeled examples. In SIGMOD. 1149--1164."},{"key":"e_1_2_1_78_1","volume-title":"Flame: A probabilistic model combining aspect based opinion mining and collaborative filtering. In WSDM. 199--208.","author":"Wu Yao","year":"2015","unstructured":"Yao Wu and Martin Ester . 2015 . 
Flame: A probabilistic model combining aspect based opinion mining and collaborative filtering. In WSDM. 199--208. Yao Wu and Martin Ester. 2015. Flame: A probabilistic model combining aspect based opinion mining and collaborative filtering. In WSDM. 199--208."},{"key":"e_1_2_1_79_1","doi-asserted-by":"crossref","unstructured":"Diego Zardetto Monica Scannapieco and Tiziana Catarci. 2010. Effective automated object matching. In ICDE. 757--768. Diego Zardetto Monica Scannapieco and Tiziana Catarci. 2010. Effective automated object matching. In ICDE. 757--768.","DOI":"10.1109\/ICDE.2010.5447904"},{"key":"e_1_2_1_80_1","volume-title":"Christos Faloutsos, and David Page.","author":"Zhang Wei","year":"2020","unstructured":"Wei Zhang , Hao Wei , Bunyamin Sisman , Xin Luna Dong , Christos Faloutsos, and David Page. 2020 . AutoBlock: A hands-off blocking framework for entity matching. In WSDM. 744--752. Wei Zhang, Hao Wei, Bunyamin Sisman, Xin Luna Dong, Christos Faloutsos, and David Page. 2020. AutoBlock: A hands-off blocking framework for entity matching. In WSDM. 744--752."},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1561\/1500000066"},{"key":"e_1_2_1_82_1","doi-asserted-by":"crossref","unstructured":"Yongfeng Zhang Guokun Lai Min Zhang Yi Zhang Yiqun Liu and Shaoping Ma. 2014. Explicit factor models for explainable recommendation based on phrase-level sentiment analysis. In SIGIR. 83--92. Yongfeng Zhang Guokun Lai Min Zhang Yi Zhang Yiqun Liu and Shaoping Ma. 2014. Explicit factor models for explainable recommendation based on phrase-level sentiment analysis. In SIGIR. 83--92.","DOI":"10.1145\/2600428.2609579"},{"key":"e_1_2_1_83_1","doi-asserted-by":"crossref","unstructured":"Chen Zhao and Yeye He. 2019. Auto-EM: End-to-end fuzzy entity-matching using pre-trained deep models and transfer learning. In WWW. 2413--2424. Chen Zhao and Yeye He. 2019. Auto-EM: End-to-end fuzzy entity-matching using pre-trained deep models and transfer learning. In WWW. 
2413--2424.","DOI":"10.1145\/3308558.3313578"}],"container-title":["Journal of Data and Information Quality"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3431816","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3431816","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:46Z","timestamp":1750195486000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3431816"}},"subtitle":["Challenges and Opportunities"],"short-title":[],"issued":{"date-parts":[[2021,1,6]]},"references-count":83,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,3,31]]}},"alternative-id":["10.1145\/3431816"],"URL":"https:\/\/doi.org\/10.1145\/3431816","relation":{},"ISSN":["1936-1955","1936-1963"],"issn-type":[{"value":"1936-1955","type":"print"},{"value":"1936-1963","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,6]]},"assertion":[{"value":"2020-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-01-06","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}