{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,9]],"date-time":"2026-04-09T15:06:16Z","timestamp":1775747176515,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":49,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,6,17]],"date-time":"2022-06-17T00:00:00Z","timestamp":1655424000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,6,17]]},"DOI":"10.1145\/3533702.3534910","type":"proceedings-article","created":{"date-parts":[[2022,8,11]],"date-time":"2022-08-11T22:49:06Z","timestamp":1660258146000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Machop"],"prefix":"10.1145","author":[{"given":"Jin","family":"Wang","sequence":"first","affiliation":[{"name":"Megagon Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuliang","family":"Li","sequence":"additional","affiliation":[{"name":"Megagon Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wataru","family":"Hirota","sequence":"additional","affiliation":[{"name":"Megagon Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eser","family":"Kandogan","sequence":"additional","affiliation":[{"name":"Megagon Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,8,11]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2019.12.012"},{"key":"e_1_3_2_1_2_1","volume-title":"Longformer: The long-document transformer. CoRR, abs\/2004.05150","author":"Beltagy I.","year":"2020","unstructured":"I. Beltagy , M. E. Peters , and A. Cohan . Longformer: The long-document transformer. CoRR, abs\/2004.05150 , 2020 . I. Beltagy, M. E. Peters, and A. Cohan. Longformer: The long-document transformer. CoRR, abs\/2004.05150, 2020."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-48881-3_56"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3411929"},{"key":"e_1_3_2_1_5_1","first-page":"463","volume-title":"Entity matching with transformer architectures - A step forward in data integration","author":"Brunner U.","year":"2020","unstructured":"U. Brunner and K. Stockinger . Entity matching with transformer architectures - A step forward in data integration . In A. Bonifati, Y. Zhou, M. A. V. Salles, A. B\u00f6hm, D. Olteanu, G. H. L. Fletcher, A. Khan, and B. Yang, editors, EDBT , pages 463 -- 473 , 2020 . U. Brunner and K. Stockinger. Entity matching with transformer architectures - A step forward in data integration. In A. Bonifati, Y. Zhou, M. A. V. Salles, A. B\u00f6hm, D. Olteanu, G. H. L. Fletcher, A. Khan, and B. Yang, editors, EDBT, pages 463--473, 2020."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2011.127"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3035960"},{"key":"e_1_3_2_1_8_1","first-page":"4171","volume-title":"NAACL-HLT","author":"Devlin J.","year":"2019","unstructured":"J. Devlin , M. Chang , K. Lee , and K. Toutanova . BERT: pre-training of deep bidirectional transformers for language understanding . In NAACL-HLT , pages 4171 -- 4186 , 2019 . J. Devlin, M. Chang, K. Lee, and K. Toutanova. BERT: pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT, pages 4171--4186, 2019."},{"key":"e_1_3_2_1_9_1","volume-title":"Morgan Kaufmann","author":"Doan A.","year":"2012","unstructured":"A. Doan , A. Y. Halevy , and Z. G. Ives . Principles of Data Integration . Morgan Kaufmann , 2012 . A. Doan, A. Y. Halevy, and Z. G. Ives. Principles of Data Integration. Morgan Kaufmann, 2012."},{"issue":"11","key":"e_1_3_2_1_10_1","first-page":"1454","article-title":"Distributed representations of tuples for entity resolution","volume":"11","author":"Ebraheem M.","year":"2018","unstructured":"M. Ebraheem , S. Thirumuruganathan , S. R. Joty , M. Ouzzani , and N. Tang . Distributed representations of tuples for entity resolution . PVLDB , 11 ( 11 ): 1454 -- 1467 , 2018 . M. Ebraheem, S. Thirumuruganathan, S. R. Joty, M. Ouzzani, and N. Tang. Distributed representations of tuples for entity resolution. PVLDB, 11(11):1454--1467, 2018.","journal-title":"PVLDB"},{"key":"e_1_3_2_1_11_1","first-page":"3665","volume-title":"IJCAI","author":"Fu C.","year":"2020","unstructured":"C. Fu , X. Han , J. He , and L. Sun . Hierarchical matching network for heterogeneous entity resolution. In C. Bessiere, editor , IJCAI , pages 3665 -- 3671 , 2020 . C. Fu, X. Han, J. He, and L. Sun. Hierarchical matching network for heterogeneous entity resolution. In C. Bessiere, editor, IJCAI, pages 3665--3671, 2020."},{"key":"e_1_3_2_1_12_1","first-page":"4961","volume-title":"IJCAI","author":"Fu C.","year":"2019","unstructured":"C. Fu , X. Han , L. Sun , B. Chen , W. Zhang , S. Wu , and H. Kong . End-to-end multi-perspective matching for entity resolution. In S. Kraus, editor , IJCAI , pages 4961 -- 4967 , 2019 . C. Fu, X. Han, L. Sun, B. Chen, W. Zhang, S. Wu, and H. Kong. End-to-end multi-perspective matching for entity resolution. In S. Kraus, editor, IJCAI, pages 4961--4967, 2019."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3412717"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1586"},{"key":"e_1_3_2_1_15_1","volume-title":"ICLR","author":"Kingma D. P.","year":"2015","unstructured":"D. P. Kingma and J. Ba . Adam: A method for stochastic optimization. In Y. Bengio and Y. LeCun, editors , ICLR , 2015 . D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. In Y. Bengio and Y. LeCun, editors, ICLR, 2015."},{"key":"e_1_3_2_1_16_1","volume-title":"ICLR","author":"Kitaev N.","year":"2020","unstructured":"N. Kitaev , L. Kaiser , and A. Levskaya . Reformer: The efficient transformer . In ICLR , 2020 . N. Kitaev, L. Kaiser, and A. Levskaya. Reformer: The efficient transformer. In ICLR, 2020."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.14778\/2994509.2994535"},{"key":"e_1_3_2_1_18_1","volume-title":"ICLR","author":"Lan Z.","year":"2020","unstructured":"Z. Lan , M. Chen , S. Goodman , K. Gimpel , P. Sharma , and R. Soricut . ALBERT: A lite BERT for self-supervised learning of language representations . In ICLR , 2020 . Z. Lan, M. Chen, S. Goodman, K. Gimpel, P. Sharma, and R. Soricut. ALBERT: A lite BERT for self-supervised learning of language representations. In ICLR, 2020."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3357949"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939721"},{"issue":"1","key":"e_1_3_2_1_21_1","first-page":"50","article-title":"Deep entity matching with pre-trained language models","volume":"14","author":"Li Y.","year":"2021","unstructured":"Y. Li , J. Li , Y. Suhara , A. Doan , and W. Tan . Deep entity matching with pre-trained language models . PVLDB , 14 ( 1 ): 50 -- 60 , 2021 . Y. Li, J. Li, Y. Suhara, A. Doan, and W. Tan. Deep entity matching with pre-trained language models. PVLDB, 14(1):50--60, 2021.","journal-title":"PVLDB"},{"key":"e_1_3_2_1_22_1","volume-title":"Deep entity matching: Challenges and opportunities. Journal of Data and Information Quality (JDIQ), 13(1):1--17","author":"Li Y.","year":"2021","unstructured":"Y. Li , J. Li , Y. Suhara , J. Wang , W. Hirota , and W.-C. Tan . Deep entity matching: Challenges and opportunities. Journal of Data and Information Quality (JDIQ), 13(1):1--17 , 2021 . Y. Li, J. Li, Y. Suhara, J. Wang, W. Hirota, and W.-C. Tan. Deep entity matching: Challenges and opportunities. Journal of Data and Information Quality (JDIQ), 13(1):1--17, 2021."},{"key":"e_1_3_2_1_23_1","volume-title":"Roberta: A robustly optimized BERT pretraining approach. CoRR, abs\/1907.11692","author":"Liu Y.","year":"2019","unstructured":"Y. Liu , M. Ott , N. Goyal , J. Du , M. Joshi , D. Chen , O. Levy , M. Lewis , L. Zettlemoyer , and V. Stoyanov . Roberta: A robustly optimized BERT pretraining approach. CoRR, abs\/1907.11692 , 2019 . Y. Liu, M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer, and V. Stoyanov. Roberta: A robustly optimized BERT pretraining approach. CoRR, abs\/1907.11692, 2019."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3360319"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3457258"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3366423.3380144"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3196926"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3358018"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/3377455"},{"key":"e_1_3_2_1_30_1","volume-title":"Intermediate training of BERT for product matching","author":"Peeters R.","year":"2020","unstructured":"R. Peeters , C. Bizer , and G. Glavas . Intermediate training of BERT for product matching . In F. Piai, D. Firmani, V. Crescenzi, A. D. Angelis, X. L. Dong, M. Mazzei, P. Merialdo, and D. Srivastava, editors, DI 2KG@VLDB, 2020 . R. Peeters, C. Bizer, and G. Glavas. Intermediate training of BERT for product matching. In F. Piai, D. Firmani, V. Crescenzi, A. D. Angelis, X. L. Dong, M. Mazzei, P. Merialdo, and D. Srivastava, editors, DI2KG@VLDB, 2020."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1969.10501049"},{"key":"e_1_3_2_1_32_1","first-page":"381","volume-title":"WWW","author":"Primpeli A.","year":"2019","unstructured":"A. Primpeli , R. Peeters , and C. Bizer . The WDC training dataset and gold standard for large-scale product matching . In WWW , pages 381 -- 386 , 2019 . A. Primpeli, R. Peeters, and C. Bizer. The WDC training dataset and gold standard for large-scale product matching. In WWW, pages 381--386, 2019."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3209978.3210025"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1410"},{"key":"e_1_3_2_1_35_1","volume-title":"Distilbert, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR, abs\/1910.01108","author":"Sanh V.","year":"2019","unstructured":"V. Sanh , L. Debut , J. Chaumond , and T. Wolf . Distilbert, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR, abs\/1910.01108 , 2019 . V. Sanh, L. Debut, J. Chaumond, and T. Wolf. Distilbert, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR, abs\/1910.01108, 2019."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.5555\/3304222.3304260"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403338"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.14778\/3415478.3415570"},{"key":"e_1_3_2_1_39_1","first-page":"1","volume-title":"Explaining entity resolution predictions: Where are we and what needs to be done? In HILDA@SIGMOD","author":"Thirumuruganathan S.","year":"2019","unstructured":"S. Thirumuruganathan , M. Ouzzani , and N. Tang . Explaining entity resolution predictions: Where are we and what needs to be done? In HILDA@SIGMOD , pages 10: 1 -- 10 :6, 2019 . S. Thirumuruganathan, M. Ouzzani, and N. Tang. Explaining entity resolution predictions: Where are we and what needs to be done? In HILDA@SIGMOD, pages 10:1--10:6, 2019."},{"key":"e_1_3_2_1_40_1","first-page":"277","volume-title":"EDBT","author":"Thirumuruganathan S.","year":"2020","unstructured":"S. Thirumuruganathan , N. Tang , M. Ouzzani , and A. Doan . Data curation with deep learning . In EDBT , pages 277 -- 286 , 2020 . S. Thirumuruganathan, N. Tang, M. Ouzzani, and A. Doan. Data curation with deep learning. In EDBT, pages 277--286, 2020."},{"key":"e_1_3_2_1_41_1","first-page":"5998","volume-title":"NeurIPS","author":"Vaswani A.","year":"2017","unstructured":"A. Vaswani , N. Shazeer , N. Parmar , J. Uszkoreit , L. Jones , A. N. Gomez , L. Kaiser , and I. Polosukhin . Attention is all you need . In NeurIPS , pages 5998 -- 6008 , 2017 . A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin. Attention is all you need. In NeurIPS, pages 5998--6008, 2017."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3457328"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482008"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2019.00042"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2019.00167"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389743"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330963"},{"key":"e_1_3_2_1_49_1","volume-title":"Graphformers: Gnn-nested language models for linked text representation. CoRR, abs\/2105.02605","author":"Yang J.","year":"2021","unstructured":"J. Yang , Z. Liu , S. Xiao , C. Li , G. Sun , and X. Xie . Graphformers: Gnn-nested language models for linked text representation. CoRR, abs\/2105.02605 , 2021 . J. Yang, Z. Liu, S. Xiao, C. Li, G. Sun, and X. Xie. Graphformers: Gnn-nested language models for linked text representation. CoRR, abs\/2105.02605, 2021."}],"event":{"name":"SIGMOD\/PODS '22: International Conference on Management of Data","location":"Philadelphia Pennsylvania","acronym":"SIGMOD\/PODS '22","sponsor":["SIGMOD ACM Special Interest Group on Management of Data"]},"container-title":["Proceedings of the Fifth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3533702.3534910","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3533702.3534910","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:18Z","timestamp":1750186818000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3533702.3534910"}},"subtitle":["an end-to-end generalized entity matching framework"],"short-title":[],"issued":{"date-parts":[[2022,6,17]]},"references-count":49,"alternative-id":["10.1145\/3533702.3534910","10.1145\/3533702"],"URL":"https:\/\/doi.org\/10.1145\/3533702.3534910","relation":{},"subject":[],"published":{"date-parts":[[2022,6,17]]},"assertion":[{"value":"2022-08-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}