{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,4]],"date-time":"2025-09-04T14:02:36Z","timestamp":1756994556453,"version":"3.41.0"},"reference-count":18,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2023,8,10]],"date-time":"2023-08-10T00:00:00Z","timestamp":1691625600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGMOD Rec."],"published-print":{"date-parts":[[2023,8,10]]},"abstract":"<jats:p>We report our experience in running three editions (2020, 2021, 2022) of the SIGMOD programming contest, a well-known event for students to engage in solving exciting data management problems. During this period we had the opportunity of introducing participants to the entity resolution task, which is of paramount importance in the data integration community. 
We aim at sharing the executive decisions, made by the people coauthoring this report, and the lessons learned.<\/jats:p>","DOI":"10.1145\/3615952.3615965","type":"journal-article","created":{"date-parts":[[2023,8,11]],"date-time":"2023-08-11T10:12:20Z","timestamp":1691748740000},"page":"43-47","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Experiences and Lessons Learned from the SIGMOD Entity Resolution Programming Contests"],"prefix":"10.1145","volume":"52","author":[{"given":"Andrea","family":"De Angelis","sequence":"first","affiliation":[{"name":"Roma Tre University"}]},{"given":"Maurizio","family":"Mazzei","sequence":"additional","affiliation":[{"name":"Roma Tre University"}]},{"given":"Federico","family":"Piai","sequence":"additional","affiliation":[{"name":"Roma Tre University"}]},{"given":"Paolo","family":"Merialdo","sequence":"additional","affiliation":[{"name":"Roma Tre University"}]},{"given":"Giovanni","family":"Simonini","sequence":"additional","affiliation":[{"name":"University of Modena and Reggio Emilia"}]},{"given":"Luca","family":"Zecchini","sequence":"additional","affiliation":[{"name":"University of Modena and Reggio Emilia"}]},{"given":"Sonia","family":"Bergamaschi","sequence":"additional","affiliation":[{"name":"University of Modena and Reggio Emilia"}]},{"given":"Donatella","family":"Firmani","sequence":"additional","affiliation":[{"name":"Sapienza University"}]},{"given":"Xu","family":"Chu","sequence":"additional","affiliation":[{"name":"Georgia Institute of Technology"}]},{"given":"Peng","family":"Li","sequence":"additional","affiliation":[{"name":"Georgia Institute of Technology"}]},{"given":"Renzhi","family":"Wu","sequence":"additional","affiliation":[{"name":"Georgia Institute of 
Technology"}]}],"member":"320","published-online":{"date-parts":[[2023,8,11]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TC.2015.2435779"},{"key":"e_1_2_1_2_1","series-title":"CEUR Workshop Proceedings","volume-title":"DI2KG@VLDB","author":"Blacher M.","year":"2020","unstructured":"M. Blacher , J. Klaus , M. Mitterreiter , J. Giesen , and S. Laue . Fast Entity Resolution With Mock Labels and Sorted Integer Sets . In DI2KG@VLDB 2020 , volume 2726 of CEUR Workshop Proceedings . CEUR-WS. org, 2020. M. Blacher, J. Klaus, M. Mitterreiter, J. Giesen, and S. Laue. Fast Entity Resolution With Mock Labels and Sorted Integer Sets. In DI2KG@VLDB 2020, volume 2726 of CEUR Workshop Proceedings. CEUR-WS.org, 2020."},{"key":"e_1_2_1_3_1","first-page":"785","volume-title":"XGBoost: A Scalable Tree Boosting System. In KDD 2016","author":"Chen T.","year":"2016","unstructured":"T. Chen and C. Guestrin . XGBoost: A Scalable Tree Boosting System. In KDD 2016 , pages 785 -- 794 . ACM, 2016 . T. Chen and C. Guestrin. XGBoost: A Scalable Tree Boosting System. In KDD 2016, pages 785--794. ACM, 2016."},{"key":"e_1_2_1_4_1","first-page":"2085","volume-title":"ReproZip: Computational Reproducibility With Ease. In SIGMOD 2016","author":"Chirigati F.","year":"2016","unstructured":"F. Chirigati , R. Rampin , D. Shasha , and J. Freire . ReproZip: Computational Reproducibility With Ease. In SIGMOD 2016 , pages 2085 -- 2088 . ACM, 2016 . F. Chirigati, R. Rampin, D. Shasha, and J. Freire. ReproZip: Computational Reproducibility With Ease. In SIGMOD 2016, pages 2085--2088. ACM, 2016."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3418896"},{"key":"e_1_2_1_6_1","volume-title":"Alaska: A Flexible Benchmark for Data Integration Tasks. arXiv:2101.11259","author":"Crescenzi V.","year":"2021","unstructured":"V. Crescenzi , A. De Angelis , D. Firmani , M. Mazzei , P. Merialdo , F. Piai , and D. Srivastava . 
Alaska: A Flexible Benchmark for Data Integration Tasks. arXiv:2101.11259 , 2021 . V. Crescenzi, A. De Angelis, D. Firmani, M. Mazzei, P. Merialdo, F. Piai, and D. Srivastava. Alaska: A Flexible Benchmark for Data Integration Tasks. arXiv:2101.11259, 2021."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.14778\/2856318.2856330"},{"key":"e_1_2_1_8_1","first-page":"905","volume-title":"Overlap Set Similarity Joins with Theoretical Guarantees. In SIGMOD 2018","author":"Deng D.","year":"2018","unstructured":"D. Deng , Y. Tao , and G. Li . Overlap Set Similarity Joins with Theoretical Guarantees. In SIGMOD 2018 , pages 905 -- 920 . ACM, 2018 . D. Deng, Y. Tao, and G. Li. Overlap Set Similarity Joins with Theoretical Guarantees. In SIGMOD 2018, pages 905--920. ACM, 2018."},{"key":"e_1_2_1_9_1","series-title":"CEUR Workshop Proceedings","volume-title":"DI2KG@VLDB","author":"Deng N.","year":"2020","unstructured":"N. Deng , W. Luan , H. Liu , and B. Tang . CheetahER: A Fast Entity Resolution System for Heterogeneous Camera Data . In DI2KG@VLDB 2020 , volume 2726 of CEUR Workshop Proceedings . CEUR-WS. org, 2020. N. Deng, W. Luan, H. Liu, and B. Tang. CheetahER: A Fast Entity Resolution System for Heterogeneous Camera Data. In DI2KG@VLDB 2020, volume 2726 of CEUR Workshop Proceedings. CEUR-WS.org, 2020."},{"key":"e_1_2_1_10_1","first-page":"4171","volume-title":"NAACL-HLT","author":"Devlin J.","year":"2019","unstructured":"J. Devlin , M. Chang , K. Lee , and K. Toutanova . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding . In NAACL-HLT 2019 , volume 1 , pages 4171 -- 4186 . ACL , 2019. J. Devlin, M. Chang, K. Lee, and K. Toutanova. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT 2019, volume 1, pages 4171--4186. 
ACL, 2019."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3405476"},{"key":"e_1_2_1_12_1","first-page":"602","volume-title":"SparkER: Scaling Entity Resolution in Spark. In EDBT 2019","author":"Gagliardelli L.","year":"2019","unstructured":"L. Gagliardelli , G. Simonini , D. Beneventano , and S. Bergamaschi . SparkER: Scaling Entity Resolution in Spark. In EDBT 2019 , pages 602 -- 605 . OpenProceedings.org , 2019 . L. Gagliardelli, G. Simonini, D. Beneventano, and S. Bergamaschi. SparkER: Scaling Entity Resolution in Spark. In EDBT 2019, pages 602--605. OpenProceedings.org, 2019."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.14778\/3554821.3554823"},{"key":"e_1_2_1_14_1","volume-title":"Supervised Contrastive Learning. In NeurIPS 2020","volume":"33","author":"Khosla P.","year":"2020","unstructured":"P. Khosla , P. Teterwak , C. Wang , A. Sarna , Y. Tian , P. Isola , A. Maschinot , C. Liu , and D. Krishnan . Supervised Contrastive Learning. In NeurIPS 2020 , volume 33 of Advances in Neural Information Processing Systems, pages 18661--18673. Curran Associates, Inc., 2020. P. Khosla, P. Teterwak, C. Wang, A. Sarna, Y. Tian, P. Isola, A. Maschinot, C. Liu, and D. Krishnan. Supervised Contrastive Learning. In NeurIPS 2020, volume 33 of Advances in Neural Information Processing Systems, pages 18661--18673. Curran Associates, Inc., 2020."},{"key":"e_1_2_1_15_1","volume-title":"XtremeDistilTransformers: Task Transfer for Task-agnostic Distillation. arXiv:2106.04563","author":"Mukherjee S.","year":"2021","unstructured":"S. Mukherjee , A. H. Awadallah , and J. Gao . XtremeDistilTransformers: Task Transfer for Task-agnostic Distillation. arXiv:2106.04563 , 2021 . S. Mukherjee, A. H. Awadallah, and J. Gao. XtremeDistilTransformers: Task Transfer for Task-agnostic Distillation. 
arXiv:2106.04563, 2021."},{"key":"e_1_2_1_16_1","first-page":"381","volume-title":"WWW (Companion Volume)","author":"Primpeli A.","year":"2019","unstructured":"A. Primpeli , R. Peeters , and C. Bizer . The WDC Training Dataset and Gold Standard for Large-Scale Product Matching. In ECNLP@ WWW 2019 , WWW (Companion Volume) , pages 381 -- 386 . ACM, 2019 . A. Primpeli, R. Peeters, and C. Bizer. The WDC Training Dataset and Gold Standard for Large-Scale Product Matching. In ECNLP@ WWW 2019, WWW (Companion Volume), pages 381--386. ACM, 2019."},{"issue":"12","key":"e_1_2_1_17_1","first-page":"2735","article-title":"Demonstration of Panda","volume":"14","author":"Wu R.","year":"2021","unstructured":"R. Wu , P. Sakala , P. Li , X. Chu , and Y. He . Demonstration of Panda : A Weakly Supervised Entity Matching System. PVLDB , 14 ( 12 ): 2735 -- 2738 , 2021 . R. Wu, P. Sakala, P. Li, X. Chu, and Y. He. Demonstration of Panda: A Weakly Supervised Entity Matching System. PVLDB, 14(12):2735--2738, 2021.","journal-title":"A Weakly Supervised Entity Matching System. PVLDB"},{"key":"e_1_2_1_18_1","series-title":"CEUR Workshop Proceedings","volume-title":"DI2KG@VLDB","author":"Zecchini L.","year":"2020","unstructured":"L. Zecchini , G. Simonini , and S. Bergamaschi . Entity Resolution on Camera Records without Machine Learning . In DI2KG@VLDB 2020 , volume 2726 of CEUR Workshop Proceedings . CEUR-WS. org, 2020. L. Zecchini, G. Simonini, and S. Bergamaschi. Entity Resolution on Camera Records without Machine Learning. In DI2KG@VLDB 2020, volume 2726 of CEUR Workshop Proceedings. 
CEUR-WS.org, 2020."}],"container-title":["ACM SIGMOD Record"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3615952.3615965","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3615952.3615965","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:36:29Z","timestamp":1750178189000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3615952.3615965"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,10]]},"references-count":18,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,8,10]]}},"alternative-id":["10.1145\/3615952.3615965"],"URL":"https:\/\/doi.org\/10.1145\/3615952.3615965","relation":{},"ISSN":["0163-5808"],"issn-type":[{"type":"print","value":"0163-5808"}],"subject":[],"published":{"date-parts":[[2023,8,10]]},"assertion":[{"value":"2023-08-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}