{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:40:07Z","timestamp":1750192807084,"version":"3.41.0"},"reference-count":17,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2022,5,23]],"date-time":"2022-05-23T00:00:00Z","timestamp":1653264000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100003407","name":"MIUR","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100003407","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Italian Ministry of Education, University and Research","award":["20174LF3T8"],"award-info":[{"award-number":["20174LF3T8"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Data and Information Quality"],"published-print":{"date-parts":[[2022,9,30]]},"abstract":"<jats:p>\n            Conferences play a major role in some disciplines such as computer science and are often used in research quality evaluation exercises. Differently from journals and books, for which ISSN and ISBN codes provide unambiguous keys, recognizing the conference series in which a paper was published is a rather complex endeavor: There is no unique code assigned to conferences, and the way their names are written may greatly vary across years and catalogs. In this article, we propose a technique for the entity resolution of conferences based on the analysis of different semantic parts of their names. We present the results of an investigation of our technique on a dataset of 42,395 distinct computer science conference names excerpted from the DBLP computer science repository,\n            <jats:xref ref-type=\"fn\">\n              <jats:sup>1<\/jats:sup>\n            <\/jats:xref>\n            which we automatically link to different authority files. With suitable data cleaning, the precision of our record linkage algorithm can be as high as 94%. A comparison with results obtainable using state-of-the-art general-purpose record linkage algorithms rounds off the article, showing that our\n            <jats:italic>ad hoc<\/jats:italic>\n            solution largely outperforms them in terms of the quality of the results.\n          <\/jats:p>","DOI":"10.1145\/3519031","type":"journal-article","created":{"date-parts":[[2022,3,4]],"date-time":"2022-03-04T22:24:08Z","timestamp":1646432648000},"page":"1-13","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Which Conference Is That? A Case Study in Computer Science"],"prefix":"10.1145","volume":"14","author":[{"given":"Camil","family":"Demetrescu","sequence":"first","affiliation":[{"name":"Department of Computer, Control, and Management Engineering \u201cAntonioRuberti,\u201d Sapienza University of Rome, Rome, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Irene","family":"Finocchi","sequence":"additional","affiliation":[{"name":"Department of Business and Management, Luiss Guido Carli University, Rome, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0281-4257","authenticated-orcid":false,"given":"Andrea","family":"Ribichini","sequence":"additional","affiliation":[{"name":"Department of Computer, Control, and Management Engineering \u201cAntonio Ruberti,\u201d Sapienza University of Rome, Rome, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marco","family":"Schaerf","sequence":"additional","affiliation":[{"name":"Department of Computer, Control, and Management Engineering \u201cAntonio Ruberti,\u201d Sapienza University of Rome, Rome, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,5,23]]},"reference":[{"doi-asserted-by":"publisher","key":"e_1_3_2_2_2","DOI":"10.1007\/s11192-014-1436-y"},{"unstructured":"Jonathan de Bruin. 2016. Python Record Linkage Toolkit. Retrieved April 2021 from https:\/\/recordlinkage.readthedocs.io\/en\/latest\/index.html.","key":"e_1_3_2_3_2"},{"doi-asserted-by":"publisher","key":"e_1_3_2_4_2","DOI":"10.3844\/jcssp.2017.68.77"},{"doi-asserted-by":"publisher","key":"e_1_3_2_5_2","DOI":"10.1007\/s11192-020-03548-9"},{"doi-asserted-by":"publisher","key":"e_1_3_2_6_2","DOI":"10.1007\/s11192-018-2945-x"},{"doi-asserted-by":"publisher","key":"e_1_3_2_7_2","DOI":"10.1007\/s11192-012-0810-x"},{"unstructured":"Forest Gregg and Derek Eder. 2019. Dedupe. Retrieved from April 2021 https:\/\/docs.dedupe.io\/en\/latest\/.","key":"e_1_3_2_8_2"},{"doi-asserted-by":"publisher","key":"e_1_3_2_9_2","DOI":"10.1002\/asi.24100"},{"unstructured":"Michael Ley. 2020. Personal communication.","key":"e_1_3_2_10_2"},{"doi-asserted-by":"publisher","key":"e_1_3_2_11_2","DOI":"10.1016\/j.bdr.2016.08.002"},{"doi-asserted-by":"publisher","key":"e_1_3_2_12_2","DOI":"10.1145\/3377455"},{"doi-asserted-by":"publisher","key":"e_1_3_2_13_2","DOI":"10.1007\/s00799-020-00289-1"},{"doi-asserted-by":"publisher","key":"e_1_3_2_14_2","DOI":"10.1109\/JCDL.2014.6970153"},{"doi-asserted-by":"publisher","key":"e_1_3_2_15_2","DOI":"10.1145\/3196959.3196984"},{"doi-asserted-by":"publisher","key":"e_1_3_2_16_2","DOI":"10.1002\/asi.23349"},{"doi-asserted-by":"publisher","key":"e_1_3_2_17_2","DOI":"10.1007\/978-3-319-48740-3_3"},{"doi-asserted-by":"publisher","key":"e_1_3_2_18_2","DOI":"10.1145\/3148011.3154470"}],"container-title":["Journal of Data and Information Quality"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3519031","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3519031","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:12:20Z","timestamp":1750191140000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3519031"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,5,23]]},"references-count":17,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,9,30]]}},"alternative-id":["10.1145\/3519031"],"URL":"https:\/\/doi.org\/10.1145\/3519031","relation":{},"ISSN":["1936-1955","1936-1963"],"issn-type":[{"type":"print","value":"1936-1955"},{"type":"electronic","value":"1936-1963"}],"subject":[],"published":{"date-parts":[[2022,5,23]]},"assertion":[{"value":"2021-06-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-02-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-05-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}