{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,17]],"date-time":"2026-04-17T17:19:21Z","timestamp":1776446361549,"version":"3.51.2"},"reference-count":25,"publisher":"MIT Press","issue":"4","content-domain":{"domain":["www.mitpressjournals.org"],"crossmark-restriction":true},"short-container-title":["Quantitative Science Studies"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:p> Adequately disambiguating author names in bibliometric databases is a precondition for conducting reliable analyses at the author level. In the case of bibliometric studies that include many researchers, it is not possible to disambiguate each single researcher manually. Several approaches have been proposed for author name disambiguation, but there has not yet been a comparison of them under controlled conditions. In this study, we compare a set of unsupervised disambiguation approaches. Unsupervised approaches specify a model to assess the similarity of author mentions a priori instead of training a model with labeled data. To evaluate the approaches, we applied them to a set of author mentions annotated with a ResearcherID, this being an author identifier maintained by the researchers themselves. Apart from comparing the overall performance, we take a more detailed look at the role of the parametrization of the approaches and analyze the dependence of the results on the complexity of the disambiguation task. Furthermore, we examine which effects the differences in the set of metadata considered by the different approaches have on the disambiguation results. In the context of this study, the approach proposed by Caron and van Eck (2014) produced the best results. <\/jats:p>","DOI":"10.1162\/qss_a_00081","type":"journal-article","created":{"date-parts":[[2020,8,10]],"date-time":"2020-08-10T20:55:32Z","timestamp":1597092932000},"page":"1510-1528","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":50,"title":["Author name disambiguation of bibliometric data: A comparison of several unsupervised approaches"],"prefix":"10.1162","volume":"1","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8765-9331","authenticated-orcid":true,"given":"Alexander","family":"Tekles","sequence":"first","affiliation":[{"name":"Division for Science and Innovation Studies, Administrative Headquarters of the Max Planck Society, Hofgartenstr. 8, 80539 Munich, Germany"},{"name":"Ludwig-Maximilians-Universit\u00e4t Munich, Department of Sociology, Konradstr. 6, 80801 Munich, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0810-7091","authenticated-orcid":true,"given":"Lutz","family":"Bornmann","sequence":"additional","affiliation":[{"name":"Division for Science and Innovation Studies, Administrative Headquarters of the Max Planck Society, Hofgartenstr. 8, 80539 Munich, Germany"}]}],"member":"281","reference":[{"key":"bib1","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1145\/3197026.3197036","volume-title":"Proceedings of the 18th ACM\/IEEE on Joint Conference on Digital Libraries","author":"Backes T.","year":"2018"},{"key":"bib2","doi-asserted-by":"crossref","first-page":"803","DOI":"10.1145\/3269206.3271699","volume-title":"Proceedings of the 27th ACM International Conference on Information and Knowledge Management","author":"Backes T.","year":"2018"},{"key":"bib3","first-page":"79","volume-title":"Proceedings of the Science and Technology Indicators Conference 2014 Leiden","author":"Caron E.","year":"2014"},{"key":"bib4","volume-title":"Paper presented at the XXII Simp\u00f3sio Brasileiro de Banco de Dados","author":"Cota R. G.","year":"2007"},{"issue":"2","key":"bib5","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1145\/2350036.2350040","volume":"41","author":"Ferreira A. A.","year":"2012","journal-title":"ACM SIGMOD Record"},{"key":"bib6","volume-title":"Paper presented at the Proceedings of the 10th Annual Joint Conference on Digital Libraries","author":"Ferreira A. A.","year":"2010"},{"issue":"6","key":"bib7","doi-asserted-by":"crossref","first-page":"1257","DOI":"10.1002\/asi.22992","volume":"65","author":"Ferreira A. A.","year":"2014","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"bib8","doi-asserted-by":"crossref","DOI":"10.1017\/S0269888917000182","volume":"32","author":"Hussain I.","year":"2017","journal-title":"The Knowledge Engineering Review"},{"issue":"6","key":"bib9","doi-asserted-by":"crossref","first-page":"830","DOI":"10.1177\/0165551518761011","volume":"44","author":"Hussain I.","year":"2018","journal-title":"Journal of Information Science"},{"issue":"3","key":"bib10","doi-asserted-by":"crossref","first-page":"1867","DOI":"10.1007\/s11192-018-2824-5","volume":"116","author":"Kim J.","year":"2018","journal-title":"Scientometrics"},{"issue":"7","key":"bib11","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1002\/asi.24158","volume":"70","author":"Kim J.","year":"2019","journal-title":"Journal of the Association for Information Science and Technology"},{"issue":"6","key":"bib12","doi-asserted-by":"crossref","first-page":"1446","DOI":"10.1002\/asi.23489","volume":"67","author":"Kim J.","year":"2016","journal-title":"Journal of the Association for Information Science and Technology"},{"issue":"5","key":"bib13","doi-asserted-by":"crossref","first-page":"1030","DOI":"10.1002\/asi.22621","volume":"63","author":"Levin M.","year":"2012","journal-title":"Journal of the American Society for Information Science and Technology"},{"issue":"6","key":"bib14","doi-asserted-by":"crossref","first-page":"941","DOI":"10.1016\/j.respol.2014.01.012","volume":"43","author":"Li G.-C.","year":"2014","journal-title":"Research Policy"},{"issue":"3","key":"bib15","doi-asserted-by":"crossref","first-page":"634","DOI":"10.1002\/asi.23183","volume":"66","author":"Liu Y.","year":"2015","journal-title":"Journal of the Association for Information Science and Technology"},{"issue":"1","key":"bib16","doi-asserted-by":"crossref","first-page":"208","DOI":"10.14778\/1920841.1920871","volume":"3","author":"Menestrina D.","year":"2010","journal-title":"Proceedings of the VLDB Endowment"},{"issue":"4","key":"bib17","doi-asserted-by":"crossref","first-page":"767","DOI":"10.1016\/j.joi.2013.06.006","volume":"7","author":"Milojevic\u00b4 S.","year":"2013","journal-title":"Journal of Informetrics"},{"issue":"3","key":"bib18","first-page":"335","volume":"19","author":"Newcombe H. B.","year":"1967","journal-title":"American Journal of Human Genetics"},{"key":"bib19","doi-asserted-by":"crossref","first-page":"344","DOI":"10.1145\/1065385.1065463","volume-title":"Proceedings of the 5th ACM\/IEEE-CS Joint Conference on Digital Libraries","author":"On B.-W.","year":"2005"},{"issue":"11","key":"bib20","volume":"3","author":"Schulz C.","year":"2014","journal-title":"EPJ Data Science"},{"issue":"1","key":"bib21","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1002\/aris.2009.1440430113","volume":"43","author":"Smalheiser N. R.","year":"2009","journal-title":"Annual Review of Information Science and Technology"},{"issue":"3","key":"bib22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1552303.1552304","volume":"3","author":"Torvik V. I.","year":"2009","journal-title":"ACM Transactions on Knowledge Discovery from Data"},{"issue":"3","key":"bib23","doi-asserted-by":"crossref","first-page":"1955","DOI":"10.1007\/s11192-014-1283-x","volume":"101","author":"Wu H.","year":"2014","journal-title":"Scientometrics"},{"issue":"3","key":"bib24","doi-asserted-by":"crossref","first-page":"683","DOI":"10.1007\/s11192-013-0978-8","volume":"96","author":"Wu J.","year":"2013","journal-title":"Scientometrics"},{"issue":"3","key":"bib25","doi-asserted-by":"crossref","first-page":"781","DOI":"10.1007\/s11192-017-2611-8","volume":"114","author":"Zhu J.","year":"2017","journal-title":"Scientometrics"}],"container-title":["Quantitative Science Studies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/qss_a_00081","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T21:30:36Z","timestamp":1615584636000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/qss\/article\/1\/4\/1510-1528\/96105"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12]]},"references-count":25,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["10.1162\/qss_a_00081"],"URL":"https:\/\/doi.org\/10.1162\/qss_a_00081","relation":{},"ISSN":["2641-3337"],"issn-type":[{"value":"2641-3337","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,12]]},"assertion":[{"value":"2019-06-19","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-05-28","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-12-30","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}