{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T06:07:56Z","timestamp":1774591676458,"version":"3.50.1"},"reference-count":46,"publisher":"MIT Press","issue":"2","license":[{"start":{"date-parts":[[2021,2,17]],"date-time":"2021-02-17T00:00:00Z","timestamp":1613520000000},"content-version":"vor","delay-in-days":413,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>This paper presents a new method for identifying scholars who have a Twitter account from bibliometric data from Web of Science (WoS) and Twitter data from Altmetric.com. The method reliably identifies matches between Twitter accounts and scholarly authors. It consists of a matching of elements such as author names, usernames, handles, and URLs, followed by a rule-based scoring system that weights the common occurrence of these elements related to the activities of Twitter users and scholars. The method proceeds by matching the Twitter accounts against a database of millions of disambiguated bibliographic profiles from WoS. This paper describes the implementation and validation of the matching method, and performs verification through precision-recall analysis. We also explore the geographical, disciplinary, and demographic variations in the distribution of scholars matched to a Twitter account. This approach represents a step forward in the development of more advanced forms of social media studies of science by opening up an important door for studying the interactions between science and social media in general, and for studying the activities of scholars on Twitter in particular.<\/jats:p>","DOI":"10.1162\/qss_a_00047","type":"journal-article","created":{"date-parts":[[2020,6,14]],"date-time":"2020-06-14T18:28:26Z","timestamp":1592159306000},"page":"771-791","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":30,"title":["Large-scale identification and characterization of scholars on Twitter"],"prefix":"10.1162","volume":"1","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7465-6462","authenticated-orcid":false,"given":"Rodrigo","family":"Costas","sequence":"first","affiliation":[{"name":"Centre for Science and Technology Studies (CWTS), Leiden University, Leiden (the Netherlands)"},{"name":"Centre for Research on Evaluation, Science and Technology (CREST), Stellenbosch University, Stellenbosch (South Africa)"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1021-059X","authenticated-orcid":false,"given":"Philippe","family":"Mongeon","sequence":"additional","affiliation":[{"name":"Centre for Science and Technology Studies (CWTS), Leiden University, Leiden (the Netherlands)"},{"name":"Centre for Studies in Research and Research Policy (CFA), Aarhus University, Aarhus (Denmark)"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5337-4637","authenticated-orcid":false,"given":"M\u00e1rcia R.","family":"Ferreira","sequence":"additional","affiliation":[{"name":"Centre for Science and Technology Studies (CWTS), Leiden University, Leiden (the Netherlands)"},{"name":"Complexity Science Hub Vienna, Vienna (Austria)"},{"name":"Institute of Information Systems Engineering, Vienna University of Technology, Vienna (Austria)"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1115-3032","authenticated-orcid":false,"given":"Jeroen","family":"van Honk","sequence":"additional","affiliation":[{"name":"Centre for Science and Technology Studies (CWTS), Leiden University, Leiden (the Netherlands)"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4852-8862","authenticated-orcid":false,"given":"Thomas","family":"Franssen","sequence":"additional","affiliation":[{"name":"Centre for Science and Technology Studies (CWTS), Leiden University, Leiden (the Netherlands)"}]}],"member":"281","published-online":{"date-parts":[[2020,6,1]]},"reference":[{"key":"2025073014025303900_bib1","doi-asserted-by":"crossref","unstructured":"Bowman,  T. D.\n           (2015). Differences in personal and professional tweets of scholars. Aslib Journal of Information Management, 67(3), 356\u2013371.","DOI":"10.1108\/AJIM-12-2014-0180"},{"key":"2025073014025303900_bib2","unstructured":"Caron,  E., & Van Eck,  N. J. (2014). Large scale author name disambiguation using rule-based scoring and clustering. In E.Noyons (Ed.), 19th International Conference on Science and Technology Indicators. \u201cContext Counts: Pathways to Master Big Data and Little Data.\u201dLeiden: CWTS-Leiden University."},{"key":"2025073014025303900_bib3","doi-asserted-by":"crossref","unstructured":"Chretien,  K., Azar,  J., & Kind,  T. (2011). Physicians on Twitter. Journal of the American Medical Association, 305(6), 566\u2013568.","DOI":"10.1001\/jama.2011.68"},{"key":"2025073014025303900_bib4","doi-asserted-by":"crossref","unstructured":"Collins,  K., Shiffman,  D., & Rock,  J. (2016). How are scientists using social media in the workplace?PLOS ONE, 11, 1\u201310.","DOI":"10.1371\/journal.pone.0162680"},{"key":"2025073014025303900_bib5","unstructured":"Costas,  R.\n           (2017). Towards the social media studies of science: Social media metrics, present and future. Anales de Investigaci\u00f3n, 13(1), 1\u20135."},{"key":"2025073014025303900_bib6","unstructured":"Costas,  R., Nane,  T., & Larivi\u00e8re,  V. (2015). Is the year of first publication a good proxy of scholars\u2019 academic age? In A. A.Salah, Y.Tonta, et al (Eds.) Proceedings of the 15th International Conference on Scientometrics and Informetrics (pp. 988\u2013998). Istanbul: Bogazi\u00e7i University Printhouse."},{"key":"2025073014025303900_bib7","doi-asserted-by":"crossref","unstructured":"Costas,  R., Zahedi,  Z., & Wouters,  P. (2015a). Do \u201caltmetrics\u201d correlate with citations? Extensive comparison of altmetric indicators with citations from a multidisciplinary perspective. Journal of the Association for Information Science and Technology, 66(10), 2003\u20132019.","DOI":"10.1002\/asi.23309"},{"key":"2025073014025303900_bib8","doi-asserted-by":"crossref","unstructured":"Costas,  R., Zahedi,  Z., & Wouters,  P. (2015b). The thematic orientation of publications mentioned on social media: Large-scale disciplinary comparison of social media metrics with citations. Aslib Journal of Information Management, 67, 260\u2013288.","DOI":"10.1108\/AJIM-12-2014-0173"},{"key":"2025073014025303900_bib9","doi-asserted-by":"crossref","unstructured":"Das,  A. K., & Mishra,  S. (2014). Genesis of altmetrics or article-level metrics for measuring efficacy of scholarly communications: Current perspectives. arXiv preprint, arXiv: 1408.0090.","DOI":"10.2139\/ssrn.2499467"},{"key":"2025073014025303900_bib10","doi-asserted-by":"crossref","unstructured":"D\u00edaz-Faes,  A. A., Bowman,  T. D., & Costas,  R. (2019). Towards a second generation of \u201csocial media metrics\u201d: Characterizing Twitter communities of attention around science. PLOS ONE, 14(5), e0216408.","DOI":"10.1371\/journal.pone.0216408"},{"key":"2025073014025303900_bib11","doi-asserted-by":"crossref","unstructured":"Dinsmore,  A., Allen,  L., & Dolby,  K. (2014). Alternative perspectives on impact: The potential of ALMs and altmetrics to inform funders about research impact. PLOS Biology, 12(11), e1002003.","DOI":"10.1371\/journal.pbio.1002003"},{"key":"2025073014025303900_bib12","doi-asserted-by":"crossref","unstructured":"Eysenbach,  G.\n           (2011). Can tweets predict citations? Metrics of social impact based on Twitter and correlation with traditional metrics of scientific impact. Journal of Medical Internet Research, 13(4), e123.","DOI":"10.2196\/jmir.2012"},{"key":"2025073014025303900_bib13","unstructured":"Friedrich,  N., Bowman,  T. D., Stock,  W. G., & Haustein,  S. (2015). Adapting sentiment analysis for tweets linking to scientific papers. arXiv preprint, arXiv: 1507.01967."},{"key":"2025073014025303900_bib14","doi-asserted-by":"crossref","unstructured":"Gl\u00e4nzel,  W., & Schubert,  A. (1988). Characteristic scores and scales in assessing citation impact. Journal of Information Science, 14(2), 123\u2013127.","DOI":"10.1177\/016555158801400208"},{"key":"2025073014025303900_bib15","unstructured":"Haak,  L., Brown,  J., Buys,  M., Cardoso,  A. P.Demain,  P., \u2026 Wright,  D. (2016). ORCID Public Data File 2016. figshare. https:\/\/doi.org\/10.6084\/m9.figshare.4134027.v1"},{"key":"2025073014025303900_bib16","unstructured":"Hadgu,  A. T., & J\u00e4schke,  R. (2014). Identifying and analyzing scholars on Twitter. CEUR Workshop Proceedings, 1226, 164\u2013165."},{"key":"2025073014025303900_bib17","doi-asserted-by":"crossref","unstructured":"Haustein,  S.\n           (2016). Grand challenges in altmetrics: Heterogeneity, data quality and dependencies. Scientometrics, 108, 413\u2013423. https:\/\/doi.org\/10.1007\/s11192-016-1910-9","DOI":"10.1007\/s11192-016-1910-9"},{"key":"2025073014025303900_bib18","doi-asserted-by":"crossref","unstructured":"Haustein,  S., Bowman,  T. D., Holmberg,  K., Peters,  I., & Larivi\u00e8re,  V. (2014). Astrophysicists on Twitter: An in-depth analysis of tweeting and scientific publication behavior. Aslib Journal of Information Management, 66(3), 279\u2013296. https:\/\/doi.org\/10.1108\/AJIM-09-2013-0081","DOI":"10.1108\/AJIM-09-2013-0081"},{"key":"2025073014025303900_bib19","doi-asserted-by":"crossref","unstructured":"Haustein,  S., Costas,  R., & Larivi\u00e8re,  V. (2015). Characterizing social media metrics of scholarly papers: The effect of document properties and collaboration patterns. PLOS ONE, 10(3), e0120495.","DOI":"10.1371\/journal.pone.0120495"},{"key":"2025073014025303900_bib20","doi-asserted-by":"crossref","unstructured":"Haustein,  S., Peters,  I., Sugimoto,  C. R., Thelwall,  M., & Larivi\u00e8re,  V. (2014). Tweeting biomedicine: An analysis of tweets and citations in the biomedical literature. Journal of the Association for Information Science and Technology, 65(4), 656\u2013669.","DOI":"10.1002\/asi.23101"},{"key":"2025073014025303900_bib21","doi-asserted-by":"crossref","unstructured":"Holmberg,  K., & Thelwall,  M. (2014). Disciplinary differences in Twitter scholarly communication. Scientometrics, 101, 1027\u20131042.","DOI":"10.1007\/s11192-014-1229-3"},{"key":"2025073014025303900_bib22","doi-asserted-by":"crossref","unstructured":"Hwong,  Y.-L., Oliver,  C., Van Kranendonk,  M., Sammut,  C., & Seroussi,  Y. (2016). What makes you tick? The psychology of social media engagement in space science communication. Computers in Human Behavior, 68, 480\u2013492.","DOI":"10.1016\/j.chb.2016.11.068"},{"key":"2025073014025303900_bib23","doi-asserted-by":"crossref","unstructured":"Ke,  Q., Ahn,  Y.-Y., & Sugimoto,  C. R. (2016). A systematic identification and analysis of scientists on Twitter. PLOS ONE, 12(4), e0175368. Retrieved from http:\/\/arxiv.org\/abs\/1608.06229","DOI":"10.1371\/journal.pone.0175368"},{"key":"2025073014025303900_bib24","doi-asserted-by":"crossref","unstructured":"Larivi\u00e8re,  V., & Costas,  R. (2016). How many is too many? On the relationship between research productivity and impact. PLOS ONE, 11, e0162709.","DOI":"10.1371\/journal.pone.0162709"},{"key":"2025073014025303900_bib25","doi-asserted-by":"crossref","unstructured":"Larivi\u00e8re,  V., Ni,  C., Gingras,  Y., Cronin,  B., & Sugimoto,  C. R. (2013). Bibliometrics: Global gender disparities in science. Nature News, 504(7479), 211.","DOI":"10.1038\/504211a"},{"key":"2025073014025303900_bib26","unstructured":"Letierce,  J., Passant,  A., Breslin,  J., & Decker,  S. (2010). Understanding how Twitter is used to spread scientific messages. In Proceedings of the WebSci10: Extending the Frontiers of Society On-Line. Raleigh, North Carolina."},{"key":"2025073014025303900_bib27","doi-asserted-by":"crossref","unstructured":"Lulic,  I., & Kovic,  I. (2013). Analysis of emergency physicians\u2019 Twitter accounts. Emergency Medicine Journal, 30, 371\u2013376.","DOI":"10.1136\/emermed-2012-201132"},{"key":"2025073014025303900_bib28","doi-asserted-by":"crossref","unstructured":"Ortega,  J. L.\n           (2016). To be or not to be on Twitter, and its relationship with the tweeting and citation of research papers. Scientometrics, 109(2), 1353\u20131364.","DOI":"10.1007\/s11192-016-2113-0"},{"key":"2025073014025303900_bib29","unstructured":"Paglione,  L., Peters,  R., Wilmers,  C., Simpson,  W., Montenegro,  A., \u2026 Haak,  L. (2015). ORCID Public Data File 2015. figshare. https:\/\/doi.org\/10.6084\/m9.figshare.1582705.v1"},{"key":"2025073014025303900_bib30","doi-asserted-by":"crossref","unstructured":"Priem,  J., & Costello,  K. L. (2010). How and why scholars cite on Twitter. Proceedings of the American Society for Information Science and Technology, 47(1), 1\u20134.","DOI":"10.1002\/meet.14504701201"},{"key":"2025073014025303900_bib31","unstructured":"Priem,  J., Piwowar,  H. A., & Hemminger,  B. M. (2012). Altmetrics in the wild: Using social media to explore scholarly impact. arXiv preprint. arXiv: 1203.4745."},{"key":"2025073014025303900_bib32","unstructured":"Priem,  J., Taraborelli,  D., Groth,  P., & Neylon,  C. (2010). Altmetrics: A Manifesto. http:\/\/altmetrics.org\/manifesto"},{"key":"2025073014025303900_bib33","doi-asserted-by":"crossref","unstructured":"Robinson-Garcia,  N., Costas,  R., Isett,  K., Melkers,  J., & Hicks,  D. (2017). The unbearable emptiness of tweeting\u2014About journal articles. PLOS ONE, 12(8), e0183551.","DOI":"10.1371\/journal.pone.0183551"},{"key":"2025073014025303900_bib34","doi-asserted-by":"crossref","unstructured":"Robinson-Garcia,  N., van Leeuwen,  T., & Rafols,  I. (2018). Using altmetrics for contextualized mapping of societal impact: from hits to networks. Science and Public Policy, 45(6), 815\u2013826. https:\/\/doi.org\/10.1093\/scipol\/scy024","DOI":"10.1093\/scipol\/scy024"},{"key":"2025073014025303900_bib35","doi-asserted-by":"crossref","unstructured":"Ross,  C., Terras,  C., Warwick,  M., & Welsh,  A. (2011). Enabled backchannel: Conference Twitter use by digital humanists. Journal of Documentation, 67, 214\u2013237.","DOI":"10.1108\/00220411111109449"},{"key":"2025073014025303900_bib36","doi-asserted-by":"crossref","unstructured":"Rowlands,  I., Nicholas,  D., Russell,  B., Canty,  N., & Watkinson,  A. (2011). Social media use in the research workflow. Learned Publishing, 24(3), 183\u2013195. https:\/\/doi.org\/10.1087\/20110306","DOI":"10.1087\/20110306"},{"key":"2025073014025303900_bib37","doi-asserted-by":"crossref","unstructured":"Sharma,  N. K., Ghosh,  S., Benevenuto,  F., Ganguly,  N., & Gummadi,  K. (2012). Inferring who-is-who in the Twitter social network. ACM SIGCOMM Computer Communication Review, 42, 533.","DOI":"10.1145\/2377677.2377782"},{"key":"2025073014025303900_bib38","doi-asserted-by":"crossref","unstructured":"Sugimoto,  C., Work,  S., Larivi\u00e8re,  V., & Haustein. (2017). Scholarly use of social media and altmetrics: Review of the literature. Journal of the Association for Information Science and Technology, 68(9), 2037\u20132062. https:\/\/doi.org\/10.1002\/asi.23833","DOI":"10.1002\/asi.23833"},{"key":"2025073014025303900_bib39","doi-asserted-by":"crossref","unstructured":"Thelwall,  M., Haustein,  S., Larivi\u00e8re,  V., & Sugimoto,  C. R. (2013). Do altmetrics work? Twitter and ten other social web services. PLOS ONE, 8(5), e64841. https:\/\/doi.org\/10.1371\/journal.pone.0064841","DOI":"10.1371\/journal.pone.0064841"},{"key":"2025073014025303900_bib40","doi-asserted-by":"crossref","unstructured":"Vainio,  J., & Holmberg,  K. (2017). Highly tweeted science articles: Who tweets them? An analysis of Twitter user profile descriptions. Scientometrics, 112(1), 345\u2013366.","DOI":"10.1007\/s11192-017-2368-0"},{"key":"2025073014025303900_bib41","doi-asserted-by":"crossref","unstructured":"Van Noorden,  R.\n           (2014). Online collaboration: Scientists and the social network. Nature, 512, 126\u2013129.","DOI":"10.1038\/512126a"},{"key":"2025073014025303900_bib42","doi-asserted-by":"crossref","unstructured":"Veletsianos,  G.\n           (2012). Higher education scholars\u2019 participation and practices on Twitter. Journal of Computer Assisted Learning, 28, 336\u2013349.","DOI":"10.1111\/j.1365-2729.2011.00449.x"},{"key":"2025073014025303900_bib43","doi-asserted-by":"crossref","unstructured":"Veletsianos,  G., & Kimmons,  R. (2016). Scholars in an increasingly open and digital world: How do education professors and students use Twitter?Internet and Higher Education, 30, 1\u201310.","DOI":"10.1016\/j.iheduc.2016.02.002"},{"key":"2025073014025303900_bib44","doi-asserted-by":"crossref","unstructured":"Waltman,  L., & Van Eck,  N. J. (2012). A new methodology for constructing a publication-level classification system of science. Journal of the American Society for Information Science and Technology, 63(12), 2378\u20132392.","DOI":"10.1002\/asi.22748"},{"key":"2025073014025303900_bib45","doi-asserted-by":"crossref","unstructured":"Wouters,  P., Zahedi,  Z., & Costas,  R. (2019). Social media metrics for new research evaluation. In: W.Gl\u00e4nzle, H.Moed, & M.Thelwall (Eds.) Springer Handbook of Science and Technology Indicators. Dordrecht: Springer.","DOI":"10.1007\/978-3-030-02511-3_26"},{"key":"2025073014025303900_bib46","unstructured":"Zahedi,  Z., & Costas,  R. (2017). How visible are the research of different countries on WoS and Twitter? An analysis of global vs. local reach of WoS publications on Twitter. In: 16th International Conference on Scientometrics & Informetrics (ISS), Wuhan, China. Retrieved from: https:\/\/figshare.com\/articles\/How_visible_are_the_research_of_different_countries_on_WoS_and_Twitter_an_analysis_of_global_vs_local_reach_of_WoS_publications_on_Twitter\/5481283\/files\/9479545.pdf"}],"container-title":["Quantitative Science Studies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/direct.mit.edu\/qss\/article-pdf\/1\/2\/771\/1885906\/qss_a_00047.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/direct.mit.edu\/qss\/article-pdf\/1\/2\/771\/1885906\/qss_a_00047.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T18:03:21Z","timestamp":1753898601000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/qss\/article\/1\/2\/771\/96149\/Large-scale-identification-and-characterization-of"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020]]},"references-count":46,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,6,1]]}},"URL":"https:\/\/doi.org\/10.1162\/qss_a_00047","relation":{},"ISSN":["2641-3337"],"issn-type":[{"value":"2641-3337","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020]]},"published":{"date-parts":[[2020]]}}}