{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,1]],"date-time":"2026-06-01T23:09:32Z","timestamp":1780355372658,"version":"3.54.1"},"reference-count":49,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,3,17]],"date-time":"2025-03-17T00:00:00Z","timestamp":1742169600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,3,17]],"date-time":"2025-03-17T00:00:00Z","timestamp":1742169600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100012338","name":"Alan Turing Institute","doi-asserted-by":"publisher","award":["Effective discovery, tracking, and response to mis- and disinformation."],"award-info":[{"award-number":["Effective discovery, tracking, and response to mis- and disinformation."]}],"id":[{"id":"10.13039\/100012338","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["EPJ Data Sci."],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Misinformation and disinformation are growing threats in the digital age, affecting people across languages and borders. However, no research has investigated the prevalence of multilingual misinformation and quantified the extent to which misinformation diffuses across languages. This paper investigates the prevalence and dynamics of multilingual misinformation through an analysis of 264,487 fact-checks spanning 95 languages. To study the evolution of claims over time and mutations across languages, we represent fact-checks with multilingual sentence embeddings and build a graph where semantically similar claims are linked. We provide quantitative evidence of repeated fact-checking efforts and establish that claims diffuse across languages. Specifically, we find that while the majority of misinformation claims are only fact-checked once, 10.26%, corresponding to more than 27,000 claims, are checked multiple times. Using fact-checks as a proxy for the spread of misinformation, we find 32.26% of repeated claims cross linguistic boundaries, suggesting that some misinformation permeates language barriers. However, spreading patterns exhibit strong assortativity, with misinformation more likely to spread within the same language or language family. Next we show that fact-checkers take more time to fact-check claims that have crossed language barriers and model the temporal and cross-lingual evolution of claims. We analyze connected components and shortest paths connecting different versions of a claim finding that claims gradually drift over time and undergo greater alteration when traversing languages. Misinformation changes over time, reducing the effectiveness of static claim matching algorithms. The findings advocate for expanded information sharing between fact-checkers globally while underscoring the importance of localized verification.<\/jats:p>","DOI":"10.1140\/epjds\/s13688-025-00520-6","type":"journal-article","created":{"date-parts":[[2025,3,17]],"date-time":"2025-03-17T11:43:13Z","timestamp":1742211793000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Lost in translation: using global fact-checks to measure multilingual misinformation prevalence, spread, and evolution"],"prefix":"10.1140","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8257-7214","authenticated-orcid":false,"given":"Dorian","family":"Quelle","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Calvin Yixiang","family":"Cheng","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Alexandre","family":"Bovet","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6894-4951","authenticated-orcid":false,"given":"Scott A.","family":"Hale","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,3,17]]},"reference":[{"issue":"2","key":"520_CR1","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3412869","volume":"2","author":"L Konstantinovskiy","year":"2021","unstructured":"Konstantinovskiy L, Price O, Babakar M, Zubiaga A (2021) Toward automated factchecking: developing an annotation schema and benchmark for consistent automated claim detection. Digit Treats Res Pract 2(2):1\u201316","journal-title":"Digit Treats Res Pract"},{"key":"520_CR2","doi-asserted-by":"crossref","unstructured":"Nakov P, Corney D, Hasanain M, Alam F, Elsayed T, Barr\u00f3n-Cede\u00f1o A, Papotti P, Shaar S, Martino GDS (2021) Automated fact-checking for assisting human fact-checkers. arXiv:2103.07769","DOI":"10.24963\/ijcai.2021\/619"},{"key":"520_CR3","unstructured":"Schifferes S, Newman N, Thurman N, Corney D, G\u00f6ker A, Martin C (2017) Identifying and verifying news through social media: Developing a user-centred tool for professional journalists, pp\u00a0325\u2013336"},{"key":"520_CR4","volume-title":"Computation + journalism symposium","author":"B Adair","year":"2017","unstructured":"Adair B, Li C, Yang J, Yu C (2017) Progress toward \u201cthe holy grail\u201d: the continued quest to automate fact-checking. In: Computation + journalism symposium"},{"key":"520_CR5","doi-asserted-by":"publisher","unstructured":"Kazemi A, Garimella K, Gaffney D, Hale S (2021) Claim matching beyond English to scale global fact-checking, pp\u00a04504\u20134517. https:\/\/doi.org\/10.18653\/v1\/2021.acl-long.347","DOI":"10.18653\/v1\/2021.acl-long.347"},{"key":"520_CR6","doi-asserted-by":"crossref","unstructured":"Hale SA (2013) Multilinguals and Wikipedia editing. arXiv:1312.0976","DOI":"10.1145\/2615569.2615684"},{"key":"520_CR7","doi-asserted-by":"publisher","first-page":"618","DOI":"10.1145\/3289600.3291021","volume-title":"Proceedings of the twelfth ACM international conference on web search and data mining","author":"F Lemmerich","year":"2019","unstructured":"Lemmerich F, S\u00e1ez-Trumper D, West R, Zia L (2019) Why the world reads Wikipedia: beyond English speakers. In: Proceedings of the twelfth ACM international conference on web search and data mining, pp\u00a0618\u2013626"},{"key":"520_CR8","doi-asserted-by":"publisher","first-page":"833","DOI":"10.1145\/2556288.2557203","volume-title":"Proceedings of the SIGCHI conference on human factors in computing systems","author":"SA Hale","year":"2014","unstructured":"Hale SA (2014) Global connectivity and multilinguals in the Twitter network. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp\u00a0833\u2013842"},{"issue":"1","key":"520_CR9","doi-asserted-by":"publisher","DOI":"10.1177\/20563051221150412","volume":"9","author":"S Altay","year":"2023","unstructured":"Altay S, Berriche M, Acerbi A (2023) Misinformation on misinformation: conceptual and methodological challenges. Soc Media Soc 9(1):20563051221150412","journal-title":"Soc Media Soc"},{"issue":"8","key":"520_CR10","first-page":"5","volume":"53","author":"JM Burkhardt","year":"2017","unstructured":"Burkhardt JM (2017) History of fake news. Libr Technol Rep 53(8):5\u20139","journal-title":"Libr Technol Rep"},{"issue":"2","key":"520_CR11","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1016\/0740-624X(95)90052","volume":"12","author":"P Hernon","year":"1995","unstructured":"Hernon P (1995) Disinformation and misinformation through the internet: findings of an exploratory study. Gov Inf Q 12(2):133\u2013139. https:\/\/doi.org\/10.1016\/0740-624X(95)90052","journal-title":"Gov Inf Q"},{"key":"520_CR12","doi-asserted-by":"crossref","unstructured":"Johansson P, Enock F, Hale S, Vidgen B, Bereskin C, Margetts H, Bright J (2022) How can we combat online misinformation? A systematic overview of current interventions and their efficacy. arXiv:2212.11864","DOI":"10.2139\/ssrn.4648332"},{"key":"520_CR13","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1111\/pops.12797","volume":"42","author":"M Pantazi","year":"2021","unstructured":"Pantazi M, Hale S, Klein O (2021) Social and cognitive aspects of the vulnerability to political misinformation. Polit Psychol 42:267\u2013304","journal-title":"Polit Psychol"},{"key":"520_CR14","doi-asserted-by":"publisher","unstructured":"Graves L, Cherubini F (2016) The rise of fact-checking sites in Europe. Digital News Project Report. https:\/\/doi.org\/10.60625\/risj-tdn4-p140","DOI":"10.60625\/risj-tdn4-p140"},{"key":"520_CR15","doi-asserted-by":"crossref","unstructured":"Siwakoti S, Yadav K, Bariletto N, Zanotti L, Erdogdu U, Shapiro JN (2021) How covid drove the evolution of fact-checking. Harvard Kennedy School Misinformation Review","DOI":"10.37016\/mr-2020-69"},{"key":"520_CR16","unstructured":"Meta\u2019s Third-Party Fact-Checking Program (2021) https:\/\/www.facebook.com\/formedia\/blog\/third-party-fact-checking-how-it-works. Accessed: 2023-10-12"},{"key":"520_CR17","doi-asserted-by":"crossref","unstructured":"Kazemi A, Garimella K, Shahi GK, Gaffney D, Hale SA (2022) Research note: Tiplines to uncover misinformation on encrypted platforms: a case study of the 2019 Indian general election on WhatsApp. Harvard Kennedy School Misinformation Review 3(1)","DOI":"10.37016\/mr-2020-91"},{"key":"520_CR18","unstructured":"Fact Check (ClaimReview) structured data (2015) https:\/\/developers.google.com\/search\/docs\/appearance\/structured-data\/factcheck. Accessed: 2023-10-12"},{"issue":"2","key":"520_CR19","doi-asserted-by":"publisher","first-page":"92","DOI":"10.20901\/pm.58.2.04","volume":"58","author":"M Slijep\u010devi\u0107","year":"2021","unstructured":"Slijep\u010devi\u0107 M, Holy M, Bor\u010di\u0107 N (2021) Media ecosystems and the fact-checking movement: a comparison of trends in the EU and ASEAN. Politi\u010dka misao: \u010dasopis za politologiju 58(2):92\u2013112","journal-title":"Politi\u010dka misao: \u010dasopis za politologiju"},{"issue":"9","key":"520_CR20","first-page":"1077","volume":"23","author":"T Lelo","year":"2022","unstructured":"Lelo T (2022) The rise of the Brazilian fact-checking movement: between economic sustainability and editorial independence. Journal Stud 23(9):1077\u20131095","journal-title":"Journal Stud"},{"key":"520_CR21","unstructured":"International Fact-Checking Network: verified signatories of the IFCN code of principles (2024) Archived at https:\/\/web.archive.org\/web\/20241008134958\/https:\/\/ifcncodeofprinciples.poynter.org\/signatories. https:\/\/ifcncodeofprinciples.poynter.org\/signatories. Accessed: 2024-10-08"},{"key":"520_CR22","unstructured":"Stencel M, Ryan E, Luther J (2023) Misinformation spreads, but fact-checking has leveled off. Accessed: 2023-08-07"},{"issue":"5","key":"520_CR23","doi-asserted-by":"publisher","first-page":"1009","DOI":"10.1068\/a44497","volume":"44","author":"M Graham","year":"2012","unstructured":"Graham M, Hale S, Stephens M (2012) Featured graphic: digital divide: the geography of Internet access. Environ Plan A 44(5):1009\u20131010","journal-title":"Environ Plan A"},{"issue":"3","key":"520_CR24","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1177\/019251219501600305","volume":"16","author":"GA Barnett","year":"1995","unstructured":"Barnett GA, Choi Y (1995) Physical distance and language as determinants of the international telecommunications network. Int Polit Sci Rev 16(3):249\u2013265. https:\/\/doi.org\/10.1177\/019251219501600305","journal-title":"Int Polit Sci Rev"},{"key":"520_CR25","doi-asserted-by":"publisher","unstructured":"Hale S (2012) Impact of platform design on cross-language information exchange, pp\u00a01363\u20131368. https:\/\/doi.org\/10.1145\/2212776.2212456","DOI":"10.1145\/2212776.2212456"},{"issue":"2","key":"520_CR26","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1111\/j.1083-6101.2011.01568.x","volume":"17","author":"SA Hale","year":"2012","unstructured":"Hale SA (2012) Net increase? Cross-lingual linking in the blogosphere. J Comput-Mediat Commun 17(2):135\u2013151. https:\/\/doi.org\/10.1111\/j.1083-6101.2011.01568.x","journal-title":"J Comput-Mediat Commun"},{"key":"520_CR27","doi-asserted-by":"publisher","unstructured":"Hale SA (2014) Multilinguals and Wikipedia editing, pp\u00a099\u2013108. https:\/\/doi.org\/10.1145\/2615569.2615684","DOI":"10.1145\/2615569.2615684"},{"issue":"1","key":"520_CR28","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1016\/j.socnet.2011.05.006","volume":"34","author":"Y Takhteyev","year":"2012","unstructured":"Takhteyev Y, Gruzd A, Wellman B (2012) Geography of Twitter networks. Soc Netw 34(1):73\u201381. https:\/\/doi.org\/10.1016\/j.socnet.2011.05.006","journal-title":"Soc Netw"},{"issue":"1","key":"520_CR29","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1080\/15295039109366779","volume":"8","author":"JD Straubhaar","year":"1991","unstructured":"Straubhaar JD (1991) Beyond media imperialism: assymetrical interdependence and cultural proximity. Crit Stud Mass Commun 8(1):39\u201359. https:\/\/doi.org\/10.1080\/15295039109366779","journal-title":"Crit Stud Mass Commun"},{"issue":"3","key":"520_CR30","doi-asserted-by":"publisher","first-page":"485","DOI":"10.1080\/08838150802205876","volume":"52","author":"TB Ksiazek","year":"2008","unstructured":"Ksiazek TB, Webster JG (2008) Cultural proximity and audience behavior: the role of language in patterns of polarization and multicultural fluency. J Broadcast Electron Media 52(3):485\u2013503. https:\/\/doi.org\/10.1080\/08838150802205876","journal-title":"J Broadcast Electron Media"},{"issue":"2","key":"520_CR31","doi-asserted-by":"publisher","first-page":"202","DOI":"10.1177\/13678779030062004","volume":"6","author":"M Curtin","year":"2003","unstructured":"Curtin M (2003) Media capital: towards the study of spatial flows. Int J Cult Stud 6(2):202\u2013228. https:\/\/doi.org\/10.1177\/13678779030062004","journal-title":"Int J Cult Stud"},{"key":"520_CR32","doi-asserted-by":"publisher","unstructured":"Straubhaar J (2007) World television: from global to local. https:\/\/doi.org\/10.4135\/9781452204147. https:\/\/sk.sagepub.com\/books\/world-television. Accessed: 2023-08-07","DOI":"10.4135\/9781452204147"},{"key":"520_CR33","volume-title":"Rewire: digital cosmopolitans in the age of connection","author":"E Zuckerman","year":"2013","unstructured":"Zuckerman E (2013) Rewire: digital cosmopolitans in the age of connection. Norton, New York"},{"key":"520_CR34","doi-asserted-by":"crossref","unstructured":"Madraki G, Grasso I, Otala JM, Liu Y, Matthews J (2021) Characterizing and comparing COVID-19 misinformation across languages, countries and platforms, pp\u00a0213\u2013223","DOI":"10.1145\/3442442.3452304"},{"key":"520_CR35","unstructured":"Shahi GK, Nandini D (2020) Fakecovid\u2014a multilingual cross-domain fact check news dataset for COVID-19. arXiv:2006.11343"},{"key":"520_CR36","doi-asserted-by":"publisher","first-page":"4325","DOI":"10.1109\/BigData50022.2020.9378472","volume-title":"2020 IEEE international conference on big data (big data)","author":"Y Li","year":"2020","unstructured":"Li Y, Jiang B, Shu K, Liu H (2020) Toward a multilingual and multimodal data repository for covid-19 disinformation. In: 2020 IEEE international conference on big data (big data). IEEE, pp\u00a04325\u20134330"},{"key":"520_CR37","doi-asserted-by":"crossref","unstructured":"Nielsen DS, McConville R (2022) Mumin: a large-scale multilingual multimodal fact-checked misinformation social network dataset, pp\u00a03141\u20133153","DOI":"10.1145\/3477495.3531744"},{"key":"520_CR38","unstructured":"Fact Check Feed (2019) Data Commons. Accessed: 2023-10-11"},{"key":"520_CR39","doi-asserted-by":"publisher","unstructured":"Feng F, Yang Y, Cer D, Arivazhagan N, Wang W (2022) Language-agnostic BERT sentence embedding, pp\u00a0878\u2013891. https:\/\/doi.org\/10.18653\/v1\/2022.acl-long.62","DOI":"10.18653\/v1\/2022.acl-long.62"},{"key":"520_CR40","doi-asserted-by":"crossref","unstructured":"Reimers N, Gurevych I (2019) Sentence-BERT: sentence embeddings using Siamese BERT-networks. arXiv:1908.10084","DOI":"10.18653\/v1\/D19-1410"},{"key":"520_CR41","unstructured":"ANNOY library (2015) https:\/\/github.com\/spotify\/annoy. Accessed: 2023-08-01"},{"key":"520_CR42","doi-asserted-by":"publisher","unstructured":"McInnes L, Healy J (2017) Accelerated hierarchical density based clustering. https:\/\/doi.org\/10.1109\/2Ficdmw.2017.12","DOI":"10.1109\/2Ficdmw.2017.12"},{"key":"520_CR43","first-page":"226","volume-title":"KDD\u201996: proceedings of the second international conference on knowledge discovery and data mining","author":"M Ester","year":"1996","unstructured":"Ester M, Kriegel H-P, Sander J, Xu X, et al. (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD\u201996: proceedings of the second international conference on knowledge discovery and data mining, pp\u00a0226\u2013231"},{"key":"520_CR44","unstructured":"MacQueen J (1967) An enriched k-means clustering method for grouping fractures with meliorated initial centers. Some methods for classification and analysis of multivariate observations, 14"},{"key":"520_CR45","unstructured":"McInnes L, Healy J, Melville J (2020) Umap: Uniform manifold approximation and projection for dimension reduction. arXiv:1802.03426"},{"issue":"4","key":"520_CR46","doi-asserted-by":"publisher","first-page":"709","DOI":"10.1108\/OIR-09-2020-0417","volume":"45","author":"J Zeng","year":"2021","unstructured":"Zeng J, Chan C-H (2021) A cross-national diagnosis of infodemics: comparing the topical and temporal features of misinformation around COVID-19 in China, India, the US, Germany and France. Online Inf Rev 45(4):709\u2013728","journal-title":"Online Inf Rev"},{"key":"520_CR47","volume-title":"Natural language processing: Python and NLTK","author":"N Hardeniya","year":"2016","unstructured":"Hardeniya N, Perkins J, Chopra D, Joshi N, Mathur I (2016) Natural language processing: Python and NLTK. Packt Publishing Ltd, Birmingham"},{"key":"520_CR48","doi-asserted-by":"publisher","unstructured":"Hammarstr\u00f6m H, Forkel R, Haspelmath M, Bank S (2024) Glottolog 5.0. Max Planck Institute for Evolutionary Anthropology, Leipzig. Accessed: 2024-03-11. https:\/\/doi.org\/10.5281\/zenodo.8131084. http:\/\/glottolog.org","DOI":"10.5281\/zenodo.8131084"},{"key":"520_CR49","doi-asserted-by":"publisher","first-page":"278","DOI":"10.1016\/j.chb.2018.02.008","volume":"83","author":"J Shin","year":"2018","unstructured":"Shin J, Jian L, Driscoll K, Bar F (2018) The diffusion of misinformation on social media: temporal pattern, message, and source. Comput Hum Behav 83:278\u2013287. https:\/\/doi.org\/10.1016\/j.chb.2018.02.008","journal-title":"Comput Hum Behav"}],"container-title":["EPJ Data Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1140\/epjds\/s13688-025-00520-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1140\/epjds\/s13688-025-00520-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1140\/epjds\/s13688-025-00520-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,17]],"date-time":"2025-03-17T11:43:23Z","timestamp":1742211803000},"score":1,"resource":{"primary":{"URL":"https:\/\/epjdatascience.springeropen.com\/articles\/10.1140\/epjds\/s13688-025-00520-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,17]]},"references-count":49,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["520"],"URL":"https:\/\/doi.org\/10.1140\/epjds\/s13688-025-00520-6","relation":{},"ISSN":["2193-1127"],"issn-type":[{"value":"2193-1127","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,3,17]]},"assertion":[{"value":"1 May 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 January 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 March 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"SAH consults for Meedan, a non-profit organization that creates software and multistakeholder projects for fact-checking and other activities.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"22"}}