{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,13]],"date-time":"2026-05-13T21:33:20Z","timestamp":1778708000384,"version":"3.51.4"},"reference-count":143,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,3,31]],"date-time":"2021-03-31T00:00:00Z","timestamp":1617148800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,3,31]],"date-time":"2021-03-31T00:00:00Z","timestamp":1617148800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000086","name":"Directorate for Mathematical and Physical Sciences","doi-asserted-by":"publisher","award":["447634"],"award-info":[{"award-number":["447634"]}],"id":[{"id":"10.13039\/100000086","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100006785","name":"Google","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100006785","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Massachusetts Mutual Life Insurance Company"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["EPJ Data Sci."],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Working from a dataset of 118 billion messages running from the start of 2009 to the end of 2019, we identify and explore the relative daily use of over 150 languages on Twitter. We find that eight languages comprise 80% of all tweets, with English, Japanese, Spanish, Arabic, and Portuguese being the most dominant. To quantify social spreading in each language over time, we compute the \u2018contagion ratio\u2019: The balance of retweets to organic messages. We find that for the most common languages on Twitter there is a growing tendency, though not universal, to retweet rather than share new content. By the end of 2019, the contagion ratios for half of the top 30 languages, including English and Spanish, had reached above 1\u2014the naive contagion threshold. In 2019, the top 5 languages with the highest average daily ratios were, in order, Thai (7.3), Hindi, Tamil, Urdu, and Catalan, while the bottom 5 were Russian, Swedish, Esperanto, Cebuano, and Finnish (0.26). Further, we show that over time, the contagion ratios for most common languages are growing more strongly than those of rare languages.<\/jats:p>","DOI":"10.1140\/epjds\/s13688-021-00271-0","type":"journal-article","created":{"date-parts":[[2021,3,31]],"date-time":"2021-03-31T07:03:09Z","timestamp":1617174189000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":60,"title":["The growing amplification of social media: measuring temporal and social contagion dynamics for over 150 languages on Twitter for 2009\u20132020"],"prefix":"10.1140","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8971-4434","authenticated-orcid":false,"given":"Thayer","family":"Alshaabi","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David Rushing","family":"Dewhurst","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joshua R.","family":"Minot","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael V.","family":"Arnold","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jane L.","family":"Adams","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christopher M.","family":"Danforth","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter Sheridan","family":"Dodds","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,3,31]]},"reference":[{"key":"271_CR1","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1109\/SocialCom.2010.33","volume-title":"2010 IEEE second international conference on social computing","author":"B Suh","year":"2010","unstructured":"Suh B, Hong L, Pirolli P, Chi EH (2010) Want to be retweeted? Large scale analytics on factors impacting retweet in Twitter network. In: 2010 IEEE second international conference on social computing. IEEE, pp\u00a0177\u2013184"},{"key":"271_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/HICSS.2010.412","volume-title":"2010 43rd Hawaii international conference on system sciences","author":"D Boyd","year":"2010","unstructured":"Boyd D, Golder S, Lotan G (2010) Tweet, tweet, retweet: conversational aspects of retweeting on Twitter. In: 2010 43rd Hawaii international conference on system sciences. IEEE, pp\u00a01\u201310. https:\/\/doi.org\/10.1109\/HICSS.2010.412"},{"key":"271_CR3","volume-title":"Proceedings of the international AAAI conference on web and social media","author":"M Nagarajan","year":"2010","unstructured":"Nagarajan M, Purohit H, Sheth A (2010) A\u00a0qualitative examination of topical tweet and retweet practices. In: Proceedings of the international AAAI conference on web and social media, vol\u00a04"},{"key":"271_CR4","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1109\/SocialCom-PASSAT.2012.129","volume-title":"2012 international conference on privacy, security, risk and trust and 2012 international conference on social computing","author":"NO Hodas","year":"2012","unstructured":"Hodas NO, Lerman K (2012) How visibility and divided attention constrain social contagion. In: 2012 international conference on privacy, security, risk and trust and 2012 international conference on social computing. IEEE, pp\u00a0249\u2013257"},{"issue":"4","key":"271_CR5","doi-asserted-by":"publisher","first-page":"470","DOI":"10.1016\/j.socnet.2012.02.005","volume":"34","author":"N Harrigan","year":"2012","unstructured":"Harrigan N, Achananuparp P, Lim E-P (2012) Influentials, novelty, and social contagion: the viral power of average friends, close communities, and old news. Soc Netw 34(4):470\u2013480","journal-title":"Soc Netw"},{"key":"271_CR6","doi-asserted-by":"publisher","DOI":"10.1038\/srep04343","volume":"4","author":"NO Hodas","year":"2014","unstructured":"Hodas NO, Lerman K (2014) The simple rules of social contagion. Sci Rep 4:4343","journal-title":"Sci Rep"},{"key":"271_CR7","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1038\/204225a0","volume":"204","author":"W Goffman","year":"1964","unstructured":"Goffman W, Newill VA (1964) Generalization of epidemic theory: an application to the transmission of ideas. Nature 204:225\u2013228","journal-title":"Nature"},{"key":"271_CR8","first-page":"42","volume":"1","author":"DJ Daley","year":"1965","unstructured":"Daley DJ, Kendall DG (1965) Stochastic rumours. J\u00a0Inst Math Appl 1:42\u201355","journal-title":"J\u00a0Inst Math Appl"},{"key":"271_CR9","first-page":"143","volume":"1","author":"TC Schelling","year":"1971","unstructured":"Schelling TC (1971) Dynamic models of segregation. J\u00a0Math Sociol 1:143\u2013186","journal-title":"J\u00a0Math Sociol"},{"issue":"6","key":"271_CR10","doi-asserted-by":"publisher","first-page":"1420","DOI":"10.1086\/226707","volume":"83","author":"M Granovetter","year":"1978","unstructured":"Granovetter M (1978) Threshold models of collective behavior. Am J Sociol 83(6):1420\u20131443","journal-title":"Am J Sociol"},{"key":"271_CR11","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.92.218701","volume":"92","author":"PS Dodds","year":"2004","unstructured":"Dodds PS, Watts DJ (2004) Universal behavior in a generalized model of contagion. Phys Rev Lett 92:218701","journal-title":"Phys Rev Lett"},{"key":"271_CR12","doi-asserted-by":"publisher","first-page":"587","DOI":"10.1016\/j.jtbi.2004.09.006","volume":"232","author":"PS Dodds","year":"2005","unstructured":"Dodds PS, Watts DJ (2005) A\u00a0generalized model of social and biological contagion. J\u00a0Theor Biol 232:587\u2013604. https:\/\/doi.org\/10.1016\/j.jtbi.2004.09.006","journal-title":"J\u00a0Theor Biol"},{"key":"271_CR13","doi-asserted-by":"publisher","first-page":"702","DOI":"10.1086\/521848","volume":"113","author":"D Centola","year":"2007","unstructured":"Centola D, Macy MW (2007) Complex contagions and the weakness of long ties. Am J Sociol 113:702\u2013734","journal-title":"Am J Sociol"},{"issue":"16","key":"271_CR14","doi-asserted-by":"publisher","first-page":"5962","DOI":"10.1073\/pnas.1116502109","volume":"109","author":"J Ugander","year":"2012","unstructured":"Ugander J, Backstrom L, Marlow C, Kleinberg J (2012) Structural diversity in social contagion. Proc Natl Acad Sci 109(16):5962\u20135966","journal-title":"Proc Natl Acad Sci"},{"issue":"5","key":"271_CR15","volume":"88","author":"E Cozzo","year":"2013","unstructured":"Cozzo E, Banos RA, Meloni S, Moreno Y (2013) Contact-based social contagion in multiplex networks. Phys Rev\u00a0E 88(5):050801","journal-title":"Phys Rev\u00a0E"},{"issue":"2","key":"271_CR16","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0118093","volume":"10","author":"A Bessi","year":"2015","unstructured":"Bessi A, Coletto M, Davidescu GA, Scala A, Caldarelli G, Quattrociocchi W (2015) Science vs conspiracy: collective narratives in the age of misinformation. PLoS ONE 10(2):0118093","journal-title":"PLoS ONE"},{"key":"271_CR17","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1287\/mnsc.15.5.215","volume":"15","author":"F Bass","year":"1969","unstructured":"Bass F (1969) A\u00a0new product growth model for consumer durables. Manag Sci 15:215\u2013227","journal-title":"Manag Sci"},{"issue":"3","key":"271_CR18","doi-asserted-by":"publisher","first-page":"400","DOI":"10.1287\/mksc.1060.0224","volume":"26","author":"C Van den Bulte","year":"2007","unstructured":"Van den Bulte C, Joshi YV (2007) New product diffusion with influentials and imitators. Mark Sci 26(3):400\u2013421","journal-title":"Mark Sci"},{"issue":"5","key":"271_CR19","first-page":"90","volume":"73","author":"M Trusov","year":"2009","unstructured":"Trusov M, Bucklin RE, Pauwels K (2009) Effects of word-of-mouth versus traditional marketing: findings from an internet social networking site. J\u00a0Mark 73(5):90\u2013102","journal-title":"J\u00a0Mark"},{"issue":"2","key":"271_CR20","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1287\/mksc.1100.0566","volume":"30","author":"R Iyengar","year":"2011","unstructured":"Iyengar R, Van den Bulte C, Valente TW (2011) Opinion leadership and social contagion in new product diffusion. Mark Sci 30(2):195\u2013212","journal-title":"Mark Sci"},{"issue":"5","key":"271_CR21","doi-asserted-by":"publisher","first-page":"1110","DOI":"10.1257\/aer.90.5.1110","volume":"90","author":"M Kelly","year":"2000","unstructured":"Kelly M, O\u00a0Grada C (2000) Market contagion: evidence from the panics of 1854 and 1857. Am Econ Rev 90(5):1110\u20131124","journal-title":"Am Econ Rev"},{"issue":"1","key":"271_CR22","first-page":"1","volume":"8","author":"M Cipriani","year":"2008","unstructured":"Cipriani M, Guarino A (2008) Herd behavior and contagion in financial markets. B\u00a0E J Theor Econ 8(1):1\u201356","journal-title":"B\u00a0E J Theor Econ"},{"key":"271_CR23","series-title":"Handbooks in finance","first-page":"1","volume-title":"Handbook of financial markets: dynamics and evolution","author":"D Hirshleifer","year":"2009","unstructured":"Hirshleifer D, Teoh SH (2009) Thought and behavior contagion in capital markets. In: Hens T, Schenk-Hopp\u00e9 KR (eds) Handbook of financial markets: dynamics and evolution. Handbooks in finance. North-Holland, San Diego, pp\u00a01\u201356. http:\/\/www.sciencedirect.com\/science\/article\/pii\/B9780123742582500051"},{"issue":"1","key":"271_CR24","doi-asserted-by":"publisher","first-page":"56","DOI":"10.1080\/15427560.2012.655383","volume":"13","author":"T Fenzl","year":"2012","unstructured":"Fenzl T, Pelzmann L (2012) Psychological and social forces behind aggregate financial market behavior. J\u00a0Behav Finance 13(1):56\u201365","journal-title":"J\u00a0Behav Finance"},{"issue":"1","key":"271_CR25","first-page":"133","volume":"8","author":"JD Hamilton","year":"1981","unstructured":"Hamilton JD, Hamilton LC (1981) Models of social contagion. J\u00a0Math Sociol 8(1):133\u2013160","journal-title":"J\u00a0Math Sociol"},{"issue":"11","key":"271_CR26","doi-asserted-by":"publisher","first-page":"1419","DOI":"10.1177\/001872679604901103","volume":"49","author":"G Bovasso","year":"1996","unstructured":"Bovasso G (1996) A\u00a0network analysis of social contagion processes in an organizational intervention. Hum Relat 49(11):1419\u20131435","journal-title":"Hum Relat"},{"key":"271_CR27","series-title":"Cambridge handbooks in psychology","doi-asserted-by":"publisher","first-page":"688","DOI":"10.1017\/CBO9780511816840.037","volume-title":"Social contagion of violence","author":"J Fagan","year":"2007","unstructured":"Fagan J, Wilkinson DL, Davies G (2007) In: Flannery DJ, Vazsonyi AT, Waldman IDE (eds) Social contagion of violence. Cambridge handbooks in psychology. Cambridge University Press, Cambridge, pp\u00a0688\u2013724. https:\/\/doi.org\/10.1017\/CBO9780511816840.037"},{"issue":"4","key":"271_CR28","doi-asserted-by":"publisher","first-page":"556","DOI":"10.1002\/sim.5408","volume":"32","author":"NA Christakis","year":"2013","unstructured":"Christakis NA, Fowler JH (2013) Social contagion theory: examining dynamic social networks and human behavior. Stat Med 32(4):556\u2013577","journal-title":"Stat Med"},{"key":"271_CR29","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1016\/j.socscimed.2014.01.056","volume":"125","author":"AV Papachristos","year":"2015","unstructured":"Papachristos AV, Wildeman C, Roberto E (2015) Tragic, but not random: the social contagion of nonfatal gunshot injuries. Soc Sci Med 125:139\u2013150","journal-title":"Soc Sci Med"},{"issue":"8","key":"271_CR30","doi-asserted-by":"publisher","DOI":"10.1093\/jnci\/djw330","volume":"109","author":"CE Pollack","year":"2017","unstructured":"Pollack CE, Soulos PR, Herrin J, Xu X, Christakis NA, Forman HP, Yu JB, Killelea BK, Wang S-Y, Gross CP (2017) The impact of social contagion on physician adoption of advanced imaging tests in breast cancer. J\u00a0Natl Cancer Inst 109(8):330","journal-title":"J\u00a0Natl Cancer Inst"},{"issue":"7415","key":"271_CR31","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1038\/nature11421","volume":"489","author":"RM Bond","year":"2012","unstructured":"Bond RM, Fariss CJ, Jones JJ, Kramer AD, Marlow C, Settle JE, Fowler JH (2012) A\u00a061-million-person experiment in social influence and political mobilization. Nature 489(7415):295\u2013298","journal-title":"Nature"},{"issue":"24","key":"271_CR32","doi-asserted-by":"publisher","first-page":"8788","DOI":"10.1073\/pnas.1320040111","volume":"111","author":"AD Kramer","year":"2014","unstructured":"Kramer AD, Guillory JE, Hancock JT (2014) Experimental evidence of massive-scale emotional contagion through social networks. Proc Natl Acad Sci 111(24):8788\u20138790","journal-title":"Proc Natl Acad Sci"},{"issue":"4","key":"271_CR33","doi-asserted-by":"publisher","first-page":"855","DOI":"10.1111\/jcc4.12078","volume":"19","author":"NB Ellison","year":"2014","unstructured":"Ellison NB, Vitak J, Gray R, Lampe C (2014) Cultivating social resources on social network sites: Facebook relationship maintenance behaviors and their role in social capital processes. J\u00a0Comput-Mediat Commun 19(4):855\u2013870","journal-title":"J\u00a0Comput-Mediat Commun"},{"issue":"7","key":"271_CR34","doi-asserted-by":"publisher","first-page":"96","DOI":"10.1145\/2818717","volume":"59","author":"E Ferrara","year":"2016","unstructured":"Ferrara E, Varol O, Davis C, Menczer F, Flammini A (2016) The rise of social bots. Commun ACM 59(7):96\u2013104","journal-title":"Commun ACM"},{"key":"271_CR35","volume-title":"Fourth international AAAI conference on weblogs and social media","author":"K Lerman","year":"2010","unstructured":"Lerman K, Ghosh R (2010) Information contagion: an empirical study of the spread of news on Digg and Twitter social networks. In: Fourth international AAAI conference on weblogs and social media"},{"issue":"2","key":"271_CR36","volume":"85","author":"J Borge-Holthoefer","year":"2012","unstructured":"Borge-Holthoefer J, Moreno Y (2012) Absence of influential spreaders in rumor dynamics. Phys Rev\u00a0E 85(2):026116","journal-title":"Phys Rev\u00a0E"},{"key":"271_CR37","doi-asserted-by":"publisher","first-page":"1103","DOI":"10.1109\/ICDM.2013.61","volume-title":"2013 IEEE 13th international conference on data mining","author":"S Kwon","year":"2013","unstructured":"Kwon S, Cha M, Jung K, Chen W, Wang Y (2013) Prominent features of rumor propagation in online social media. In: 2013 IEEE 13th international conference on data mining. IEEE, pp\u00a01103\u20131108"},{"key":"271_CR38","doi-asserted-by":"publisher","first-page":"2406","DOI":"10.1109\/HICSS.2015.288","volume-title":"2015 48th Hawaii international conference on system sciences","author":"P Ozturk","year":"2015","unstructured":"Ozturk P, Li H, Sakamoto Y (2015) Combating rumor spread on social media: the effectiveness of refutation and warning. In: 2015 48th Hawaii international conference on system sciences. IEEE, pp\u00a02406\u20132414"},{"key":"271_CR39","doi-asserted-by":"publisher","first-page":"3985","DOI":"10.1109\/WSC.2015.7408553","volume-title":"2015 winter simulation conference (WSC)","author":"C Kaligotla","year":"2015","unstructured":"Kaligotla C, Y\u00fccesan E, Chick SE (2015) An agent based model of spread of competing rumors through online interactions on social media. In: 2015 winter simulation conference (WSC). IEEE, pp\u00a03985\u20133996"},{"issue":"3","key":"271_CR40","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0150989","volume":"11","author":"A Zubiaga","year":"2016","unstructured":"Zubiaga A, Liakata M, Procter R, Wong Sak Hoi G, Tolmie P (2016) Analysing how people orient to and spread rumours in social media by looking at conversational threads. PLoS ONE 11(3):0150989","journal-title":"PLoS ONE"},{"issue":"3","key":"271_CR41","doi-asserted-by":"publisher","first-page":"554","DOI":"10.1073\/pnas.1517441113","volume":"113","author":"M Del Vicario","year":"2016","unstructured":"Del Vicario M, Bessi A, Zollo F, Petroni F, Scala A, Caldarelli G, Stanley HE, Quattrociocchi W (2016) The spreading of misinformation online. Proc Natl Acad Sci 113(3):554\u2013559","journal-title":"Proc Natl Acad Sci"},{"issue":"3","key":"271_CR42","first-page":"150","volume":"34","author":"D Spohr","year":"2017","unstructured":"Spohr D (2017) Fake news and ideological polarization: filter bubbles and selective exposure on social media. Bus Inf Rev 34(3):150\u2013160","journal-title":"Bus Inf Rev"},{"issue":"1","key":"271_CR43","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-018-06930-7","volume":"9","author":"C Shao","year":"2018","unstructured":"Shao C, Ciampaglia GL, Varol O, Yang K-C, Flammini A, Menczer F (2018) The spread of low-credibility content by social bots. Nat Commun 9(1):1\u20139","journal-title":"Nat Commun"},{"issue":"9","key":"271_CR44","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0203958","volume":"13","author":"P T\u00f6rnberg","year":"2018","unstructured":"T\u00f6rnberg P (2018) Echo chambers and viral misinformation: modeling fake news as complex contagion. PLoS ONE 13(9):0203958","journal-title":"PLoS ONE"},{"key":"271_CR45","series-title":"NIPS","first-page":"17599","volume-title":"Workshop on computational social science and the wisdom of crowds","author":"TR Zaman","year":"2010","unstructured":"Zaman TR, Herbrich R, Van Gael J, Stern D (2010) Predicting information spreading in Twitter. In: Workshop on computational social science and the wisdom of crowds. NIPS, vol\u00a0104. Citeseer, pp\u00a017599\u201317601"},{"key":"271_CR46","doi-asserted-by":"publisher","first-page":"695","DOI":"10.1145\/1963405.1963503","volume-title":"Proceedings of the 20th international conference on world wide web","author":"DM Romero","year":"2011","unstructured":"Romero DM, Meeder B, Kleinberg J (2011) Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on Twitter. In: Proceedings of the 20th international conference on world wide web, pp\u00a0695\u2013704"},{"key":"271_CR47","volume":"2","author":"L Weng","year":"2012","unstructured":"Weng L, Flammini A, Vespignani A, Menczer F (2012) Competition among memes in a world with limited attention. Nat Sci Rep 2:335","journal-title":"Nat Sci Rep"},{"issue":"2","key":"271_CR48","first-page":"317","volume":"64","author":"E Colleoni","year":"2014","unstructured":"Colleoni E, Rozza A, Arvidsson A (2014) Echo chamber or public sphere? Predicting political orientation and measuring political homophily in Twitter using big data. J\u00a0Commun 64(2):317\u2013332","journal-title":"J\u00a0Commun"},{"issue":"10","key":"271_CR49","doi-asserted-by":"publisher","first-page":"1531","DOI":"10.1177\/0956797615594620","volume":"26","author":"P Barber\u00e1","year":"2015","unstructured":"Barber\u00e1 P, Jost JT, Nagler J, Tucker JA, Bonneau R (2015) Tweeting from left to right: is online political communication more than an echo chamber? Psychol Sci 26(10):1531\u20131542. https:\/\/doi.org\/10.1177\/0956797615594620. PMID: 26297377","journal-title":"Psychol Sci"},{"issue":"1","key":"271_CR50","doi-asserted-by":"publisher","first-page":"76","DOI":"10.1093\/pan\/mpu011","volume":"23","author":"P Barber\u00e1","year":"2015","unstructured":"Barber\u00e1 P (2015) Birds of the same feather tweet together: Bayesian ideal point estimation using Twitter data. Polit Anal 23(1):76\u201391. https:\/\/doi.org\/10.1093\/pan\/mpu011","journal-title":"Polit Anal"},{"key":"271_CR51","doi-asserted-by":"publisher","first-page":"3500","DOI":"10.1109\/HICSS.2012.476","volume-title":"2012 45th Hawaii international conference on system sciences","author":"S Stieglitz","year":"2012","unstructured":"Stieglitz S, Dang-Xuan L (2012) Political communication and influence through microblogging\u2014an empirical analysis of sentiment in Twitter messages and retweet behavior. In: 2012 45th Hawaii international conference on system sciences. IEEE, pp\u00a03500\u20133509"},{"key":"271_CR52","doi-asserted-by":"publisher","first-page":"591","DOI":"10.1145\/1772690.1772751","volume-title":"Proceedings of the 19th international conference on world wide web","author":"H Kwak","year":"2010","unstructured":"Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of the 19th international conference on world wide web, pp\u00a0591\u2013600"},{"key":"271_CR53","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijinfomgt.2020.102187","volume":"55","author":"HR Rao","year":"2020","unstructured":"Rao HR, Vemprala N, Akello P, Valecha R (2020) Retweets of officials\u2019 alarming vs reassuring messages during the COVID-19 pandemic: implications for crisis management. Int J Inf Manag 55:102187","journal-title":"Int J Inf Manag"},{"issue":"9","key":"271_CR54","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0184148","volume":"12","author":"B M\u00f8nsted","year":"2017","unstructured":"M\u00f8nsted B, Sapie\u017cy\u0144ski P, Ferrara E, Lehmann S (2017) Evidence of complex contagion of information in social media: an experiment using Twitter bots. PLoS ONE 12(9):0184148","journal-title":"PLoS ONE"},{"key":"271_CR55","volume-title":"Proceedings of the international AAAI conference on web and social media","author":"M Cha","year":"2010","unstructured":"Cha M, Haddadi H, Benevenuto F, Gummadi K (2010) Measuring user influence in Twitter: the million follower fallacy. In: Proceedings of the international AAAI conference on web and social media, vol\u00a04"},{"issue":"1","key":"271_CR56","doi-asserted-by":"publisher","first-page":"3","DOI":"10.3758\/s13423-017-1236-5","volume":"24","author":"WT Fitch","year":"2017","unstructured":"Fitch WT (2017) Empirical approaches to the study of language evolution. Psychon Bull Rev 24(1):3\u201333","journal-title":"Psychon Bull Rev"},{"issue":"11","key":"271_CR57","doi-asserted-by":"publisher","first-page":"747","DOI":"10.1038\/nrn2931","volume":"11","author":"JJ Bolhuis","year":"2010","unstructured":"Bolhuis JJ, Okanoya K, Scharff C (2010) Twitter evolution: converging mechanisms in birdsong and human speech. Nat Rev Neurosci 11(11):747\u2013759","journal-title":"Nat Rev Neurosci"},{"key":"271_CR58","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1145\/2631775.2631824","volume-title":"Proceedings of the 25th ACM conference on hypertext and social media","author":"S Kim","year":"2014","unstructured":"Kim S, Weber I, Wei L, Oh A (2014) Sociolinguistic analysis of Twitter in multilingual societies. In: Proceedings of the 25th ACM conference on hypertext and social media, pp\u00a0243\u2013248"},{"issue":"2","key":"271_CR59","doi-asserted-by":"publisher","first-page":"171","DOI":"10.3390\/info4020171","volume":"4","author":"J F\u00e1brega","year":"2013","unstructured":"F\u00e1brega J, Paredes P (2013) Social contagion and cascade behaviors on Twitter. Information 4(2):171\u2013181","journal-title":"Information"},{"key":"271_CR60","first-page":"427","volume-title":"Proceedings of the 15th conference of the European chapter of the association for computational linguistics: volume\u00a02, short papers","author":"A Joulin","year":"2017","unstructured":"Joulin A, Grave E, Bojanowski P, Mikolov T (2017) Bag of tricks for efficient text classification. In: Proceedings of the 15th conference of the European chapter of the association for computational linguistics: volume\u00a02, short papers. Association for Computational Linguistics, Valencia, pp\u00a0427\u2013431. https:\/\/www.aclweb.org\/anthology\/E17-2068"},{"key":"271_CR61","unstructured":"Twitter (2019) Developer application program interface (API). https:\/\/developer.twitter.com\/en\/docs\/ads\/campaign-management\/api-reference"},{"key":"271_CR62","volume-title":"Proceedings of the international AAAI conference on web and social media","author":"L Hong","year":"2011","unstructured":"Hong L, Convertino G, Chi E (2011) Language matters in Twitter: a large scale study. In: Proceedings of the international AAAI conference on web and social media, vol\u00a05"},{"issue":"3","key":"271_CR63","doi-asserted-by":"publisher","first-page":"462","DOI":"10.1002\/asi.23186","volume":"66","author":"A Zubiaga","year":"2015","unstructured":"Zubiaga A, Spina D, Mart\u00ednez R, Fresno V (2015) Real-time classification of Twitter trends. J\u00a0Assoc Inf Sci Technol 66(3):462\u2013473","journal-title":"J\u00a0Assoc Inf Sci Technol"},{"issue":"1","key":"271_CR64","doi-asserted-by":"publisher","DOI":"10.1140\/epjds\/s13688-020-0220-x","volume":"9","author":"DR Dewhurst","year":"2020","unstructured":"Dewhurst DR, Alshaabi T, Kiley D, Arnold MV, Minot JR, Danforth CM, Dodds PS (2020) The shocklet transform: a decomposition method for the identification of local, mechanism-driven dynamics in sociotechnical time series. EPJ Data Sci 9(1):3","journal-title":"EPJ Data Sci"},{"issue":"3","key":"271_CR65","volume":"4","author":"J Mellon","year":"2017","unstructured":"Mellon J, Prosser C (2017) Twitter and Facebook are not representative of the general population: political attitudes and demographics of British social media users. Res Polit 4(3):2053168017720008","journal-title":"Res Polit"},{"issue":"4","key":"271_CR66","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0175368","volume":"12","author":"Q Ke","year":"2017","unstructured":"Ke Q, Ahn Y-Y, Sugimoto CR (2017) A\u00a0systematic identification and analysis of scientists on Twitter. PLoS ONE 12(4):1\u201317. https:\/\/doi.org\/10.1371\/journal.pone.0175368","journal-title":"PLoS ONE"},{"key":"271_CR67","unstructured":"Mitchell A, Hitlin P (2019) Twitter reaction to events often at odds with overall public opinion. Pew Research Center: Internet, Science & Tech"},{"key":"271_CR68","unstructured":"Wojcik S, Hughes A (2019) How Twitter users compare to the general public. Pew Research Center: Internet, Science & Tech"},{"issue":"6296","key":"271_CR69","doi-asserted-by":"publisher","first-page":"224","DOI":"10.1126\/science.aag2579","volume":"353","author":"L Palen","year":"2016","unstructured":"Palen L, Anderson KM (2016) Crisis informatics\u2014new data for extraordinary times. Science 353(6296):224\u2013225","journal-title":"Science"},{"key":"271_CR70","doi-asserted-by":"publisher","first-page":"851","DOI":"10.1145\/1772690.1772777","volume-title":"Proceedings of the 19th international conference on world wide web","author":"T Sakaki","year":"2010","unstructured":"Sakaki T, Okazaki M, Matsuo Y (2010) Earthquake shakes Twitter users: real-time event detection by social sensors. In: Proceedings of the 19th international conference on world wide web. Association for Computing Machinery, New York, pp\u00a0851\u2013860. https:\/\/doi.org\/10.1145\/1772690.1772777"},{"key":"271_CR71","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1109\/CIP.2010.5604088","volume-title":"2010 2nd international workshop on cognitive information processing","author":"V Lampos","year":"2010","unstructured":"Lampos V, Cristianini N (2010) Tracking the flu pandemic by monitoring the social web. In: 2010 2nd international workshop on cognitive information processing, pp\u00a0411\u2013416. https:\/\/doi.org\/10.1109\/CIP.2010.5604088"},{"key":"271_CR72","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1145\/1964858.1964874","volume-title":"Proceedings of the first workshop on social media analytics. SOMA\u00a010","author":"A Culotta","year":"2010","unstructured":"Culotta A (2010) Towards detecting influenza epidemics by analyzing Twitter messages. In: Proceedings of the first workshop on social media analytics. SOMA\u00a010. Assoc. Comput. Mach., New York, pp\u00a0115\u2013122. https:\/\/doi.org\/10.1145\/1964858.1964874"},{"issue":"6055","key":"271_CR73","doi-asserted-by":"publisher","first-page":"509","DOI":"10.1126\/science.1205869","volume":"334","author":"G Pickard","year":"2011","unstructured":"Pickard G, Pan W, Rahwan I, Cebrian M, Crane R, Madan A, Pentland A (2011) Time-critical social mobilization. Science 334(6055):509\u2013512","journal-title":"Science"},{"issue":"3","key":"271_CR74","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1109\/MIS.2011.52","volume":"26","author":"H Gao","year":"2011","unstructured":"Gao H, Barbier G, Goolsby R (2011) Harnessing the crowdsourcing power of social media for disaster relief. IEEE Intell Syst 26(3):10\u201314","journal-title":"IEEE Intell Syst"},{"issue":"1","key":"271_CR75","doi-asserted-by":"publisher","DOI":"10.1140\/epjds\/s13688-015-0056-y","volume":"4","author":"ZC Steinert-Threlkeld","year":"2015","unstructured":"Steinert-Threlkeld ZC, Mocanu D, Vespignani A, Fowler J (2015) Online social networks and offline protest. EPJ Data Sci 4(1):19","journal-title":"EPJ Data Sci"},{"key":"271_CR76","unstructured":"Dodds PS, Minot JR, Arnold MV, Alshaabi T, Adams JL, Dewhurst DR, Reagan AJ, Danforth CM (2019) Fame and ultrafame: measuring and comparing daily levels of \u2018being talked about\u2019 for United States\u2019 presidents, their rivals, God, countries, and K-pop. http:\/\/arxiv.org\/abs\/1910.00149"},{"key":"271_CR77","first-page":"1524","volume-title":"Proceedings of the 2011 conference on empirical methods in natural language processing","author":"A Ritter","year":"2011","unstructured":"Ritter A, Clark S, Mausam EO (2011) Named entity recognition in tweets: an experimental study. In: Proceedings of the 2011 conference on empirical methods in natural language processing. Association for Computational Linguistics, Edinburgh, pp\u00a01524\u20131534. https:\/\/www.aclweb.org\/anthology\/D11-1141"},{"key":"271_CR78","doi-asserted-by":"publisher","first-page":"1104","DOI":"10.1145\/2339530.2339704","volume-title":"Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining. KDD\u201912","author":"A Ritter","year":"2012","unstructured":"Ritter A, Mausam EO, Clark S (2012) Open domain event extraction from Twitter. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining. KDD\u201912. Assoc. Comput. Mach., New York, pp\u00a01104\u20131112. https:\/\/doi.org\/10.1145\/2339530.2339704"},{"issue":"6245","key":"271_CR79","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1126\/science.aaa8685","volume":"349","author":"J Hirschberg","year":"2015","unstructured":"Hirschberg J, Manning CD (2015) Advances in natural language processing. Science 349(6245):261\u2013266","journal-title":"Science"},{"key":"271_CR80","first-page":"25","volume-title":"Proceedings of the ACL 2012 system demonstrations","author":"M Lui","year":"2012","unstructured":"Lui M, Baldwin T (2012) langid.py: an off-the-shelf language identification tool. In: Proceedings of the ACL 2012 system demonstrations. Association for Computational Linguistics, pp\u00a025\u201330"},{"key":"271_CR81","first-page":"65","volume-title":"Proceedings of the second workshop on language in social media. LSM\u201912","author":"S Bergsma","year":"2012","unstructured":"Bergsma S, McNamee P, Bagdouri M, Fink C, Wilson T (2012) Language identification for creating language-specific Twitter collections. In: Proceedings of the second workshop on language in social media. LSM\u201912. Association for Computational Linguistics, pp\u00a065\u201374"},{"key":"271_CR82","doi-asserted-by":"publisher","first-page":"17","DOI":"10.3115\/v1\/W14-1303","volume-title":"Proceedings of the 5th workshop on language analysis for social media (LASM)","author":"M Lui","year":"2014","unstructured":"Lui M, Baldwin T (2014) Accurate language identification of Twitter messages. In: Proceedings of the 5th workshop on language analysis for social media (LASM). Association for Computational Linguistics, Gothenburg, pp\u00a017\u201325. https:\/\/doi.org\/10.3115\/v1\/W14-1303. https:\/\/www.aclweb.org\/anthology\/W14-1303"},{"key":"271_CR83","doi-asserted-by":"publisher","first-page":"73","DOI":"10.18653\/v1\/W17-1209","volume-title":"Proceedings of the fourth workshop on NLP for similar languages, varieties and dialects (VarDial)","author":"J Williams","year":"2017","unstructured":"Williams J, Dagli C (2017) Twitter language identification of similar languages and dialects without ground truth. In: Proceedings of the fourth workshop on NLP for similar languages, varieties and dialects (VarDial). Association for Computational Linguistics, Valencia, pp\u00a073\u201383. https:\/\/doi.org\/10.18653\/v1\/W17-1209. https:\/\/www.aclweb.org\/anthology\/W17-1209"},{"issue":"12","key":"271_CR84","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0026752","volume":"6","author":"PS Dodds","year":"2011","unstructured":"Dodds PS, Harris KD, Kloumann IM, Bliss CA, Danforth CM (2011) Temporal patterns of happiness and information in a global social network: hedonometrics and Twitter. PLoS ONE 6(12):e26752. https:\/\/doi.org\/10.1371\/journal.pone.0026752","journal-title":"PLoS ONE"},{"issue":"6","key":"271_CR85","doi-asserted-by":"publisher","first-page":"811","DOI":"10.1109\/TDSC.2012.75","volume":"9","author":"Z Chu","year":"2012","unstructured":"Chu Z, Gianvecchio S, Wang H, Jajodia S (2012) Detecting automation of Twitter accounts: are you a human, bot, or cyborg? IEEE Trans Dependable Secure Comput 9(6):811\u2013824","journal-title":"IEEE Trans Dependable Secure Comput"},{"issue":"11","key":"271_CR86","doi-asserted-by":"publisher","first-page":"5","DOI":"10.5120\/ijca2016908625","volume":"139","author":"V Kharde","year":"2016","unstructured":"Kharde V, Sonawane S (2016) Sentiment analysis of Twitter data: a survey of techniques. Int J Comput Appl 139(11):5\u201315. https:\/\/doi.org\/10.5120\/ijca2016908625","journal-title":"Int J Comput Appl"},{"issue":"3","key":"271_CR87","doi-asserted-by":"publisher","DOI":"10.1126\/sciadv.1500779","volume":"2","author":"Y Kryvasheyeu","year":"2016","unstructured":"Kryvasheyeu Y, Chen H, Obradovich N, Moro E, Van Hentenryck P, Fowler J, Cebrian M (2016) Rapid assessment of disaster damage using social media activity. Sci Adv 2(3):1500779","journal-title":"Sci Adv"},{"key":"271_CR88","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1007\/978-3-319-94105-9_4","volume-title":"Predictive analysis on Twitter: techniques and applications","author":"U Kursuncu","year":"2019","unstructured":"Kursuncu U, Gaur M, Lokala U, Thirunarayan K, Sheth A, Arpinar IB (2019) In: Agarwal N, Dokoohaki N, Tokdemir S (eds) Predictive analysis on Twitter: techniques and applications. Springer, Cham, pp\u00a067\u2013104. https:\/\/doi.org\/10.1007\/978-3-319-94105-9_4"},{"key":"271_CR89","doi-asserted-by":"publisher","first-page":"1532","DOI":"10.3115\/v1\/D14-1162","volume-title":"Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP)","author":"J Pennington","year":"2014","unstructured":"Pennington J, Socher R, Manning C (2014) GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, Doha, pp\u00a01532\u20131543. https:\/\/doi.org\/10.3115\/v1\/D14-1162. https:\/\/www.aclweb.org\/anthology\/D14-1162"},{"key":"271_CR90","doi-asserted-by":"publisher","first-page":"4171","DOI":"10.18653\/v1\/N19-1423","volume-title":"Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume\u00a01 (long and short papers)","author":"J Devlin","year":"2019","unstructured":"Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume\u00a01 (long and short papers). Association for Computational Linguistics, Minneapolis, pp\u00a04171\u20134186. https:\/\/doi.org\/10.18653\/v1\/N19-1423. https:\/\/www.aclweb.org\/anthology\/N19-1423"},{"key":"271_CR91","volume-title":"Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018)","author":"T Mikolov","year":"2018","unstructured":"Mikolov T, Grave E, Bojanowski P, Puhrsch C, Joulin A (2018) Advances in pre-training distributed word representations. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018). European Language Resources Association (ELRA), Miyazaki"},{"key":"271_CR92","volume-title":"Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018)","author":"E Grave","year":"2018","unstructured":"Grave E, Bojanowski P, Gupta P, Joulin A, Mikolov T (2018) Learning word vectors for 157 languages. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018). European Language Resources Association (ELRA), Miyazaki. https:\/\/www.aclweb.org\/anthology\/L18-1550"},{"key":"271_CR93","first-page":"311","volume-title":"Proceedings of the 40th annual meeting of the association for computational linguistics","author":"K Papineni","year":"2002","unstructured":"Papineni K, Roukos S, Ward T, Zhu W-J (2002) Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th annual meeting of the association for computational linguistics, pp\u00a0311\u2013318"},{"key":"271_CR94","series-title":"Conference track proceedings","volume-title":"3rd international conference on learning representations, ICLR 2015","author":"D Bahdanau","year":"2015","unstructured":"Bahdanau D, Cho K, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In: Bengio Y, LeCun Y (eds) 3rd international conference on learning representations, ICLR 2015, San Diego, CA, USA, May 7\u20139, 2015. Conference track proceedings"},{"key":"271_CR95","doi-asserted-by":"publisher","first-page":"1412","DOI":"10.18653\/v1\/D15-1166","volume-title":"Proceedings of the 2015 conference on empirical methods in natural language processing","author":"T Luong","year":"2015","unstructured":"Luong T, Pham H, Manning CD (2015) Effective approaches to attention-based neural machine translation. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, Lisbon, pp\u00a01412\u20131421. https:\/\/doi.org\/10.18653\/v1\/D15-1166"},{"issue":"3","key":"271_CR96","first-page":"94","volume":"20","author":"P McNamee","year":"2005","unstructured":"McNamee P (2005) Language identification: a solved problem suitable for undergraduate instruction. J\u00a0Comput Sci Coll 20(3):94\u2013101","journal-title":"J\u00a0Comput Sci Coll"},{"key":"271_CR97","volume-title":"Proceedings of the fifth international conference on language resources and evaluation (LREC\u201906)","author":"B Hughes","year":"2006","unstructured":"Hughes B, Baldwin T, Bird S, Nicholson J, MacKinlay A (2006) Reconsidering language identification for written language resources. In: Proceedings of the fifth international conference on language resources and evaluation (LREC\u201906). European Language Resources Association (ELRA), Genoa. http:\/\/www.lrec-conf.org\/proceedings\/lrec2006\/pdf\/459_pdf.pdf"},{"key":"271_CR98","volume-title":"Proceedings of the sixth international conference on language resources and evaluation (LREC\u201908)","author":"L Grothe","year":"2008","unstructured":"Grothe L, De Luca EW, N\u00fcrnberger A (2008) A\u00a0comparative study on language identification methods. In: Proceedings of the sixth international conference on language resources and evaluation (LREC\u201908). European Language Resources Association (ELRA), Marrakech"},{"key":"271_CR99","first-page":"553","volume-title":"Proceedings of 5th international joint conference on natural language processing","author":"M Lui","year":"2011","unstructured":"Lui M, Baldwin T (2011) Cross-domain feature selection for language identification. In: Proceedings of 5th international joint conference on natural language processing. Asian Federation of Natural Language Processing, Chiang Mai, pp\u00a0553\u2013561. https:\/\/www.aclweb.org\/anthology\/I11-1062"},{"key":"271_CR100","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1162\/tacl_a_00163","volume":"2","author":"M Lui","year":"2014","unstructured":"Lui M, Lau JH, Baldwin T (2014) Automatic detection and language identification of multilingual documents. Trans Assoc Comput Linguist 2:27\u201340. https:\/\/doi.org\/10.1162\/tacl_a_00163","journal-title":"Trans Assoc Comput Linguist"},{"issue":"6014","key":"271_CR101","doi-asserted-by":"publisher","first-page":"176","DOI":"10.1126\/science.1199644","volume":"331","author":"J-B Michel","year":"2011","unstructured":"Michel J-B, Shen YK, Aiden AP, Veres A, Gray MK, Pickett JP, Hoiberg D, Clancy D, Norvig P, Orwant J et al. (2011) Quantitative analysis of culture using millions of digitized books. Science 331(6014):176\u2013182","journal-title":"Science"},{"key":"271_CR102","unstructured":"Roomann-Kurrik A (2013) Introducing new metadata for tweets. Twitter"},{"key":"271_CR103","first-page":"27","volume-title":"Proceedings of Benelearn 2011","author":"E Tromp","year":"2011","unstructured":"Tromp E, Pechenizkiy M (2011) Graph-based N-gram language identification on short texts. In: Proceedings of Benelearn 2011, pp\u00a027\u201334"},{"key":"271_CR104","first-page":"287","volume-title":"Proceedings of COLING 2012: posters","author":"H Elfardy","year":"2012","unstructured":"Elfardy H, Diab M (2012) Token level identification of linguistic code switching. In: Proceedings of COLING 2012: posters. The COLING 2012 Organizing Committee, Mumbai, pp\u00a0287\u2013296"},{"issue":"1","key":"271_CR105","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1007\/s10579-012-9195-y","volume":"47","author":"S Carter","year":"2013","unstructured":"Carter S, Weerkamp W, Tsagkias M (2013) Microblog language identification: overcoming the limitations of short, unedited and idiomatic text. Lang Resour Eval 47(1):195\u2013215. https:\/\/doi.org\/10.1007\/s10579-012-9195-y","journal-title":"Lang Resour Eval"},{"key":"271_CR106","unstructured":"Steinmetz K (2013) What Twitter says to linguists. Time Inc. http:\/\/content.time.com\/time\/subscriber\/article\/0,33009,2150609,00.html"},{"key":"271_CR107","first-page":"95","volume-title":"Machine learning and knowledge discovery in databases","author":"M Goldszmidt","year":"2013","unstructured":"Goldszmidt M, Najork M, Paparizos S (2013) Boot-strapping language identifiers for short colloquial postings. In: Blockeel H, Kersting K, Nijssen S, \u017delezn\u00fd F (eds) Machine learning and knowledge discovery in databases. Springer, Berlin, pp\u00a095\u2013111"},{"key":"271_CR108","volume-title":"Proceedings of the international AAAI conference on web and social media","author":"D Nguyen","year":"2015","unstructured":"Nguyen D, Trieschnigg D, Cornips L (2015) Audience and the use of minority languages on Twitter. In: Proceedings of the international AAAI conference on web and social media, vol\u00a09"},{"key":"271_CR109","doi-asserted-by":"publisher","first-page":"2","DOI":"10.18653\/v1\/W15-2902","volume-title":"Proceedings of the 6th workshop on computational approaches to subjectivity, sentiment and social media analysis","author":"D Vilares","year":"2015","unstructured":"Vilares D, Alonso MA, G\u00f3mez-Rodr\u00edguez C (2015) Sentiment analysis on monolingual, multilingual and code-switching Twitter corpora. In: Proceedings of the 6th workshop on computational approaches to subjectivity, sentiment and social media analysis. Association for Computational Linguistics, Lisboa, pp\u00a02\u20138. https:\/\/doi.org\/10.18653\/v1\/W15-2902"},{"key":"271_CR110","doi-asserted-by":"publisher","first-page":"1971","DOI":"10.18653\/v1\/P17-1180","volume-title":"Proceedings of the 55th annual meeting of the association for computational linguistics (volume\u00a01: long papers)","author":"S Rijhwani","year":"2017","unstructured":"Rijhwani S, Sequiera R, Choudhury M, Bali K, Maddila C (2017) Estimating code-switching on Twitter with a novel generalized word-level language detection technique. In: Proceedings of the 55th annual meeting of the association for computational linguistics (volume\u00a01: long papers), pp\u00a01971\u20131982. https:\/\/doi.org\/10.18653\/v1\/P17-1180"},{"key":"271_CR111","unstructured":"Rosen A (2017) Tweeting made easier. https:\/\/blog.twitter.com\/en_us\/topics\/product\/2017\/tweetingmadeeasier.html"},{"issue":"1","key":"271_CR112","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1007\/s00146-014-0549-4","volume":"30","author":"B Batrinca","year":"2015","unstructured":"Batrinca B, Treleaven PC (2015) Social media analytics: a survey of techniques, tools and platforms. AI & Society 30(1):89\u2013116","journal-title":"AI & Society"},{"issue":"2","key":"271_CR113","doi-asserted-by":"publisher","DOI":"10.1145\/2938640","volume":"49","author":"A Giachanou","year":"2016","unstructured":"Giachanou A, Crestani F (2016) Like it or not: a survey of Twitter sentiment analysis methods. ACM Comput Surv 49(2):28. https:\/\/doi.org\/10.1145\/2938640","journal-title":"ACM Comput Surv"},{"issue":"3","key":"271_CR114","doi-asserted-by":"publisher","first-page":"965","DOI":"10.1007\/s10115-016-0997-x","volume":"51","author":"F Pla","year":"2017","unstructured":"Pla F, Hurtado L-F (2017) Language identification of multilingual posts from Twitter: a case study. Knowl Inf Syst 51(3):965\u2013989","journal-title":"Knowl Inf Syst"},{"issue":"4","key":"271_CR115","doi-asserted-by":"publisher","first-page":"729","DOI":"10.1007\/s10579-015-9317-4","volume":"50","author":"A Zubiaga","year":"2016","unstructured":"Zubiaga A, San Vicente I, Gamallo P, Pichel JR, Alegria I, Aranberri N, Ezeiza A, Fresno V (2016) Tweetlid: a benchmark for tweet language identification. Lang Resour Eval 50(4):729\u2013766","journal-title":"Lang Resour Eval"},{"key":"271_CR116","doi-asserted-by":"publisher","first-page":"56","DOI":"10.18653\/v1\/W17-4408","volume-title":"Proceedings of the 3rd workshop on noisy user-generated text","author":"SL Blodgett","year":"2017","unstructured":"Blodgett SL, Wei J, O\u2019Connor B (2017) A\u00a0dataset and classifier for recognizing social media English. In: Proceedings of the 3rd workshop on noisy user-generated text. Association for Computational Linguistics, Copenhagen, pp\u00a056\u201361. https:\/\/doi.org\/10.18653\/v1\/W17-4408"},{"key":"271_CR117","series-title":"Workshop track proceedings","volume-title":"1st international conference on learning representations, ICLR 2013","author":"T Mikolov","year":"2013","unstructured":"Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. In: Bengio Y, LeCun Y (eds) 1st international conference on learning representations, ICLR 2013, Scottsdale, Arizona, USA, May 2\u20134, 2013. Workshop track proceedings. http:\/\/arxiv.org\/abs\/1301.3781"},{"key":"271_CR118","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1162\/tacl_a_00051","volume":"5","author":"P Bojanowski","year":"2017","unstructured":"Bojanowski P, Grave E, Joulin A, Mikolov T (2017) Enriching word vectors with subword information. Trans Assoc Comput Linguist 5:135\u2013146","journal-title":"Trans Assoc Comput Linguist"},{"key":"271_CR119","unstructured":"Facebook AI Research (2017) FastText language identification. https:\/\/fasttext.cc\/docs\/en\/language-identification.html"},{"key":"271_CR120","first-page":"1107","volume-title":"Proceedings of the 15th conference of the European chapter of the association for computational linguistics: volume\u00a01, long papers","author":"A Conneau","year":"2017","unstructured":"Conneau A, Schwenk H, Barrault L, Lecun Y (2017) Very deep convolutional networks for text classification. In: Proceedings of the 15th conference of the European chapter of the association for computational linguistics: volume\u00a01, long papers. Association for Computational Linguistics, Valencia, pp\u00a01107\u20131116"},{"key":"271_CR121","volume-title":"Advances in neural information processing systems","author":"X Zhang","year":"2015","unstructured":"Zhang X, Zhao J, LeCun Y (2015) Character-level convolutional networks for text classification. In: Cortes C, Lawrence N, Lee D, Sugiyama M, Garnett R (eds) Advances in neural information processing systems, vol\u00a028. Curran Associates, Red Hook"},{"key":"271_CR122","first-page":"1010","volume-title":"Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies","author":"S Bergsma","year":"2013","unstructured":"Bergsma S, Dredze M, Van Durme B, Wilson T, Yarowsky D (2013) Broadly improving user classification via communication-based name and location clustering on Twitter. In: Proceedings of the 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, pp\u00a01010\u20131019"},{"key":"271_CR123","unstructured":"Twitter (2019) Rules and filtering. https:\/\/developer.twitter.com\/en\/docs\/tweets\/rules-and-filtering\/overview\/premium-operators"},{"key":"271_CR124","doi-asserted-by":"crossref","unstructured":"Phillips A, Davis M (2009) Best current practice (BCP): tags for identifying languages. Technical report, Network Working Group IETF, California, USA","DOI":"10.17487\/rfc5646"},{"key":"271_CR125","doi-asserted-by":"crossref","unstructured":"Alshaabi T, Adams JL, Arnold MV, Minot JR, Dewhurst DR, Reagan AJ, Danforth CM, Dodds PS (2020) Storywrangler: a massive exploratorium for sociolinguistic, cultural, socioeconomic, and political timelines using Twitter. http:\/\/arxiv.org\/abs\/2003.03667","DOI":"10.1126\/sciadv.abe6534"},{"key":"271_CR126","unstructured":"Dodds PS et al (2020) Long-term word frequency dynamics derived from Twitter are corrupted: a bespoke approach to detecting and removing pathologies in ensembles of time series. https:\/\/arxiv.org\/abs\/2008.11305"},{"key":"271_CR127","doi-asserted-by":"publisher","DOI":"10.21832\/9781853599361","volume-title":"Cross-linguistic similarity in foreign language learning","author":"H Ringbom","year":"2006","unstructured":"Ringbom H (2006) Cross-linguistic similarity in foreign language learning. Multilingual Matters, Bristol. https:\/\/doi.org\/10.21832\/9781853599361"},{"key":"271_CR128","doi-asserted-by":"publisher","DOI":"10.1515\/9783110808506","volume-title":"Parametric syntax: case studies in semitic and romance languages","author":"H Borer","year":"1984","unstructured":"Borer H (1984) Parametric syntax: case studies in semitic and romance languages. de\u00a0Gruyter, Berlin. https:\/\/doi.org\/10.1515\/9783110808506"},{"issue":"1","key":"271_CR129","doi-asserted-by":"publisher","DOI":"10.1140\/epjds\/s13688-016-0070-8","volume":"5","author":"A Samoilenko","year":"2016","unstructured":"Samoilenko A, Karimi F, Edler D, Kunegis J, Strohmaier M (2016) Linguistic neighbourhoods: explaining cultural borders on Wikipedia through multilingual co-editing activity. EPJ Data Sci 5(1):9","journal-title":"EPJ Data Sci"},{"key":"271_CR130","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1007\/978-3-319-67217-5_28","volume-title":"International conference on social informatics","author":"H Jin","year":"2017","unstructured":"Jin H, Toyoda M, Yoshinaga N (2017) Can cross-lingual information cascades be predicted on Twitter? In: International conference on social informatics. Springer, Berlin, pp\u00a0457\u2013472"},{"key":"271_CR131","doi-asserted-by":"publisher","DOI":"10.2139\/ssrn.2029711","author":"M Hussain","year":"2012","unstructured":"Hussain M, Howard P (2012) Democracy\u2019s fourth wave? Information technologies and the fuzzy causes of the Arab Spring. SSRN Electron\u00a0J 57. https:\/\/doi.org\/10.2139\/ssrn.2029711","journal-title":"SSRN Electron\u00a0J"},{"issue":"2","key":"271_CR132","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1177\/1940161212471716","volume":"18","author":"G Wolfsfeld","year":"2013","unstructured":"Wolfsfeld G, Segev E, Sheafer T (2013) Social media and the Arab Spring: politics comes first. Int J Press Polit 18(2):115\u2013137","journal-title":"Int J Press Polit"},{"key":"271_CR133","first-page":"8","volume":"5","author":"T Dewey","year":"2012","unstructured":"Dewey T, Kaden J, Marks M, Matsushima S, Zhu B (2012) The impact of social media on social unrest in the Arab Spring. Int Policy Program 5:8","journal-title":"Int Policy Program"},{"issue":"5","key":"271_CR134","doi-asserted-by":"publisher","first-page":"647","DOI":"10.1177\/1464884911410017","volume":"12","author":"S Cottle","year":"2011","unstructured":"Cottle S (2011) Media and the Arab uprisings of 2011. Journalism 12(5):647\u2013659","journal-title":"Journalism"},{"key":"271_CR135","unstructured":"Stone B (2009) Retweet limited rollout. Twitter"},{"key":"271_CR136","unstructured":"Shu C (2015) Twitter officially launches its \u201cretweet with comment\u201d feature. TechCrunch"},{"key":"271_CR137","unstructured":"Stone B (2007) Are you Twittering @ me? Twitter. https:\/\/blog.twitter.com\/official\/en_us\/a\/2007\/are-you-twittering-me.html"},{"key":"271_CR138","unstructured":"Gadde V, Beykpour K (2020) Additional steps we\u2019re taking ahead of the 2020 US election. https:\/\/blog.twitter.com\/en_us\/topics\/company\/2020\/2020-election-changes.html"},{"key":"271_CR139","unstructured":"Roth Y, Achuthan A (2020) Building rules in public: our approach to synthetic & manipulated media. https:\/\/blog.twitter.com\/en_us\/topics\/company\/2020\/new-approach-to-synthetic-and-manipulated-media.html"},{"key":"271_CR140","unstructured":"Roth Y, Pickles N (2020) Updating our approach to misleading information. https:\/\/blog.twitter.com\/en_us\/topics\/product\/2020\/updating-our-approach-to-misleading-information.html"},{"key":"271_CR141","unstructured":"Gadde V, Beykpour K (2020) Expanding our policies to further protect the civic conversation. https:\/\/blog.twitter.com\/en_us\/topics\/company\/2020\/2020-election-changes.html"},{"key":"271_CR142","unstructured":"Twitter (2019) Tweet geospatial metadata. https:\/\/developer.twitter.com\/en\/docs\/tutorials\/tweet-geo-metadata"},{"key":"271_CR143","volume-title":"Human behaviour and the principle of least-effort","author":"GK Zipf","year":"1949","unstructured":"Zipf GK (1949) Human behaviour and the principle of least-effort. Addison-Wesley, Cambridge"}],"container-title":["EPJ Data Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1140\/epjds\/s13688-021-00271-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1140\/epjds\/s13688-021-00271-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1140\/epjds\/s13688-021-00271-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T03:44:00Z","timestamp":1675136640000},"score":1,"resource":{"primary":{"URL":"https:\/\/epjdatascience.springeropen.com\/articles\/10.1140\/epjds\/s13688-021-00271-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,31]]},"references-count":143,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2021,12]]}},"alternative-id":["271"],"URL":"https:\/\/doi.org\/10.1140\/epjds\/s13688-021-00271-0","relation":{},"ISSN":["2193-1127"],"issn-type":[{"value":"2193-1127","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,3,31]]},"assertion":[{"value":"10 April 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 March 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 March 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"15"}}