{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:28:12Z","timestamp":1777454892340,"version":"3.51.4"},"reference-count":69,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2019,1,1]],"date-time":"2019-01-01T00:00:00Z","timestamp":1546300800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1028177"],"award-info":[{"award-number":["1028177"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Big Data &amp; Society"],"published-print":{"date-parts":[[2019,1]]},"abstract":"<jats:p>Within some online communities, discussion often centers on issues on which writers take sides, and within some subset of those debate-prone communities, we find over time that particular sets of writers almost always end up on the same side of an issue. These sets we call factions. In this paper, we describe a tool to perform what we call faction discovery on online communities. Generalizing methods developed in the bibliometrics and information retrieval literature, we define a network determined by similarities of content in a community of users and add in direct evidence of online ties between users (e.g., link information such as mention-links). We then perform community detection on the network to find factions. Using a set of data collected from science and fantasy blogs, we show that the discovered factions accurately reflect an active conflict in the community leading to significant, politically related social fracture.<\/jats:p>","DOI":"10.1177\/2053951719846634","type":"journal-article","created":{"date-parts":[[2019,5,14]],"date-time":"2019-05-14T05:59:27Z","timestamp":1557813567000},"update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":1,"title":["Linguistically guided community discovery"],"prefix":"10.1177","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6288-4014","authenticated-orcid":false,"given":"Jean M","family":"Gawron","sequence":"first","affiliation":[{"name":"San Diego State University, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alex","family":"Dodge","sequence":"additional","affiliation":[{"name":"NTENT, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ming-Hsiang","family":"Tsou","sequence":"additional","affiliation":[{"name":"San Diego State University, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Brian","family":"Spitzberg","sequence":"additional","affiliation":[{"name":"San Diego State University, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Li","family":"An","sequence":"additional","affiliation":[{"name":"San Diego State University, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2019,5,14]]},"reference":[{"key":"bibr1-2053951719846634","doi-asserted-by":"crossref","unstructured":"Adamic LA and Glance N (2005) The political blogosphere and the 2004 US election: Divided they blog. In:\n                      Proceedings of the third international workshop on Link discovery\n                      , pp.36\u201343. New York, NY: ACM Digital Press.","DOI":"10.1145\/1134271.1134277"},{"key":"bibr2-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4419-8462-3_1"},{"key":"bibr3-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-009-9108-x"},{"key":"bibr4-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-009-9108-x"},{"key":"bibr5-2053951719846634","volume-title":"Imagined Communities: Reflections on the Origin and Spread of Nationalism","author":"Anderson B","year":"2003"},{"key":"bibr6-2053951719846634","unstructured":"Biedenharn I (2015) Hugo awards fall victim to misogynistic and racist voting.\n                      Entertainment Weekly\n                      , 6 April."},{"key":"bibr7-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1088\/1742-5468\/2008\/10\/P10008"},{"key":"bibr8-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1080\/1369118X.2012.678878"},{"key":"bibr9-2053951719846634","doi-asserted-by":"crossref","unstructured":"Bradshaw S (2003) Reference directed indexing: Redeeming relevance for subject search in citation indexes. In:\n                      International conference on theory and practice of digital libraries\n                      , pp.499\u2013510.","DOI":"10.1007\/978-3-540-45175-4_45"},{"key":"bibr10-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1177\/2053951716658060"},{"key":"bibr11-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1002\/asi.21309"},{"key":"bibr12-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1177\/2053951716662897"},{"key":"bibr13-2053951719846634","first-page":"22","volume":"16","author":"Church KW","year":"1990","journal-title":"Computational Linguistics"},{"key":"bibr14-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1038\/nature06830"},{"key":"bibr15-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.70.066111"},{"key":"bibr16-2053951719846634","unstructured":"Colbert S (2014) The Colbert Report \u2013 Gamergate \u2013 Anita Sarkeesian. YouTube."},{"key":"bibr17-2053951719846634","unstructured":"Correia L (2015) Sad puppies update: The nominees announced and why I refused my nomination. monsterhunternation.com, 4 April."},{"key":"bibr18-2053951719846634","volume-title":"Elements of Information Theory","author":"Cover TM","year":"2012"},{"key":"bibr20-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2011.239"},{"key":"bibr21-2053951719846634","doi-asserted-by":"crossref","unstructured":"Cutting DR, Karger DR, Pedersen JO, et\u00a0al. (1992) Scatter\/gather: A cluster-based approach to browsing large document collections. In:\n                      Proceedings of the 15th annual international ACM SIGIR conference on research and development in information retrieval\n                      , pp.318\u2013329. New York, NY: ACM Digital Press.","DOI":"10.1145\/133160.133214"},{"key":"bibr22-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1126\/science.149.3683.510"},{"key":"bibr23-2053951719846634","unstructured":"Desto Y (2018) Rotten tomatoes is fighting back against white nationalist black panther trolls. Available at: https:\/\/www.vanityfair.com\/hollywood\/2018\/02\/rotten-tomatoes-black-panther-facebook-group (accessed June 7, 2016)."},{"key":"bibr24-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1111\/ajps.12226"},{"key":"bibr26-2053951719846634","doi-asserted-by":"crossref","unstructured":"Finkel JR, Grenager T and Manning C (2005) Incorporating non-local information into information extraction systems by Gibbs sampling. In:\n                      Proceedings of the 43nd annual meeting of the association for computational linguistics (ACL 2005)\n                      , pp.363\u2013370. Stroudsberg, PA: Association for Computational Linguistics.","DOI":"10.3115\/1219840.1219885"},{"key":"bibr27-2053951719846634","unstructured":"Flood A (2014) Hugo award nominees withdraw amid \u2018Puppygate\u2019 storm.\n                      The Guardian\n                      , 17 April."},{"key":"bibr28-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1016\/j.physrep.2009.11.002"},{"key":"bibr29-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1126\/science.122.3159.108"},{"key":"bibr130-2053951719846634","doi-asserted-by":"crossref","unstructured":"Gawron, J. M., Gupta, D., Stephens, K., Tsou, M. H., Spitzberg, B. & An, L. (2012). Using group membership markers for group identification. In\n                      Sixth International AAAI Conference on Weblogs and Social Media\n                      . Menlo Park, CA: AAAI Press, pp. 467\u2013470.","DOI":"10.1609\/icwsm.v6i1.14336"},{"key":"bibr30-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.122653799"},{"key":"bibr31-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2017.12.018"},{"key":"bibr32-2053951719846634","doi-asserted-by":"publisher","DOI":"10.4324\/9780203930274"},{"key":"bibr33-2053951719846634","doi-asserted-by":"crossref","unstructured":"Halko N, Tropp JA and Martinsson PG (2011) Finding structure with randomness: Stochastic algorithms for constructing approximate matrix decompositions.\n                      SIAM Review, Survey and Review Section\n                      53(2): 217\u2013288.","DOI":"10.1137\/090771806"},{"key":"bibr34-2053951719846634","doi-asserted-by":"crossref","unstructured":"Haveliwala TH, Gionis A, Klein D, et\u00a0al. (2002) Evaluating strategies for similarity search on the web. In:\n                      Proceedings of the 11th international conference on World Wide Web\n                      , pp.432\u2013442. New York, NY: ACM Digital Press.","DOI":"10.1145\/511446.511502"},{"key":"bibr35-2053951719846634","volume-title":"Culture Wars: The Struggle to Define America","author":"Hunter JD","year":"1992"},{"key":"bibr36-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1145\/324133.324140"},{"key":"bibr38-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511809071"},{"key":"bibr39-2053951719846634","unstructured":"Martin GRR (2015) Puppygate. grrm.livejournal.com."},{"key":"bibr42-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0601602103"},{"key":"bibr143-2053951719846634","doi-asserted-by":"crossref","unstructured":"Newman ME (2006b) Finding community structure in networks using the eigenvectors of matrices.\n                      Physical Review E\n                      74: 036104.","DOI":"10.1103\/PhysRevE.74.036104"},{"key":"bibr43-2053951719846634","unstructured":"Nobyline (2015) Hugo award nominations spark criticism over diversity in sci-fi.\n                      The Telegraph\n                      , 8 April."},{"key":"bibr44-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1177\/2053951718811844"},{"key":"bibr45-2053951719846634","unstructured":"Parkin S (2014a) Gamergae: A scandal erupts in the video-game community.\n                      The New Yorker\n                      , Oct 17, 2014, 1\u20134."},{"key":"bibr46-2053951719846634","unstructured":"Parkin S (2014b) Zoe Quinn's depression quest.\n                      The New Yorker\n                      , Sep. 9, 2014."},{"key":"bibr47-2053951719846634","first-page":"2825","volume":"12","author":"Pedregosa F","year":"2011","journal-title":"Journal of Machine Learning Research"},{"key":"bibr48-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1086\/660298"},{"key":"bibr49-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1007\/11569596_31"},{"key":"bibr50-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1007\/s10844-013-0272-5"},{"key":"bibr51-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1177\/2053951714532240"},{"key":"bibr52-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.76.036106"},{"key":"bibr53-2053951719846634","unstructured":"Rapoport M (2015) The culture wars invade science fiction.\n                      The Wall Street Journal\n                      , 15 May."},{"key":"bibr54-2053951719846634","doi-asserted-by":"crossref","unstructured":"Ritchie A, Robertson S and Teufel S (2008) Comparing citation contexts for information retrieval. In:\n                      Proceedings of the 17th ACM conference on information and knowledge management\n                      , pp.213\u2013222. New York, NY: ACM Digital Press.","DOI":"10.1145\/1458082.1458113"},{"key":"bibr55-2053951719846634","doi-asserted-by":"crossref","unstructured":"Robertson S, Zaragoza H and Taylor M (2004) Simple BM25 extension to multiple weighted fields. In:\n                      Proceedings of the thirteenth ACM international conference on information and knowledge management\n                      , pp.42\u201349. New York, NY: ACM Digital Press.","DOI":"10.1145\/1031171.1031181"},{"key":"bibr56-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1140\/epjst\/e2010-01179-1"},{"key":"bibr57-2053951719846634","volume-title":"The SMART Retrieval System","author":"Salton G","year":"1971"},{"key":"bibr58-2053951719846634","volume-title":"Introduction to Modern Information Retrieval","author":"Salton G","year":"1983"},{"key":"bibr59-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0061823"},{"key":"bibr60-2053951719846634","unstructured":"Scalzi J (2015) A note about the Hugo nominations this year. whatever.scalzi.com, 6 April."},{"key":"bibr62-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1111\/jcc4.12120"},{"key":"bibr63-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1002\/asi.4630240406"},{"key":"bibr64-2053951719846634","doi-asserted-by":"crossref","unstructured":"Tang J, Sun J, Wang C, et\u00a0al. (2009) Social influence analysis in large-scale networks. In:\n                      Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining\n                      , June, pp.807\u2013816. New York: ACM.","DOI":"10.1145\/1557019.1557108"},{"key":"bibr65-2053951719846634","doi-asserted-by":"crossref","unstructured":"Tumasjan A, Sprenger TO, Sandner PG, et\u00a0al. (2010) Predicting elections with twitter: What 140 characters reveal about political sentiment. In:\n                      Fourth international AAAI conference on weblogs and social media\n                      , Washington, DC, USA, 23\u201326 May 2010, pp.178\u2013185. Menlo Park, CA: The AAAI Press.","DOI":"10.1609\/icwsm.v4i1.14009"},{"key":"bibr66-2053951719846634","first-page":"2837","volume":"11","author":"Vinh NX","year":"2010","journal-title":"The Journal of Machine Learning Research"},{"key":"bibr68-2053951719846634","doi-asserted-by":"crossref","unstructured":"Wang Y and Kitsuregawa M (2004) Enhancing contents\u2010link coupled web page clustering and its evaluation. In:\n                      Proceedings of data engineering workshop (DEWS 2004)\n                      , pp.499\u2013506. New York, NY: ACM Digital Press.","DOI":"10.1145\/584792.584875"},{"key":"bibr69-2053951719846634","doi-asserted-by":"crossref","unstructured":"Weiss R, V\u00e9lez B and Sheldon MA (1996) Hypursuit: A hierarchical network search engine that exploits content-link hypertext clustering. In:\n                      Proceedings of the seventh ACM conference on hypertext\n                      , pp.180\u2013193. New York, NY: ACM Digital Press.","DOI":"10.1145\/234828.234846"},{"key":"bibr70-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(19980401)49:4<327::AID-ASI4>3.0.CO;2-4"},{"key":"bibr71-2053951719846634","unstructured":"Wingfield N (2014) Intel pulls ads from site after \u2018Gamergate\u2019.\n                      New York Times\n                      , 2 October."},{"key":"bibr72-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1086\/jar.33.4.3629752"},{"key":"bibr73-2053951719846634","unstructured":"Zeitchik S (2019) Captain Marvel: How the trolls always win \u2013 Until they don't.\n                      Washington Post\n                      , 7 March."},{"key":"bibr74-2053951719846634","doi-asserted-by":"publisher","DOI":"10.1002\/asi.23027"}],"container-title":["Big Data &amp; Society"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/2053951719846634","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/2053951719846634","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/2053951719846634","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,28]],"date-time":"2026-04-28T12:57:48Z","timestamp":1777381068000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/2053951719846634"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,1]]},"references-count":69,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,1]]}},"alternative-id":["10.1177\/2053951719846634"],"URL":"https:\/\/doi.org\/10.1177\/2053951719846634","relation":{},"ISSN":["2053-9517","2053-9517"],"issn-type":[{"value":"2053-9517","type":"print"},{"value":"2053-9517","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,1]]},"article-number":"2053951719846634"}}