{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,5]],"date-time":"2026-03-05T07:29:53Z","timestamp":1772695793096,"version":"3.50.1"},"reference-count":126,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2025,1,10]],"date-time":"2025-01-10T00:00:00Z","timestamp":1736467200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100006374","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1950885,2243941"],"award-info":[{"award-number":["1950885,2243941"]}],"id":[{"id":"10.13039\/501100006374","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Hum.-Comput. Interact."],"published-print":{"date-parts":[[2025,1,10]]},"abstract":"<jats:p>Practitioners dealing with large text collections frequently use topic models such as Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF) in their projects to explore trends. Despite twenty years of accrued advancement in natural language processing tools, these models are found to be slow and challenging to apply to text exploration projects. In our work, we engaged with practitioners (n=15) who use topic modeling to explore trends in large text collections to understand their project workflows and investigate which factors often slow down the processes and how they deal with such errors and interruptions in automated topic modeling. Our findings show that practitioners are required to diagnose and resolve context-specific problems with preparing data and models and need control for these steps, especially for data cleaning and parameter selection. Our major findings resonate with existing work across CSCW, computational social science, machine learning, data science, and digital humanities. They also leave us questioning whether automation is actually a useful goal for tools designed for topic models and text exploration.<\/jats:p>","DOI":"10.1145\/3701201","type":"journal-article","created":{"date-parts":[[2025,1,10]],"date-time":"2025-01-10T16:06:46Z","timestamp":1736525206000},"page":"1-30","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["\"My Very Subjective Human Interpretation\": Domain Expert Perspectives on Navigating the Text Analysis Loop for Topic Models"],"prefix":"10.1145","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-0693-6334","authenticated-orcid":false,"given":"Alexandra","family":"Schofield","sequence":"first","affiliation":[{"name":"Harvey Mudd College, Claremont, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-4426-8179","authenticated-orcid":false,"given":"Siqi","family":"Wu","sequence":"additional","affiliation":[{"name":"Massachusetts Institute of Technology &amp; Harvey Mudd College, Cambridge, Massachusetts, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-4051-6104","authenticated-orcid":false,"given":"Theo","family":"Bayard de Volo","sequence":"additional","affiliation":[{"name":"Pitzer College, Claremont, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-3916-4160","authenticated-orcid":false,"given":"Tatsuki","family":"Kuze","sequence":"additional","affiliation":[{"name":"Harvey Mudd College, Claremont, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6116-1647","authenticated-orcid":false,"given":"Alfredo","family":"Gomez","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University &amp; Harvey Mudd College, Pittsburgh, PA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2906-7391","authenticated-orcid":false,"given":"Sharifa","family":"Sultana","sequence":"additional","affiliation":[{"name":"University of Illinois, Urbana-Champaign, Champaign, IL, USA"}]}],"member":"320","published-online":{"date-parts":[[2025,1,10]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1207\/S15327051HCI1523_5"},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the 10th International Conference on Computational Semantics (IWCS 2013)","author":"Aletras Nikolaos","year":"2013","unstructured":"Nikolaos Aletras and Mark Stevenson. 2013. Evaluating Topic Coherence Using Distributional Semantics. In Proceedings of the 10th International Conference on Computational Semantics (IWCS 2013) -- Long Papers (Potsdam, Germany). Association for Computational Linguistics, 13--22. http:\/\/www.aclweb.org\/anthology\/W13-0102"},{"key":"e_1_2_1_3_1","volume-title":"ZUMA-Arbeitsbericht","volume":"07","author":"Alexa Melina","year":"1997","unstructured":"Melina Alexa. 1997. Computer-assisted text analysis methodology in the social sciences. ZUMA-Arbeitsbericht, Vol. 1997\/07. Zentrum f\u00fcr Umfragen, Methoden und Analysen -ZUMA-, Mannheim. 40 pages."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2909132.2909252"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/VAST.2014.7042493"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1609\/aimag.v35i4.2513"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300233"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3359190"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/978--3-030--24925-0_4"},{"key":"e_1_2_1_10_1","volume-title":"Researcher as Bricoleur: Contextualizing humanists' digital workflows. DHQ: Digital Humanities Quarterly 12, 3","author":"Antonijevic Smiljana","year":"2018","unstructured":"Smiljana Antonijevic and Ellysa Stern Cahoy. 2018. Researcher as Bricoleur: Contextualizing humanists' digital workflows. DHQ: Digital Humanities Quarterly 12, 3 (2018)."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613905.3650849"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.23786"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.21105\/joss.00774"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2133806.2133826"},{"key":"e_1_2_1_15_1","first-page":"993","article-title":"Latent Dirichlet allocation","author":"Blei David M","year":"2003","unstructured":"David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent Dirichlet allocation. Journal of Machine Learning Research 3, Jan (2003), 993--1022.","journal-title":"Journal of Machine Learning Research 3"},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of GSCL","author":"Bouma Gerlof","year":"2009","unstructured":"Gerlof Bouma. 2009. Normalized (pointwise) mutual information in collocation extraction. In Proceedings of GSCL (Potsdam, Germany). German Society for Computational Linguistics and Language Technology, 31--40."},{"key":"e_1_2_1_17_1","volume-title":"Handbook of Mixed Membership Models and Their Applications, Edoardo M. Airoldi, David Blei, Elena A. Erosheva, and Stephen E. fienberg (Eds.)","author":"Boyd-Graber Jordan","year":"2014","unstructured":"Jordan Boyd-Graber, David Mimno, and David Newman. 2014. Care and Feeding of Topic Models: Problems, Diagnostics, and Improvements. In Handbook of Mixed Membership Models and Their Applications, Edoardo M. Airoldi, David Blei, Elena A. Erosheva, and Stephen E. fienberg (Eds.). CRC Press, Boca Raton, florida. docs\/2014_book_chapter_care_and_feeding.pdf"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3544548.3581524"},{"key":"e_1_2_1_19_1","volume-title":"Visualizing Topic Models. In Sixth International AAAI Conference on Weblogs and Social Media. AAAI. https:\/\/www.aaai.org\/ocs\/index.php\/ICWSM\/ICWSM12\/paper\/view\/4645","author":"June-Barlow Chaney Allison","unstructured":"Allison June-Barlow Chaney and David M. Blei. 2012. Visualizing Topic Models. In Sixth International AAAI Conference on Weblogs and Social Media. AAAI. https:\/\/www.aaai.org\/ocs\/index.php\/ICWSM\/ICWSM12\/paper\/view\/4645"},{"key":"e_1_2_1_20_1","volume-title":"Blei","author":"Chang Jonathan","year":"2009","unstructured":"Jonathan Chang, Sean Gerrish, ChongWang, Jordan L. Boyd-Graber, and David M. Blei. 2009. Reading Tea Leaves: How Humans Interpret Topic Models. In Advances in Neural Information Processing Systems 22, Y. Bengio, D. Schuurmans, J. D. Lafferty, C. K. I. Williams, and A. Culotta (Eds.). Curran Associates, Inc., 288--296. http:\/\/papers.nips.cc\/paper\/3700-reading-tea-leaves-how-humans-interpret-topic-models.pdf"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2856767.2856787"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/tvcg.2013.212"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2254556.2254572"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/s42803-023-00069-8"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411764.3445425"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411764.3445775"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1086\/702594"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.erss.2020.101704"},{"key":"e_1_2_1_29_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In North American","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In North American Chapter of the Association for Computational Linguistics. https:\/\/api.semanticscholar.org\/CorpusID:52967399"},{"key":"e_1_2_1_30_1","volume-title":"Humanities Approaches to Graphical Display. Digital Humanities Quarterly 5, 1","author":"Drucker Johanna","year":"2011","unstructured":"Johanna Drucker. 2011. Humanities Approaches to Graphical Display. Digital Humanities Quarterly 5, 1 (2011). http:\/\/ccl.idm.oclc.org\/login?url=https:\/\/www.proquest.com\/scholarly-journals\/humanities-approachesgraphical-display\/docview\/2555208513\/se-2"},{"key":"e_1_2_1_31_1","volume-title":"Applied Cognitive Work Analysis: A Pragmatic Methodology for Designing Revolutionary Cognitive Affordances. Handbook of cognitive task design","author":"Elm William C","year":"2003","unstructured":"William C Elm, Scott S Potter, James W Gualtieri, Emilie M Roth, and James R Easter. 2003. Applied Cognitive Work Analysis: A Pragmatic Methodology for Designing Revolutionary Cognitive Affordances. Handbook of cognitive task design (2003), 357--382."},{"key":"e_1_2_1_32_1","volume-title":"ChatGPT outperforms humans in emotional awareness evaluations. Frontiers in Psychology 14","author":"Elyoseph Zohar","year":"2023","unstructured":"Zohar Elyoseph, Dorit Hadar-Shoval, Kfir Asraf, and Maya Lvovsky. 2023. ChatGPT outperforms humans in emotional awareness evaluations. Frontiers in Psychology 14 (2023). https:\/\/api.semanticscholar.org\/CorpusID:258891670"},{"key":"e_1_2_1_33_1","volume-title":"The Cambridge Handbook of Expertise and Expert Performance","author":"Ericsson K Anders","unstructured":"K Anders Ericsson, Robert R Hoffman, Aaron Kozbelt, and A Mark Williams. 2018. The Cambridge Handbook of Expertise and Expert Performance. Cambridge University Press."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3134678"},{"key":"e_1_2_1_35_1","volume-title":"An Interpretation of Digital Humanities","author":"Evans Leighton","unstructured":"Leighton Evans and Sian Rees. 2012. An Interpretation of Digital Humanities. In Understanding Digital Humanities, David M Berry (Ed.). Palgrave Macmillan, 21--42."},{"key":"e_1_2_1_36_1","unstructured":"Shane Evans Pablo Hoffman and the Zyte software team. 2021. ScraPy. https:\/\/scrapy.org\/."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3479856"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3375192"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3491102.3517446"},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of the TextVis Workshop - Intelligent User Interfaces (IUI). 7.","author":"Ganesan Ashwinkumar","year":"2015","unstructured":"Ashwinkumar Ganesan, Kiante Branley, Shimei Pan, and Jian Chen. 2015. LDAExplore: Visualizing Topic Models Generated Using Latent Dirichlet Allocation. In Proceedings of the TextVis Workshop - Intelligent User Interfaces (IUI). 7."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613905.3637133"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2023.3244065"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-11-367"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3491102.3502076"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3567552"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3492844"},{"key":"e_1_2_1_47_1","volume-title":"Building better digital humanities tools: Toward broader audiences and usercentered designs. Digital Humanities Quarterly 6, 2","author":"Gibbs Fred","year":"2012","unstructured":"Fred Gibbs and Trevor Owens. 2012. Building better digital humanities tools: Toward broader audiences and usercentered designs. Digital Humanities Quarterly 6, 2 (2012)."},{"key":"e_1_2_1_48_1","unstructured":"Andrew Goldstone Susana Gal\u00e1n C Laura Lovin Andrew Mazzaschi and Lindsey Whitmore. 2014. An interactive topic model of signs. Signs J. (2014)."},{"key":"e_1_2_1_49_1","volume-title":"BERTopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv preprint arXiv:2203.05794","author":"Grootendorst Maarten","year":"2022","unstructured":"Maarten Grootendorst. 2022. BERTopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv preprint arXiv:2203.05794 (2022)."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.18637\/jss.v040.i13"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3148330.3148338"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/3132847.3132946"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376177"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/3526113.3545681"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.1212303"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/2854158"},{"key":"e_1_2_1_57_1","volume-title":"Advances in Neural Information Processing Systems","volume":"34","author":"Hoyle Alexander Miserlis","year":"2021","unstructured":"Alexander Miserlis Hoyle, Pranav Goel, Denis Peskov, Andrew Hian-Cheong, Jordan L. Boyd-Graber, and Philip Resnik. 2021. Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence. In Advances in Neural Information Processing Systems, Vol. 34."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-013--5413-0"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.2196\/15700"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1111\/psj.12343"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3359252"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1093\/llc\/fqv052"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/PACIFICVIS.2015.7156366"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-019-09459-3"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642830"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/E14--1056"},{"key":"e_1_2_1_67_1","volume-title":"Algorithms for non-negative matrix factorization. Advances in Neural Information Processing Systems 13","author":"Lee Daniel","year":"2000","unstructured":"Daniel Lee and H Sebastian Seung. 2000. Algorithms for non-negative matrix factorization. Advances in Neural Information Processing Systems 13 (2000)."},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijhcs.2017.03.007"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411764.3445682"},{"key":"e_1_2_1_70_1","volume-title":"Transdisciplinarity and digital humanities: lessons learned from developing text-mining tools for textual analysis","author":"Lin Yu-Wei","unstructured":"Yu-Wei Lin. 2012. Transdisciplinarity and digital humanities: lessons learned from developing text-mining tools for textual analysis. In Understanding Digital Humanities, David M Berry (Ed.). Palgrave Macmillan, 295--314."},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1007\/978--3-030--86324--1_25"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17--1083"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1145\/3579484"},{"key":"e_1_2_1_74_1","volume-title":"MALLET: A MAchine Learning for LanguagE Toolkit","author":"McCallum Andrew Kachites","year":"2002","unstructured":"Andrew Kachites McCallum. 2002. MALLET: A MAchine Learning for LanguagE Toolkit. http:\/\/mallet.cs.umass.edu."},{"key":"e_1_2_1_75_1","volume-title":"Mendenhall","author":"Millar Jeremy R.","year":"2009","unstructured":"Jeremy R. Millar, Gilbert L. Peterson, and Michael J. Mendenhall. 2009. Document Clustering and Visualization with Latent Dirichlet Allocation and Self-Organizing Maps. In The Florida AI Research Society. https:\/\/api.semanticscholar. org\/CorpusID:14725197"},{"key":"e_1_2_1_76_1","unstructured":"David Mimno. 2013. mimno\/jsLDA: An implementation of latent Dirichlet allocation in JavaScript. https:\/\/github.com\/mimno\/jsLDA last updated: 2018--10--3."},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.5555\/2145432.2145462"},{"key":"e_1_2_1_78_1","volume-title":"State of What Art? A Call for Multi-Prompt LLM Evaluation. arXiv preprint arXiv:2401.00595","author":"Mizrahi Moran","year":"2023","unstructured":"Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, and Gabriel Stanovsky. 2023. State of What Art? A Call for Multi-Prompt LLM Evaluation. arXiv preprint arXiv:2401.00595 (2023)."},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1145\/2957276.2957280"},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.5555\/2888116.2888368"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1177\/2053951715599179"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.apergo.2016.06.001"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.3389\/frai.2020.00062"},{"key":"e_1_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.24565"},{"key":"e_1_2_1_85_1","unstructured":"OpenAI. 2023. GPT-4 Technical Report. https:\/\/api.semanticscholar.org\/CorpusID:257532815"},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1080\/13645579.2021.2018905"},{"key":"e_1_2_1_87_1","volume-title":"Digital Humanities","author":"Pielstrom\u00f6m Steffen","year":"2018","unstructured":"Steffen Pielstrom\u00f6m, Severin Simmler, Thorsten Vitt, and Fotis Jannidis. 2018. A Graphical User Interface for LDA Topic Modeling. In Digital Humanities 2018. https:\/\/dh2018.adho.org\/en\/a-graphical-user-interface-for-lda-topic-modeling\/"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.22148\/001c.11826"},{"key":"e_1_2_1_89_1","volume-title":"SMILE: Single-turn to Multi-turn Inclusive Language Expansion via ChatGPT for Mental Health Support. arXiv abs\/2305.00450","author":"Qiu Huachuan","year":"2023","unstructured":"Huachuan Qiu, Hongliang He, Shuai Zhang, Anqi Li, and Zhenzhong Lan. 2023. SMILE: Single-turn to Multi-turn Inclusive Language Expansion via ChatGPT for Mental Health Support. arXiv abs\/2305.00450 (2023)."},{"key":"e_1_2_1_90_1","volume-title":"NeurIPS 2009 Workshop on Applications for Topic Models: Text and Beyond","volume":"5","author":"Ramage Daniel","year":"2009","unstructured":"Daniel Ramage, Evan Rosen, Jason Chuang, Christopher D Manning, and Daniel A McFarland. 2009. Topic modeling for the social sciences. In NeurIPS 2009 Workshop on Applications for Topic Models: Text and Beyond, Vol. 5. 1--4."},{"key":"e_1_2_1_91_1","volume-title":"Sentiment Analysis in Literary Studies. A Critical Survey. DHQ: Digital Humanities Quarterly 17, 3","author":"Rebora Simone","year":"2023","unstructured":"Simone Rebora. 2023. Sentiment Analysis in Literary Studies. A Critical Survey. DHQ: Digital Humanities Quarterly 17, 3 (2023)."},{"key":"e_1_2_1_92_1","volume-title":"Gensim--python framework for vector space modelling","author":"Rehurek Radim","year":"2011","unstructured":"Radim Rehurek and Petr Sojka. 2011. Gensim--python framework for vector space modelling. NLP Centre, Faculty of Informatics, Masaryk University, Brno, Czech Republic 3, 2 (2011)."},{"key":"e_1_2_1_93_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613905.3650798"},{"key":"e_1_2_1_94_1","unstructured":"Leonard Richardson. 2012. Beautiful Soup 4. https:\/\/www.crummy.com\/software\/BeautifulSoup\/."},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1111\/ajps.12103"},{"key":"e_1_2_1_96_1","doi-asserted-by":"publisher","DOI":"10.1145\/3491102.3517742"},{"key":"e_1_2_1_97_1","first-page":"49","article-title":"Words alone: Dismantling topic models in the humanities","volume":"2","author":"Schmidt Benjamin M","year":"2012","unstructured":"Benjamin M Schmidt. 2012. Words alone: Dismantling topic models in the humanities. Journal of Digital Humanities 2, 1 (2012), 49--65.","journal-title":"Journal of Digital Humanities"},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17--1290"},{"key":"e_1_2_1_99_1","doi-asserted-by":"publisher","DOI":"10.46430\/phen0017"},{"key":"e_1_2_1_100_1","unstructured":"Dave Shepard. 2020. shepdl\/handle. https:\/\/github.com\/shepdl\/handle original-date: 2019-05-03T23:59:03Z."},{"key":"e_1_2_1_101_1","doi-asserted-by":"publisher","DOI":"10.1145\/3532106.3533483"},{"key":"e_1_2_1_102_1","unstructured":"Aditi Shrikumar. 2013. Designing an Exploratory Text Analysis Tool for Humanities and Social Sciences Research. Ph.D. Dissertation. UC Berkeley. https:\/\/escholarship.org\/uc\/item\/9f88p8t2"},{"key":"e_1_2_1_103_1","unstructured":"Carson Sievert. 2014. cpsievert\/LDAvis. https:\/\/github.com\/cpsievert\/LDAvis original-date: 2014-03-05T06:17:16Z."},{"key":"e_1_2_1_104_1","unstructured":"St\u00e9fan Sinclair and Geoffrey Rockwell. 2016. Voyant Tools. http:\/\/voyant-tools.org\/."},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1145\/3172944.3172965"},{"key":"e_1_2_1_106_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376624"},{"key":"e_1_2_1_107_1","volume-title":"Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics","author":"Thompson Laure","year":"2018","unstructured":"Laure Thompson and David Mimno. 2018. Authorless Topic Models: Biasing Models Away from Known Structure. In Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, 3903--3914. https:\/\/aclanthology.org\/C18--1329"},{"key":"e_1_2_1_108_1","volume-title":"Will We Come? Large Scale Digital Infrastructures as a Dead End for Digital Humanities. Historical Social Research 37, 3 (141)","author":"van Zundert Joris","year":"2012","unstructured":"Joris van Zundert. 2012. If You Build It, Will We Come? Large Scale Digital Infrastructures as a Dead End for Digital Humanities. Historical Social Research 37, 3 (141) (2012), 165--186. http:\/\/www.jstor.org\/stable\/41636603"},{"key":"e_1_2_1_109_1","volume-title":"Cognitive work analysis: Toward safe, productive, and healthy computer-based work","author":"Vicente Kim J","unstructured":"Kim J Vicente. 1999. Cognitive work analysis: Toward safe, productive, and healthy computer-based work. CRC press."},{"key":"e_1_2_1_110_1","volume-title":"Culotta (Eds.)","volume":"22","author":"Wallach Hanna M","year":"2009","unstructured":"Hanna M Wallach, David Mimno, and Andrew McCallum. 2009. Rethinking LDA: Why priors matter. In Advances in Neural Information Processing Systems, Y. Bengio, D. Schuurmans, J. Lafferty, C. Williams, and A. Culotta (Eds.), Vol. 22. Curran Associates, Inc., 1973--1981."},{"key":"e_1_2_1_111_1","doi-asserted-by":"publisher","DOI":"10.1145\/1553374.1553515"},{"key":"e_1_2_1_112_1","doi-asserted-by":"publisher","DOI":"10.21105\/joss.03021"},{"key":"e_1_2_1_113_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835827"},{"key":"e_1_2_1_114_1","doi-asserted-by":"publisher","DOI":"10.25080\/Majora-92bf1922-00a"},{"key":"e_1_2_1_115_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613905.3636302"},{"key":"e_1_2_1_116_1","first-page":"1","article-title":"Graph neural collaborative topic model for citation recommendation","volume":"40","author":"Xie Qianqian","year":"2021","unstructured":"Qianqian Xie, Yutao Zhu, Jimin Huang, Pan Du, and Jian-Yun Nie. 2021. Graph neural collaborative topic model for citation recommendation. ACM Transactions on Information Systems (TOIS) 40, 3 (2021), 1--30.","journal-title":"ACM Transactions on Information Systems (TOIS)"},{"key":"e_1_2_1_117_1","doi-asserted-by":"publisher","DOI":"10.14778\/3297753.3297763"},{"key":"e_1_2_1_118_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3457566"},{"key":"e_1_2_1_119_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411764.3445306"},{"key":"e_1_2_1_120_1","volume-title":"Corpus-Driven Analysis of Conceptual Metaphor in Artificial Intelligence Language: A Sample of ChatGPT-Written Speeches. Journal of Contemporary Educational Research","author":"Yang Yang","year":"2023","unstructured":"Yang Yang. 2023. Corpus-Driven Analysis of Conceptual Metaphor in Artificial Intelligence Language: A Sample of ChatGPT-Written Speeches. Journal of Contemporary Educational Research (2023)."},{"key":"e_1_2_1_121_1","doi-asserted-by":"publisher","DOI":"10.1075\/ijcl.23087.yu"},{"key":"e_1_2_1_122_1","doi-asserted-by":"publisher","DOI":"10.1145\/3544548.3581388"},{"key":"e_1_2_1_123_1","doi-asserted-by":"publisher","DOI":"10.1145\/3392826"},{"key":"e_1_2_1_124_1","doi-asserted-by":"crossref","unstructured":"Gechuan Zhang David Lillis and Paul Nulty. 2021. Can Domain Pre-training Help Interdisciplinary Researchers from Data Annotation Poverty? A Case Study of Legal Argument Mining with BERT-based Transformers. In NLP4DH. https:\/\/api.semanticscholar.org\/CorpusID:252847488","DOI":"10.46298\/jdmdh.9147"},{"key":"e_1_2_1_125_1","doi-asserted-by":"publisher","DOI":"10.1145\/3512980"},{"key":"e_1_2_1_126_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSME.2014.103"}],"container-title":["Proceedings of the ACM on Human-Computer Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3701201","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3701201","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T14:24:40Z","timestamp":1755872680000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3701201"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,10]]},"references-count":126,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,1,10]]}},"alternative-id":["10.1145\/3701201"],"URL":"https:\/\/doi.org\/10.1145\/3701201","relation":{},"ISSN":["2573-0142"],"issn-type":[{"value":"2573-0142","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1,10]]},"assertion":[{"value":"2025-01-10","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}