{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T06:04:20Z","timestamp":1777615460697,"version":"3.51.4"},"reference-count":86,"publisher":"MIS Quarterly","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,9,1]]},"abstract":"<jats:p>Topic models are becoming a frequently employed tool in the empirical methods repertoire of information systems and management scholars. Given textual corpora, such as consumer reviews and online discussion forums, researchers and business practitioners often use topic modeling to either explore data in an unsupervised fashion or generate variables of interest for subsequent econometric analysis. However, one important concern stems from the fact that topic models can be notorious for their instability, i.e., the generated results could be inconsistent and irreproducible at different times, even on the same dataset. Therefore, researchers might arrive at potentially unreliable results regarding the theoretical relationships that they are testing or developing. In this paper, we attempt to highlight this problem and suggest a potential approach to addressing it. First, we empirically define and evaluate the stability problem of topic models using four textual datasets. Next, to alleviate the problem and with the goal of extracting actionable insights from textual data, we propose a new method, Stable LDA, which incorporates topical word clusters into the topic model to steer the model inference toward consistent results. We show that the proposed Stable LDA approach can significantly improve model stability while maintaining or even improving the topic model quality. 
Further, employing two case studies related to an online knowledge community and online consumer reviews, we demonstrate that the variables generated from Stable LDA can lead to more consistent estimations in econometric analyses. We believe that our work can further enhance management scholars\u2019 collective toolkit to analyze ever-growing textual data.<\/jats:p>","DOI":"10.25300\/misq\/2022\/16957","type":"journal-article","created":{"date-parts":[[2023,9,29]],"date-time":"2023-09-29T17:14:55Z","timestamp":1696007695000},"page":"923-954","source":"Crossref","is-referenced-by-count":20,"title":["Extracting Actionable Insights from Text Data: A Stable Topic Model Approach"],"prefix":"10.25300","volume":"47","author":[{"given":"Yi","family":"Yang","sequence":"first","affiliation":[{"name":"Department of Information Systems, Business Statistics and Operations Management, School of Business and Management Hong Kong University of Science and Technology, Sai Kung, Hong Kong"}]},{"given":"Ramanath","family":"Subramanyam","sequence":"additional","affiliation":[{"name":"Department of Business Administration, Gies College of Business University of Illinois at Urbana-Champaign, Champaign, IL, U.S.A."}]}],"member":"10933","published-online":{"date-parts":[[2023,9,1]]},"reference":[{"issue":"6","key":"2025082212315359700_b1-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"975","DOI":"10.1111\/poms.12303","article-title":"An integrated text analytic framework for product defect discovery","volume":"24","author":"Abrahams","year":"2014","journal-title":"Production and Operations Management"},{"issue":"4","key":"2025082212315359700_b2-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2382438.2382442","article-title":"Stability of recommendation algorithms","volume":"30","author":"Adomavicius","year":"2012","journal-title":"ACM Transactions on Information 
Systems"},{"issue":"1","key":"2025082212315359700_b3-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1287\/ijoc.2015.0662","article-title":"Classification, ranking, and top-k stability of recommendation algorithms","volume":"28","author":"Adomavicius","year":"2016","journal-title":"INFORMS Journal on Computing"},{"key":"2025082212315359700_b4-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1145\/1553374.1553378","article-title":"Incorporating domain knowledge into topic modeling via Dirichlet forest priors","author":"Andrzejewski","year":"2009","journal-title":"Proceedings of the 26th Annual International Conference on Machine Learning"},{"key":"2025082212315359700_b5-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"351","DOI":"10.1145\/129712.129746","article-title":"Computational learning theory","author":"Angluin","year":"1992","journal-title":"Proceedings of the 24th Annual ACM Symposium on Theory of Computing"},{"issue":"6","key":"2025082212315359700_b6-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1371","DOI":"10.1287\/mnsc.2014.1930","article-title":"Simultaneously discovering and quantifying risk types from textual risk disclosures","volume":"60","author":"Bao","year":"2014","journal-title":"Management Science"},{"key":"2025082212315359700_b7-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1016\/j.eswa.2017.08.047","article-title":"Stability of topic modeling via matrix factorization","volume":"91","author":"Belford","year":"2018","journal-title":"Expert Systems with Applications"},{"issue":"3","key":"2025082212315359700_b8-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"72","DOI":"10.17705\/1jais.00065","article-title":"Trust in and adoption of online recommendation agents","volume":"6","author":"Benbasat","year":"2005","journal-title":"Journal of 
the Association for Information Systems"},{"issue":"4","key":"2025082212315359700_b9-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1145\/2133806.2133826","article-title":"Probabilistic topic models","volume":"55","author":"Blei","year":"2012","journal-title":"Communications of the ACM"},{"key":"2025082212315359700_b10-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"993","DOI":"10.5555\/944919.944937","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"2025082212315359700_b11-01_ra_10_25300_misq_2022_16957","first-page":"499","article-title":"Stability and Generalization","volume":"2","author":"Bousquet","year":"2002","journal-title":"Journal of Machine Learning Research"},{"key":"2025082212315359700_b12-01_ra_10_25300_misq_2022_16957","first-page":"75","article-title":"Multilingual topic models for unaligned text","author":"Boyd-Graber","year":"2009"},{"key":"2025082212315359700_b13-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1201\/b17520","article-title":"Care and feeding of topic models: Problems, diagnostics, and improvements","volume-title":"Handbook of mixed membership models and their applications","author":"Boyd-Graber","year":"2014"},{"issue":"1","key":"2025082212315359700_b14-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1111\/1475-679x.12154","article-title":"IRS attention","volume":"55","author":"Bozanic","year":"2017","journal-title":"Journal of Accounting Research"},{"issue":"2","key":"2025082212315359700_b15-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1111\/j.2517-6161.1975.tb01532.x","article-title":"Techniques for testing the constancy of regression relationships over time","volume":"37","author":"Brown","year":"1975","journal-title":"Journal 
of the Royal Statistical Society: Series B (Methodological)"},{"issue":"6","key":"2025082212315359700_b16-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"953","DOI":"10.1287\/mksc.2016.0993","article-title":"Sentence-based text analysis for customer reviews","volume":"35","author":"B\u00fcschken","year":"2016","journal-title":"Marketing Science"},{"key":"2025082212315359700_b17-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"175","DOI":"10.3115\/v1\/n15-1018","article-title":"TopicCheck: Interactive alignment for assessing topic model stability","author":"Chuang","year":"2015"},{"issue":"12","key":"2025082212315359700_b18-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"4069","DOI":"10.1080\/03610929108830757","article-title":"On the hyper-Dirichlet Type 1 and hyper-Liouville distributions","volume":"20","author":"Dennis","year":"1991","journal-title":"Communications in Statistics: Theory and Methods"},{"key":"2025082212315359700_b19-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/p15-1075","article-title":"Efficient methods for inferring large sparse topic hierarchies","author":"Downey","year":"2015","journal-title":"Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics"},{"issue":"2-3","key":"2025082212315359700_b20-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"221","DOI":"10.1016\/j.jacceco.2017.07.002","article-title":"The evolution of 10-K textual disclosure: Evidence from latent Dirichlet allocation","volume":"64","author":"Dyer","year":"2017","journal-title":"Journal of Accounting and Economics"},{"key":"2025082212315359700_b21-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1363","DOI":"10.1145\/3412841.3442011","article-title":"On the instability of embeddings for recommender systems: The case of matrix 
factorization","author":"Gabbolini","year":"2021","journal-title":"Proceedings of the 36th Annual ACM Symposium on Applied Computing"},{"issue":"2","key":"2025082212315359700_b22-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1123","DOI":"10.1007\/s10115-018-1314-7","article-title":"Incorporating word embeddings into topic modeling of short text","volume":"61","author":"Gao","year":"2019","journal-title":"Knowledge and Information Systems"},{"issue":"357","key":"2025082212315359700_b23-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"54","DOI":"10.2307\/2286905","article-title":"Two methods for examining the stability of regression coefficients","volume":"72","author":"Garbade","year":"1977","journal-title":"Journal of the American Statistical Association"},{"issue":"2","key":"2025082212315359700_b24-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"501","DOI":"10.25300\/MISQ\/2019\/14346","article-title":"Using retweets when shaping our online persona: Topic modeling approach","volume":"43","author":"Geva","year":"2019","journal-title":"MIS Quarterly"},{"issue":"3","key":"2025082212315359700_b25-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1363","DOI":"10.1287\/mnsc.2017.2991","article-title":"Modeling consumer footprints on search engines: An interplay with social media","volume":"65","author":"Ghose","year":"2019","journal-title":"Management Science"},{"issue":"3","key":"2025082212315359700_b26-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"805","DOI":"10.25300\/MISQ\/2018\/14042","article-title":"Examining the impact of keyword ambiguity on search advertising performance: A topic model approach","volume":"42","author":"Gong","year":"2018","journal-title":"MIS 
Quarterly"},{"key":"2025082212315359700_b27-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"498","DOI":"10.1007\/978-3-662-44848-9_32","article-title":"How many topics? Stability analysis for topic models","author":"Greene","year":"2014","journal-title":"Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases"},{"issue":"4","key":"2025082212315359700_b28-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1105","DOI":"10.25300\/MISQ\/2017\/41.4.05","article-title":"Extracting representative information on intra-organizational blogging platforms","volume":"41","author":"Guo","year":"2017","journal-title":"MIS Quarterly"},{"key":"2025082212315359700_b29-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","DOI":"10.5465\/ambpp.2016.10900abstract","article-title":"Researching mass customization: Mapping hidden structures and development trajectories","author":"Hankammer","year":"2016","journal-title":"Academy of Management Proceedings"},{"issue":"2","key":"2025082212315359700_b30-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"586","DOI":"10.5465\/annals.2017.0099","article-title":"Topic modeling in management research: Rendering new theory from textual data","volume":"13","author":"Hannigan","year":"2019","journal-title":"Academy of Management Annals"},{"issue":"4","key":"2025082212315359700_b31-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"517","DOI":"10.1016\/0161-8938(92)90019-9","article-title":"Testing for parameter instability in linear models","volume":"14","author":"Hansen","year":"1992","journal-title":"Journal of Policy Modeling"},{"issue":"3","key":"2025082212315359700_b32-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"827","DOI":"10.25300\/MISQ\/2019\/15049","article-title":"Mobile app recommendation: An involvement-enhanced 
approach","volume":"43","author":"He","year":"2019","journal-title":"MIS Quarterly"},{"issue":"6","key":"2025082212315359700_b33-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"2833","DOI":"10.1287\/mnsc.2017.2751","article-title":"Analyst information discovery and interpretation roles: A topic modeling approach","volume":"64","author":"Huang","year":"2017","journal-title":"Management Science"},{"key":"2025082212315359700_b34-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","DOI":"10.18653\/V1\/e17-2068","article-title":"Bag of tricks for efficient text classification","author":"Joulin","year":"2017","journal-title":"Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics"},{"issue":"10","key":"2025082212315359700_b35-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1435","DOI":"10.1002\/smj.2294","article-title":"The double-edged sword of recombination in breakthrough innovation","volume":"36","author":"Kaplan","year":"2015","journal-title":"Strategic Management Journal"},{"issue":"4","key":"2025082212315359700_b36-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"871","DOI":"10.1287\/isre.2017.0750","article-title":"Extrinsic versus intrinsic rewards for contributing reviews in an online platform","volume":"29","author":"Khern-am-nuai","year":"2018","journal-title":"Information Systems Research"},{"key":"2025082212315359700_b37-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"176","DOI":"10.1007\/978-3-319-45982-0_16","article-title":"Stable topic modeling with local density regularization","volume":"9934","author":"Koltcov","year":"2016","journal-title":"Internet Science"},{"issue":"4","key":"2025082212315359700_b38-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"941","DOI":"10.2307\/25148760","article-title":"The effects of personalization and familiarity on trust 
and adoption of recommendation agents","volume":"30","author":"Komiak","year":"2006","journal-title":"MIS Quarterly"},{"issue":"1-2","key":"2025082212315359700_b39-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1002\/nav.3800020109","article-title":"The Hungarian method for the assignment problem","volume":"2","author":"Kuhn","year":"1955","journal-title":"Naval Research Logistics Quarterly"},{"issue":"1","key":"2025082212315359700_b40-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevX.5.011007","article-title":"High-reproducibility and high-accuracy method for automated topic classification","volume":"5","author":"Lancichinetti","year":"2015","journal-title":"Physical Review X"},{"issue":"4","key":"2025082212315359700_b41-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"940","DOI":"10.1287\/isre.2016.0674","article-title":"The impact of fake reviews on online visibility: A vulnerability assessment of the hotel industry","volume":"27","author":"Lappas","year":"2016","journal-title":"Information Systems Research"},{"issue":"3","key":"2025082212315359700_b42-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"529","DOI":"10.25300\/MISQ\/2016\/40.3.01","article-title":"A tool for addressing construct identity in literature reviews and meta-analyses","volume":"40","author":"Larsen","year":"2016","journal-title":"MIS Quarterly"},{"issue":"3","key":"2025082212315359700_b43-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"393","DOI":"10.1111\/poms.12805","article-title":"Sentiment manipulation in online platforms: An analysis of movie tweets","volume":"27","author":"Lee","year":"2018","journal-title":"Production and Operations 
Management"},{"issue":"4","key":"2025082212315359700_b44-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"906","DOI":"10.1287\/isre.2013.0480","article-title":"Research commentary \u2014Too big to fail: Large samples and the p-value problem","volume":"24","author":"Lin","year":"2013","journal-title":"Information Systems Research"},{"key":"2025082212315359700_b45-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"627","DOI":"10.1201\/9781420085938","article-title":"Sentiment analysis and subjectivity","volume-title":"Handbook of Natural Language Processing","author":"Liu","year":"2010"},{"issue":"3","key":"2025082212315359700_b46-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"363","DOI":"10.1287\/isre.2019.0911","article-title":"A structured analysis of unstructured big data by leveraging cloud computing","volume":"35","author":"Liu","year":"2016","journal-title":"Marketing Science"},{"issue":"3","key":"2025082212315359700_b47-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"731","DOI":"10.1287\/isre.2019.0911","article-title":"Finding useful solutions in online knowledge communities: A theory-driven design and multilevel analysis","volume":"31","author":"Liu","year":"2020","journal-title":"Information Systems Research"},{"issue":"2","key":"2025082212315359700_b48-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"124","DOI":"10.1287\/serv.2016.0126","article-title":"Understanding online hotel reviews through automated text analysis","volume":"8","author":"Mankad","year":"2016","journal-title":"Service Science"},{"key":"2025082212315359700_b49-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","DOI":"10.1145\/3239235.3267435","article-title":"Measuring LDA topic stability from clusters of replicated runs","author":"Mantyla","year":"2018","journal-title":"Proceedings of the 12th ACM\/IEEE International Symposium on Empirical 
Software Engineering and Measurement"},{"key":"2025082212315359700_b50-01_ra_10_25300_misq_2022_16957","first-page":"121","article-title":"Supervised topic models","author":"Mcauliffe","year":"2008","journal-title":"Proceedings of the 20th International Conference on Neural Information Processing Systems Proceedings"},{"issue":"2","key":"2025082212315359700_b51-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"142","DOI":"10.2307\/2986299","article-title":"Testing the constancy of regression relationships over time using least squares residuals","volume":"29","author":"McCabe","year":"1980","journal-title":"Journal of the Royal Statistical Society. Series C (Applied Statistics)"},{"key":"2025082212315359700_b52-01_ra_10_25300_misq_2022_16957","first-page":"3111","article-title":"Distributed representations of words and phrases and their compositionality","author":"Mikolov","year":"2013","journal-title":"Proceedings of the 26th International Conference on Neural Information Processing Systems Proceedings"},{"key":"2025082212315359700_b53-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"64","DOI":"10.18653\/v1\/W17-4509","article-title":"Topic model stability for hierarchical summarization","author":"Miller","year":"2017","journal-title":"Proceedings of the Workshop on New Frontiers in Summarization"},{"key":"2025082212315359700_b54-01_ra_10_25300_misq_2022_16957","first-page":"262","article-title":"Optimizing semantic coherence in topic models","author":"Mimno","year":"2011","journal-title":"Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing"},{"key":"2025082212315359700_b55-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"188","DOI":"10.18653\/v1\/D19-1018","article-title":"Justifying recommendations using distantly-labeled reviews and fine-grained aspects","author":"Ni","year":"2019","journal-title":"Proceedings of 
EMNLP-IJCNLP"},{"issue":"6242","key":"2025082212315359700_b56-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1422","DOI":"10.1126\/science.aab2374","article-title":"Promoting an open research culture","volume":"348","author":"Nosek","year":"2015","journal-title":"Science"},{"issue":"3\/4","key":"2025082212315359700_b57-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"523","DOI":"10.1093\/biomet\/42.3-4.523","article-title":"A test for a change in a parameter occurring at an unknown point","volume":"42","author":"Page","year":"1955","journal-title":"Biometrika"},{"key":"2025082212315359700_b58-01_ra_10_25300_misq_2022_16957","first-page":"2825","article-title":"Scikit-learn: Machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"The Journal of Machine Learning Research"},{"issue":"3","key":"2025082212315359700_b59-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"913","DOI":"10.1287\/isre.2020.0923","article-title":"More than words in medical question-and-answer sites: A content-context congruence perspective","volume":"31","author":"Peng","year":"2020","journal-title":"Information Systems Research"},{"key":"2025082212315359700_b60-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1532","DOI":"10.3115\/v1\/D14-1162","article-title":"Glove: Global vectors for word representation","author":"Pennington","year":"2014","journal-title":"Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing"},{"issue":"5","key":"2025082212315359700_b61-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"726","DOI":"10.1287\/mksc.2017.1048","article-title":"The effect of calorie posting regulation on consumer opinion: A flexible latent Dirichlet allocation model with informative priors","volume":"36","author":"Puranam","year":"2017","journal-title":"Marketing 
Science"},{"key":"2025082212315359700_b62-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"254","DOI":"10.18653\/v1\/W18-6532","article-title":"Generation of company descriptions using concept-to-text and text-to-text deep models: Dataset collection and systems evaluation","author":"Qader","year":"2018","journal-title":"Proceedings of the 11th International Conference on Natural Language Generation"},{"issue":"2","key":"2025082212315359700_b63-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"462","DOI":"10.1287\/isre.2020.0977","article-title":"Correcting misclassification bias in regression models with variables generated via data mining","volume":"32","author":"Qiao","year":"2021","journal-title":"Information Systems Research"},{"issue":"2","key":"2025082212315359700_b64-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"crossref","first-page":"iii","DOI":"10.25300\/MISQ\/2016\/40.2.E0","article-title":"Editor\u2019s comments: Synergies between big data and theory","volume":"40","author":"Rai","year":"2016","journal-title":"MIS Quarterly"},{"key":"2025082212315359700_b65-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1121","DOI":"10.7551\/mitpress\/7503.003.0145","article-title":"Stability of K-means clustering","author":"Rakhlin","year":"2007","journal-title":"Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference"},{"key":"2025082212315359700_b66-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","DOI":"10.13140\/2.1.2393.1847","article-title":"Software framework for topic modelling with large corpora","author":"Rehurek","year":"2010","journal-title":"Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks"},{"issue":"3","key":"2025082212315359700_b67-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1007\/s11573-018-0915-7","article-title":"Topic modeling in marketing: Recent 
advances and research opportunities","volume":"89","author":"Reisenbichler","year":"2019","journal-title":"Journal of Business Economics"},{"key":"2025082212315359700_b68-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"399","DOI":"10.1145\/2684822.2685324","article-title":"Exploring the space of topic coherence measures","author":"R\u00f6der","year":"2015","journal-title":"Proceedings of the Eighth ACM International Conference on Web Search and Data Mining"},{"issue":"4","key":"2025082212315359700_b69-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1035","DOI":"10.25300\/MISQ\/2016\/40.4.11","article-title":"Toward a better measure of business proximity: Topic modeling for industry intelligence","volume":"40","author":"Shi","year":"2016","journal-title":"MIS Quarterly"},{"issue":"3","key":"2025082212315359700_b70-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"553","DOI":"10.2307\/23042796","article-title":"Predictive analytics in information systems research","volume":"35","author":"Shmueli","year":"2011","journal-title":"MIS Quarterly"},{"key":"2025082212315359700_b71-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"439","DOI":"10.4324\/9780203936399","article-title":"Probabilistic topic models","volume-title":"Handbook of latent semantic analysis","author":"Steyvers","year":"2007"},{"issue":"6317","key":"2025082212315359700_b72-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1240","DOI":"10.1126\/science.aah6168","article-title":"Enhancing reproducibility for computational methods","volume":"354","author":"Stodden","year":"2016","journal-title":"Science"},{"key":"2025082212315359700_b73-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1508.01067","author":"Su","year":"2015","journal-title":"Topic stability over noisy 
sources"},{"issue":"4","key":"2025082212315359700_b74-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"463","DOI":"10.1509\/jmr.12.0106","article-title":"Mining marketing meaning from online chatter: Strategic brand analysis of big data using latent Dirichlet allocation","volume":"51","author":"Tirunillai","year":"2014","journal-title":"Journal of Marketing Research"},{"key":"2025082212315359700_b75-01_ra_10_25300_misq_2022_16957","article-title":"Evaluating topic models with stability","author":"Waal","year":"2008","journal-title":"Proceedings of the Nineteenth Annual Symposium of the Pattern Recognition Association of South Africa"},{"issue":"2","key":"2025082212315359700_b76-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1287\/isre.2017.0735","article-title":"Copycats vs. original mobile apps: A machine learning copycat-detection method and empirical analysis","volume":"29","author":"Wang","year":"2018","journal-title":"Information Systems Research"},{"issue":"2","key":"2025082212315359700_b77-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1509\/jmr.15.0511","article-title":"When and how managers\u2019 responses to online reviews affect subsequent reviews","volume":"55","author":"Wang","year":"2018","journal-title":"Journal of Marketing Research"},{"issue":"4","key":"2025082212315359700_b78-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2700497","article-title":"Peacock: Learning long-tail topic features for industrial applications","volume":"6","author":"Wang","year":"2015","journal-title":"ACM Transactions on Intelligent Systems and Technology"},{"key":"2025082212315359700_b79-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"725","DOI":"10.3115\/v1\/N15-1074","article-title":"Incorporating word correlation knowledge into topic 
modeling","author":"Xie","year":"2015","journal-title":"Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies"},{"issue":"1","key":"2025082212315359700_b80-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1287\/isre.2017.0727","article-title":"Mind the gap: Accounting for measurement error and misclassification in variables generated via data mining","volume":"29","author":"Yang","year":"2018","journal-title":"Information Systems Research"},{"issue":"2","key":"2025082212315359700_b81-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2954002","article-title":"The stability and usability of statistical topic models","volume":"6","author":"Yang","year":"2016","journal-title":"ACM Transactions on Interactive Intelligent Systems"},{"key":"2025082212315359700_b82-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"158","DOI":"10.1145\/2678025.2701396","article-title":"User-directed non-disruptive topic model update for effective exploration of dynamic content","author":"Yang","year":"2015","journal-title":"Proceedings of the 20th International Conference on Intelligent User Interfaces"},{"issue":"1","key":"2025082212315359700_b83-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1287\/isre.2022.1124","article-title":"sDTM: A supervised Bayesian deep topic model for text analytics","volume":"34","author":"Yang","year":"2022","journal-title":"Information Systems Research"},{"issue":"1","key":"2025082212315359700_b84-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"73","DOI":"10.25300\/MISQ\/2019\/13042","article-title":"See no evil, hear no evil? 
Dissecting the impact of online hacker forums","volume":"43","author":"Yue","year":"2019","journal-title":"MIS Quarterly"},{"key":"2025082212315359700_b85-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","first-page":"2620","DOI":"10.1145\/3366423.3380015","article-title":"Review-guided helpful answer identification in e-commerce","author":"Zhang","year":"2020","journal-title":"Proceedings of The Web Conference"},{"key":"2025082212315359700_b86-01_ra_10_25300_misq_2022_16957","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2103.00498","author":"Zhao","year":"2021","journal-title":"Topic modelling meets deep neural networks: A survey"}],"container-title":["MIS Quarterly"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/misq.umn.edu\/misq\/article-pdf\/47\/3\/923\/9114\/01_ra_10_25300_misq_2022_16957.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/misq.umn.edu\/misq\/article-pdf\/47\/3\/923\/9114\/01_ra_10_25300_misq_2022_16957.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T16:32:02Z","timestamp":1755880322000},"score":1,"resource":{"primary":{"URL":"https:\/\/misq.umn.edu\/misq\/article\/47\/3\/923\/2253\/Extracting-Actionable-Insights-from-Text-Data-A"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,1]]},"references-count":86,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2023,9,1]]},"published-print":{"date-parts":[[2023,9,1]]}},"URL":"https:\/\/doi.org\/10.25300\/misq\/2022\/16957","relation":{},"ISSN":["0276-7783","2162-9730"],"issn-type":[{"value":"0276-7783","type":"print"},{"value":"2162-9730","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,1]]}}}