{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,12]],"date-time":"2026-05-12T14:13:28Z","timestamp":1778595208541,"version":"3.51.4"},"reference-count":57,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2023,11,27]],"date-time":"2023-11-27T00:00:00Z","timestamp":1701043200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Res. Metr. Anal."],"abstract":"<jats:p>The problem of mental health in academia is increasingly discussed in literature, and to extract meaningful insights from the growing amount of scientific publications, text mining approaches are used. In this study, BERTopic, an advanced method of topic modeling, was applied to abstracts of 2,846 PubMed articles on depression, anxiety, and burnout in academia published in years 1975\u20132023. BERTopic is a modular technique comprising a text embedding method, a dimensionality reduction procedure, a clustering algorithm, and a weighing scheme for topic representation. A model was selected based on the proportion of outliers, the topic interpretability considerations, topic coherence and topic diversity metrics, and the inevitable subjectivity of the criteria was discussed. The selected model with 27 topics was explored and visualized. The topics evolved differently with time: research papers on students' pandemic-related anxiety and medical residents' burnout peaked in recent years, while publications on psychometric research or internet-related problems are yet to be presented more amply. The study demonstrates the use of BERTopic for analyzing literature on mental health in academia and sheds light on areas in the field to be addressed by further research.<\/jats:p>","DOI":"10.3389\/frma.2023.1271385","type":"journal-article","created":{"date-parts":[[2023,11,27]],"date-time":"2023-11-27T06:57:35Z","timestamp":1701068255000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":16,"title":["Depression, anxiety, and burnout in academia: topic modeling of PubMed abstracts"],"prefix":"10.3389","volume":"8","author":[{"given":"Olga","family":"Lezhnina","sequence":"first","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2023,11,27]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"86","DOI":"10.1002\/mpr.1481","article-title":"Text mining applications in psychiatry: a systematic literature review: text mining applications in Psychiatry","volume":"25","author":"Abbe","year":"2016","journal-title":"Int. J. Methods Psychiatr. Res."},{"key":"B2","doi-asserted-by":"publisher","first-page":"74","DOI":"10.1016\/j.infsof.2018.02.005","article-title":"What is wrong with topic modeling? And how to fix it using search-based software engineering","volume":"98","author":"Agrawal","year":"2018","journal-title":"Inform. Softw. Technol."},{"key":"B3","doi-asserted-by":"publisher","first-page":"42","DOI":"10.3389\/frai.2020.00042","article-title":"Using topic modeling methods for short-text data: a comparative analysis","volume":"3","author":"Albalawi","year":"2020","journal-title":"Front. Artif. Intellig."},{"key":"B4","doi-asserted-by":"publisher","DOI":"10.14569\/IJACSA.2015.060121","article-title":"A survey of topic modeling in text mining","author":"Alghamdi","year":"2015","journal-title":"Int. J. Adv. Comp. Sci. Appl"},{"key":"B5","doi-asserted-by":"publisher","first-page":"4745","DOI":"10.1038\/s41598-023-31852-w","article-title":"Academic burnout among master and doctoral students during the COVID-19 pandemic","volume":"13","author":"Andrade","year":"2023","journal-title":"Sci. Rep."},{"key":"B6","year":"2013","journal-title":"Diagnostic and Statistical Manual of Mental Disorders (5th ed.)."},{"key":"B7","unstructured":"What are Anxiety Disorders?2023"},{"key":"B8","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1007\/s44186-023-00143-3","article-title":"Factors that impact burnout and psychological wellbeing in Australian postgraduate medical trainees: a systematic review","volume":"2","author":"Balhatchet","year":"2023","journal-title":"Global Surg. Educ."},{"key":"B9","doi-asserted-by":"publisher","first-page":"138","DOI":"10.1007\/s10489-019-01438-z","article-title":"Aggregated topic models for increasing social media topic coherence","volume":"50","author":"Blair","year":"2020","journal-title":"Appl. Intellig."},{"key":"B10","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944937","article-title":"Latent dirichlet allocation","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res"},{"key":"B11","doi-asserted-by":"publisher","first-page":"1152535","DOI":"10.3389\/frma.2023.1152535","article-title":"Multi-task learning to detect suicide ideation and mental disorders among social media users","volume":"8","author":"Buddhitha","year":"2023","journal-title":"Front. Res. Metrics Analyt."},{"key":"B12","volume-title":"Researcher Mental Health: From Raising Awareness to Providing Evidence of Best Practices","author":"Cahill","year":"2023"},{"key":"B13","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-37456-2_14","article-title":"\u201cDensity-based clustering based on hierarchical density estimates,\u201d","volume-title":"Advances in Knowledge Discovery and Data Mining. PAKDD 2013. Lecture Notes in Computer Science","author":"Campello","year":"2013"},{"key":"B14","doi-asserted-by":"publisher","first-page":"286","DOI":"10.3390\/bs12080286","article-title":"Talking about health: a topic analysis of narratives from individuals with schizophrenia and other serious mental illnesses","volume":"12","author":"Cowan","year":"2022","journal-title":"Behav. Sci."},{"key":"B15","doi-asserted-by":"publisher","first-page":"453","DOI":"10.1176\/appi.ajp.2013.13030325","article-title":"Comprehensive meta-analysis of excess mortality in depression in the general community versus patients with specific illnesses","volume":"171","author":"Cuijpers","year":"2014","journal-title":"Am. J. Psychiatry"},{"key":"B16","doi-asserted-by":"publisher","first-page":"571","DOI":"10.1007\/s10734-020-00500-x","article-title":"Mapping the scattered field of research on higher education. A correlated topic model of 17,000 articles, 1991\u20132018","volume":"80","author":"Daenekindt","year":"2020","journal-title":"Higher Educ."},{"key":"B17","unstructured":"\u201cExperiments on Generalizability of BERTopic on Multi-Domain Short Text (arXiv:2212.08459),\u201d\n            de GrootM.\n            AliannejadiM.\n            HaasM. R.\n          arXiv2022"},{"key":"B18","first-page":"4171","article-title":"\u201cBERT: pre-training of deep bidirectional transformers for language understanding,\u201d","volume-title":"Proceedings of NAACL-HLT 2019","author":"Devlin","year":"2019"},{"key":"B19","doi-asserted-by":"publisher","first-page":"354","DOI":"10.1097\/00001888-200604000-00009","article-title":"Systematic review of depression, anxiety, and other indicators of psychological distress among U.S. and Canadian Medical Students: Academic","volume":"81","author":"Dyrbye","year":"2006","journal-title":"Medicine"},{"key":"B20","doi-asserted-by":"publisher","first-page":"886498","DOI":"10.3389\/fsoc.2022.886498","article-title":"A topic modeling comparison between LDA, NMF, Top2Vec, and BERTopic to demystify Twitter posts","volume":"7","author":"Egger","year":"2022","journal-title":"Front. Sociol."},{"key":"B21","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1162\/tacl_a_00078","article-title":"Anchored correlation explanation: topic modeling with minimal domain knowledge","volume":"5","author":"Gallagher","year":"2017","journal-title":"Trans. Assoc. Computat. Linguist."},{"key":"B22","doi-asserted-by":"publisher","first-page":"893845","DOI":"10.3389\/fpubh.2022.893845","article-title":"Internet addiction, symptoms of anxiety, depressive symptoms, stress among higher education students during the COVID-19 pandemic","volume":"10","author":"Gavurova","year":"2022","journal-title":"Front. Public Health"},{"key":"B23","doi-asserted-by":"publisher","first-page":"967","DOI":"10.1111\/rssa.12276","article-title":"Beyond subjective and objective in statistics","volume":"180","author":"Gelman","year":"2017","journal-title":"J. Royal Statist. Soc."},{"key":"B24","unstructured":"\u201cBERTopic: Neural topic modeling with a class-based TF-IDF procedure (arXiv:2203.05794),\u201d\n            GrootendorstM.\n          arXiv2022"},{"key":"B25","unstructured":"\u201cBERTopic,\u201d\n            GrootendorstM.\n          GitHub2023"},{"key":"B26","unstructured":"GuthrieS.\n            LichtenC. A.\n            van BelleJ.\n            BallS.\n            KnackA.\n            HofmanJ.\n          10.7249\/RR202229607246Understanding Mental Health in the Research Environment: a Rapid Evidence Assessment. Santa Monica, CA: RAND Corporation2017"},{"key":"B27","doi-asserted-by":"publisher","first-page":"586","DOI":"10.5465\/annals.2017.0099","article-title":"Topic modeling in management research: rendering new theory from textual data","volume":"13","author":"Hannigan","year":"2019","journal-title":"Acad. Manage. Annals"},{"key":"B28","doi-asserted-by":"publisher","first-page":"244","DOI":"10.4088\/PCC.v03n0609","article-title":"The comorbidity of major depression and anxiety disorders: recognition and management in primary care","volume":"3","author":"Hirschfeld","year":"2001","journal-title":"Prim. Care Companion J. Clin. Psychiatry"},{"key":"B29","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1016\/j.jpsychires.2012.11.015","article-title":"A systematic review of studies of depression prevalence in university students","volume":"47","author":"Ibrahim","year":"2013","journal-title":"J. Psychiatr. Res."},{"key":"B30","doi-asserted-by":"publisher","first-page":"284","DOI":"10.3389\/fpsyg.2019.00284","article-title":"The relationship between burnout, depression, and anxiety: a systematic review and meta-analysis","volume":"10","author":"Koutsimani","year":"2019","journal-title":"Front. Psychol."},{"key":"B31","doi-asserted-by":"publisher","first-page":"173","DOI":"10.3390\/data7120173","article-title":"Digital twins: a systematic literature review based on data analysis and topic modeling","volume":"7","author":"Kukushkin","year":"2022","journal-title":"Data"},{"key":"B32","doi-asserted-by":"publisher","first-page":"6023","DOI":"10.32604\/cmc.2023.039104","article-title":"ESG discourse analysis through BERTopic: comparing news articles and academic papers","volume":"75","author":"Lee","year":"2023","journal-title":"Comp. Mat. Continua"},{"key":"B33","doi-asserted-by":"publisher","first-page":"397","DOI":"10.1146\/annurev.psych.52.1.397","article-title":"Job burnout","volume":"52","author":"Maslach","year":"2001","journal-title":"Annu. Rev. Psychol"},{"key":"B34","doi-asserted-by":"publisher","first-page":"11","DOI":"10.21105\/joss.00205","article-title":"Hdbscan: Hierarchical density based clustering","volume":"2","author":"McInnes","year":"2017","journal-title":"J. Open Source Softw."},{"key":"B35","doi-asserted-by":"publisher","first-page":"861","DOI":"10.21105\/joss.00861","article-title":"UMAP: uniform manifold approximation and projection","volume":"3","author":"McInnes","year":"2018","journal-title":"J. Open Source Softw."},{"key":"B36","doi-asserted-by":"publisher","first-page":"393","DOI":"10.2147\/AMEP.S302897","article-title":"Depression and anxiety among medical students: a brief overview","volume":"12","author":"Mirza","year":"2021","journal-title":"Adv. Med. Educ. Pract."},{"key":"B37","first-page":"70","article-title":"\u201cExploring Online Depression Forums via Text Mining: A Comparison of Reddit and a Curated Online Forum,\u201d","volume-title":"Proceedings of the 5th Social Media Mining for Health Applications (#SMM4H) Workshop & Shared Task","author":"Mo\u00dfburger","year":"2020"},{"key":"B38","doi-asserted-by":"publisher","first-page":"1279","DOI":"10.1113\/JP279386","article-title":"Mental health disorders: prevalent but widely ignored in academia?","volume":"598","author":"M\u00fcller","year":"2020","journal-title":"J. Physiol."},{"key":"B39","doi-asserted-by":"publisher","first-page":"e0283095","DOI":"10.1371\/journal.pone.0283095","article-title":"Classification and analysis of text transcription from Thai depression assessment tasks among patients with depression","volume":"18","author":"Munthuli","year":"2023","journal-title":"PLoS ONE"},{"key":"B40","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1089\/cpb.2008.0321","article-title":"Factors influencing internet addiction in a sample of freshmen university students in China","volume":"12","author":"Ni","year":"2009","journal-title":"Cyberpsychol. Behav."},{"key":"B41","first-page":"1","article-title":"\u201cBERTopic modeling with P53 in ovarian cancer,\u201d","volume-title":"2022 5th Information Technology for Education and Development (ITED)","author":"Oveh","year":"2022"},{"key":"B42","doi-asserted-by":"publisher","first-page":"759802","DOI":"10.3389\/fpubh.2021.759802","article-title":"Leveraging text mining approach to identify what people want to know about mental disorders from online inquiry platforms","volume":"9","author":"Park","year":"2021","journal-title":"Front. Public Health"},{"key":"B43","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1007\/s11573-018-0915-7","article-title":"Topic modeling in marketing: Recent advances and research opportunities","volume":"89","author":"Reisenbichler","year":"2019","journal-title":"J. Busin. Econ."},{"key":"B44","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1145\/2684822.2685324","article-title":"\u201cExploring the Space of Topic Coherence Measures,\u201d","volume-title":"Proceedings of the Eighth ACM International Conference on Web Search and Data Mining","author":"R\u00f6der","year":"2015"},{"key":"B45","doi-asserted-by":"publisher","first-page":"1133484","DOI":"10.3389\/fpubh.2023.1133484","article-title":"The relationship between physician burnout and depression, anxiety, suicidality and substance abuse: a mixed methods systematic review","volume":"11","author":"Ryan","year":"2023","journal-title":"Front. Public Health"},{"key":"B46","doi-asserted-by":"publisher","first-page":"959","DOI":"10.1080\/14783363.2022.2139674","article-title":"Clustering abstracts from the literature on Quality Management (1980\u20132020)","volume":"34","author":"S\u00e1nchez-Franco","year":"2023","journal-title":"Total Qual. Management & Busin. Excel"},{"key":"B47","article-title":"Words alone: dismantling topic models in the humanities","author":"Schmidt","year":"2013","journal-title":"J. Digit. Humanit"},{"key":"B48","doi-asserted-by":"publisher","first-page":"100097","DOI":"10.1016\/j.jadr.2021.100097","article-title":"Fear of COVID-19, depression, anxiety, and their association with Internet addiction disorder in a sample of Italian students","volume":"4","author":"Servidio","year":"2021","journal-title":"J. Affect. Disord. Rep."},{"key":"B49","doi-asserted-by":"publisher","first-page":"1426","DOI":"10.1017\/S0033291719000151","article-title":"Machine learning in mental health: a scoping review of methods and applications","volume":"49","author":"Shatte","year":"2019","journal-title":"Psychol. Med."},{"key":"B50","doi-asserted-by":"publisher","first-page":"100553","DOI":"10.1016\/j.invent.2022.100553","article-title":"Internet-delivered cognitive behavioral interventions to reduce elevated stress: a systematic review and meta-analysis","volume":"29","author":"Sv\u00e4rdman","year":"2022","journal-title":"Intern. Intervent."},{"key":"B51","first-page":"384","article-title":"\u201cWord representations: A simple and general method for semi-supervised learning,\u201d","author":"Turian","year":"2010","journal-title":"Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL '10"},{"key":"B52","first-page":"1","article-title":"\u201cAn exploratory analysis of GSDMM and BERTopic on short text topic modeling,\u201d","author":"Udupa","year":"2022","journal-title":"2022 Fourth International Conference on Cognitive Computing and Information Processing (CCIP)"},{"key":"B53","author":"Vogt","year":"2023","journal-title":"Towards a Rosetta Stone for (meta)Data: Learning From Natural Language to Improve Semantic and Cognitive Interoperability"},{"key":"B54","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1016\/j.jad.2017.06.006","article-title":"Mapping the relationship between anxiety, anhedonia, and depression","volume":"221","author":"Winer","year":"2017","journal-title":"J. Affect. Disord."},{"key":"B55","unstructured":"Depression [Fact Sheet]2023"},{"key":"B56","doi-asserted-by":"publisher","first-page":"109442","DOI":"10.1016\/j.celrep.2021.109442","article-title":"Dimensionality reduction by UMAP reinforces sample heterogeneity analysis in bulk transcriptomic data","volume":"36","author":"Yang","year":"2021","journal-title":"Cell Rep."},{"key":"B57","doi-asserted-by":"publisher","first-page":"46","DOI":"10.1038\/s41746-022-00589-7","article-title":"Natural language processing applied to mental illness detection: a narrative review","volume":"5","author":"Zhang","year":"2022","journal-title":"NPJ Digital Med."}],"container-title":["Frontiers in Research Metrics and Analytics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/frma.2023.1271385\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,27]],"date-time":"2023-11-27T06:57:44Z","timestamp":1701068264000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/frma.2023.1271385\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,27]]},"references-count":57,"alternative-id":["10.3389\/frma.2023.1271385"],"URL":"https:\/\/doi.org\/10.3389\/frma.2023.1271385","relation":{},"ISSN":["2504-0537"],"issn-type":[{"value":"2504-0537","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,11,27]]},"article-number":"1271385"}}