{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,17]],"date-time":"2026-02-17T15:41:52Z","timestamp":1771342912465,"version":"3.50.1"},"reference-count":104,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2025,3,31]],"date-time":"2025-03-31T00:00:00Z","timestamp":1743379200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Slovenian Research and Innovation Agency and Ministry of Digital Transformation of Republic of Slovenia","award":["V5-2356"],"award-info":[{"award-number":["V5-2356"]}]},{"name":"Slovenian Research and Innovation Agency and Ministry of Digital Transformation of Republic of Slovenia","award":["P5-0018"],"award-info":[{"award-number":["P5-0018"]}]},{"name":"Slovenian Research and Innovation Agency","award":["V5-2356"],"award-info":[{"award-number":["V5-2356"]}]},{"name":"Slovenian Research and Innovation Agency","award":["P5-0018"],"award-info":[{"award-number":["P5-0018"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Systems"],"abstract":"<jats:p>The expectations for the (re)use of open government data (OGD) are high. However, measuring their impact remains challenging, as their effects are not solely economic but also long-term and spread across multiple domains. To accurately assess these impacts, we must first understand where they occur. This research presents a structured approach to developing a taxonomy for open government data (OGD) impact areas using machine learning-driven topic modeling and iterative taxonomy refinement. By analyzing a dataset of 697 OGD use cases, we employed various machine learning techniques\u2014including Latent Dirichlet Allocation (LDA), Non-Negative Matrix Factorization (NMF), and Hierarchical Dirichlet Process (HDP)\u2014to extract thematic categories and construct a structured taxonomy. The final taxonomy comprises seven high-level dimensions: Society, Health, Infrastructure, Education, Innovation, Governance, and Environment, each with specific subdomains and characteristics. Our findings reveal that OGD\u2019s impact extends beyond governance and transparency, influencing education, sustainability, and public services. Our approach provides a scalable and data-driven methodology for categorizing OGD impact areas compared to previous research that relies on predefined classifications or manual taxonomies. However, the study has limitations, including a relatively small dataset, brief use cases, and the inherent subjectivity of taxonomic classification, which requires further validation by domain experts. This research contributes to the systematic assessment of OGD initiatives and provides a foundational framework for policymakers and researchers aiming to maximize the benefits of open data.<\/jats:p>","DOI":"10.3390\/systems13040242","type":"journal-article","created":{"date-parts":[[2025,4,1]],"date-time":"2025-04-01T11:08:40Z","timestamp":1743505720000},"page":"242","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Open Government Data Topic Modeling and Taxonomy Development"],"prefix":"10.3390","volume":"13","author":[{"given":"Alja\u017e","family":"Ferencek","sequence":"first","affiliation":[{"name":"Faculty of Organizational Sciences, University of Maribor, 4000 Kranj, Slovenia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4608-9090","authenticated-orcid":false,"given":"Mirjana","family":"Kljaji\u0107 Bor\u0161tnar","sequence":"additional","affiliation":[{"name":"Faculty of Organizational Sciences, University of Maribor, 4000 Kranj, Slovenia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,3,31]]},"reference":[{"key":"ref_1","unstructured":"Organisation for Economic Co-operation and Development (2018). Open Government Data Report: Enhancing Policy Maturity for Sustainable Impact, OECD Publishing."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1016\/j.giq.2015.07.006","article-title":"A systematic review of open government data initiatives","volume":"32","author":"Attard","year":"2015","journal-title":"Gov. Inf. Q."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Attard, J., Orlandi, F., and Auer, S. (2016, January 5\u20138). Value Creation on Open Government Data. Proceedings of the 2016 49th Hawaii International Conference on System Sciences (HICSS), Koloa, HI, USA.","DOI":"10.1109\/HICSS.2016.326"},{"key":"ref_4","first-page":"1","article-title":"Utilization of open government data: A systematic literature review of types, conditions, effects and users","volume":"22","author":"Safarov","year":"2017","journal-title":"Inf. Polity"},{"key":"ref_5","unstructured":"Ubaldi, B. (2013). Open Government Data: Towards Empirical Analysis of Open Government Data Initiatives, OECD Publishing."},{"key":"ref_6","unstructured":"Yan, A., and Weber, N. (2018, January 25\u201328). Mining Open Government Data Used in Scientific Research. Proceedings of the 13th International Conference, iConference 2018, Transforming Digital Worlds, Lecture Notes in Computer Science, Sheffield, UK."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1016\/j.giq.2010.05.003","article-title":"Transparency and technological change: Ensuring equal and sustained public access to government information","volume":"27","author":"Jaeger","year":"2010","journal-title":"Gov. Inf. Q."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1002\/poi3.275","article-title":"Open Government Data: The OECD\u2019s Swiss army knife in the transformation of government","volume":"14","author":"Buttow","year":"2022","journal-title":"Policy Internet"},{"key":"ref_9","first-page":"300","article-title":"Moderating Effects of Governance on Open Government Data Quality and Open Government Data Utilization: Analysis Based on the Resource Complementarity Perspective","volume":"26","author":"Fan","year":"2023","journal-title":"J. Glob. Inf. Technol. Manag."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Nikiforova, A. (2021). Smarter Open Government Data for Society 5.0: Are Your Open Data Smart Enough?. Sensors, 21.","DOI":"10.3390\/s21155204"},{"key":"ref_11","unstructured":"Jiang, H., Duan, Y., and Zhu, Y. (2021, January 10\u201314). Citizens\u2019 Continuous-Use Intention to Open Government Data: Empirical Evidence from China. Proceedings of the 10th International Conference on Big Data, Big Data 2021, Virtual Event."},{"key":"ref_12","first-page":"233","article-title":"Researching the democratic impact of open government data: A systematic literature review","volume":"22","author":"Ruijer","year":"2017","journal-title":"Inf. Polity"},{"key":"ref_13","unstructured":"(2025, March 07). Open Data Institute. Available online: https:\/\/theodi.org\/."},{"key":"ref_14","unstructured":"European Commission (2025, March 07). Danish Basic Data Program. Available online: https:\/\/ec.europa.eu\/digital-building-blocks\/sites\/pages\/viewpage.action?pageId=533365971."},{"key":"ref_15","first-page":"722","article-title":"Open government data, innovation and diversification: The pursuit of economic value","volume":"18","author":"Farhadloo","year":"2024","journal-title":"Transform. Gov. People Process Policy"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Gonz\u00e1lez-Zapata, F., Rivera, A., Chauvet, L., Emilsson, C., Zahuranec, A., Young, A., and Verhulst, S. (2021). Open Data in Action: Initiatives during the Initial Stage of the COVID-19 Pandemic (Report), GovLab, Organisation for Economic Co-operation and Development (OECD).","DOI":"10.2139\/ssrn.3937613"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Roa, H.N., Loza-Aguirre, E., and Flores, P. (2019, January 24\u201326). A Survey on the Problems Affecting the Development of Open Government Data Initiatives. Proceedings of the Sixth International Conference on eDemocracy & eGovernment (ICEDEG), Quito, Ecuador.","DOI":"10.1109\/ICEDEG.2019.8734452"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1016\/j.giq.2013.04.003","article-title":"Open data policies, their implementation and impact: A framework for comparison","volume":"31","author":"Zuiderwijk","year":"2014","journal-title":"Gov. Inf. Q."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1080\/09540962.2019.1611240","article-title":"The (im)possibilities of open data?","volume":"39","author":"Jamieson","year":"2019","journal-title":"Public Money Manag."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"102667","DOI":"10.1016\/j.pacfin.2025.102667","article-title":"The impact of government open data platform construction on corporate capital market performance: Evidence from stock liquidity","volume":"90","author":"Zhang","year":"2025","journal-title":"Pac. Basin Financ. J."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Tan, L., and Pei, J. (2023). Open government data and the urban\u2013rural income divide in China: An exploration of data inequalities and their consequences. Sustainability, 15.","DOI":"10.3390\/su15139867"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Peng, X., and Xiao, D. (2024). Can open government data improve city green land-use efficiency? Evidence from China. Land, 13.","DOI":"10.3390\/land13111891"},{"key":"ref_23","unstructured":"European Commission (2024). 2024 Open Data Maturity Report, Publications Office of the European Union."},{"key":"ref_24","unstructured":"European Commission (2020). European Data Portal Report, Capgemini Invent."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"101566","DOI":"10.1016\/j.giq.2021.101566","article-title":"Open government research over a decade: A systematic review","volume":"38","author":"Tai","year":"2021","journal-title":"Gov. Inf. Q."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Ali Hassan, M., and Twinomurinzi, H. (2018, January 3\u20135). A Systematic Literature Review of Open Government Data Research: Challenges, Opportunities and Gaps. Proceedings of the 2018 Open Innovations Conference (OI), Johannesburg, South Africa.","DOI":"10.1109\/OI.2018.8535794"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhang, H., and Zheng, L. (2022, January 4\u20137). Research on Open Government Data in China:: A Critical Assessment of 587 Papers. Proceedings of the 15th International Conference on Theory and Practice of Electronic Governance, Guimaraes, Portugal.","DOI":"10.1145\/3560107.3560185"},{"key":"ref_28","unstructured":"Young, A., and Verhulst, S. (2016). The Global Impact of Open Data: Key Findings from Detailed Case Studies Around the World, O\u2019Reilly Media."},{"key":"ref_29","unstructured":"Publications Office of the European Union (2025, February 15). Use Cases. Available online: https:\/\/data.europa.eu\/en\/publications\/use-cases."},{"key":"ref_30","first-page":"230","article-title":"Classification of Open Government Data Solutions\u2019 Help: A Novel Taxonomy and Cluster Analysis","volume":"14130","author":"Crusoe","year":"2023","journal-title":"Electron. Gov."},{"key":"ref_31","unstructured":"Zeleti, A.F. (2023). Analytical Frame for Open Data Impact Assessment\u2014An Exploratory Research. SSRN."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.datak.2013.01.005","article-title":"Improving classification models with taxonomy information","volume":"86","author":"Cagliero","year":"2013","journal-title":"Data Knowl. Eng."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1080\/10919392.2015.1124720","article-title":"A taxonomy of open government data research areas and topics","volume":"26","author":"Charalabidis","year":"2016","journal-title":"J. Organ. Comput. Electron. Commer."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1108\/OIR-02-2022-0117","article-title":"Towards a taxonomy of research areas in open government data","volume":"48","author":"Mohamad","year":"2024","journal-title":"Online Inf. Rev."},{"key":"ref_35","first-page":"377","article-title":"Why open government data initiatives fail to achieve their objectives: Categorizing and prioritizing barriers through a global survey","volume":"15","author":"Zuiderwijk","year":"2021","journal-title":"Transform. Gov. People Process Policy"},{"key":"ref_36","unstructured":"Hao-En, K. (2023, January 11\u201314). Between International Practice and Academia: Review and integration of Open Government Data Benchmarks. Proceedings of the 24th Annual International Conference on Digital Government Research, Gdansk, Poland."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"101634","DOI":"10.1016\/j.tele.2021.101634","article-title":"Comparing open data benchmarks: Which metrics and methodologies determine countries\u2019 positions in the ranking lists?","volume":"62","author":"Zuiderwijk","year":"2021","journal-title":"Telemat. Inform."},{"key":"ref_38","unstructured":"Open Knowledge Foundation (2024, July 08). Global Open Data Index. Available online: http:\/\/index.okfn.org\/."},{"key":"ref_39","unstructured":"(2024, July 08). Open Data Economy. Available online: https:\/\/www.opendataeconomy.org\/."},{"key":"ref_40","unstructured":"(2024, July 08). Open Data Inventory Network. Available online: https:\/\/odin.opendatawatch.com\/."},{"key":"ref_41","unstructured":"World Bank Group (2024, July 08). Readiness Assessment Tool. Available online: https:\/\/opendatatoolkit.worldbank.org\/en\/data\/opendatatoolkit\/odra."},{"key":"ref_42","unstructured":"World Wide Web Foundation (2024, July 09). Open Data Barometer. Available online: https:\/\/opendatabarometer.org\/."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"258","DOI":"10.1080\/10580530.2012.716740","article-title":"Benefits, Adoption Barriers and Myths of Open Data and Open Government","volume":"29","author":"Janssen","year":"2012","journal-title":"Inf. Syst. Manag."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Stuermer, M., and Dapp, M. (2016, January 18\u201320). Measuring the Promise of Open Data: Development of the Impact Monitoring Framework. Proceedings of the 2016 Conference for E-Democracy and Open Government (CeDEM), Krems, Austria.","DOI":"10.1109\/CeDEM.2016.31"},{"key":"ref_45","first-page":"702","article-title":"The Sustainable Value of Open Government Data","volume":"20","author":"Jetzek","year":"2019","journal-title":"J. Assoc. Inf. Syst."},{"key":"ref_46","unstructured":"Jetzek, T., Avital, M., and Bj\u00f8rn-Andersen, N. (2013, January 15\u201318). Generating Value from Open Government Data. Proceedings of the International Conference on Information Systems, ICIS 2013, Milano, Italy."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"100","DOI":"10.4067\/S0718-18762014000200008","article-title":"Data-Driven Innovation through Open Government Data","volume":"9","author":"Jetzek","year":"2014","journal-title":"J. Theor. Appl. Electron. Commer. Res."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1080\/17538947.2016.1224938","article-title":"How to assess the success of the open data ecosystem?","volume":"10","author":"Donker","year":"2016","journal-title":"Int. J. Digit. Earth"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Kao, H. (2024, January 11\u201314). Societal, Economic, Political and Environmental: A Review of Benchmarks and AI-assisted Systematic Literature Review of Impact of Open Government Data. Proceedings of the 25th Annual International Conference on Digital Government Research, dg.o \u201924, Taipei, Taiwan.","DOI":"10.1145\/3657054.3657121"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Dawes, S., Vidiasova, L., and Trutnev, D. (2015, January 24\u201325). Approaches to assessing open government data programs: Comparison of common traits and differences at global context. Proceedings of the 2015 2nd International Conference on Electronic Governance and Open Society: Challenges in Eurasia, EGOSE \u201915, St. Petersburg, Russia.","DOI":"10.1145\/2846012.2846031"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Zheng, L., Kwok, W., Aquaro, V., Qi, X., and Lyu, W. (2020, January 23\u201325). Evaluating global open government data: Methods and status. Proceedings of the 13th International Conference on Theory and Practice of Electronic Governance, ICEGOV \u201920, Athens, Greece.","DOI":"10.1145\/3428502.3428553"},{"key":"ref_52","unstructured":"Kawashita, I.M., Baptista, A.A., and Soares, D.S. (September, January 31). An Assessment of Open Government Data Benchmark Instruments. Proceedings of the IFIP WG 8.5 International Conference EGOV-CeDEM-ePart, Linkoping, Sweden."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Osorio-Sanabria, M., Brito-Carvajal, J., Astudillo, H., Amaya-Fern\u00e0ndez, F., and Gonzalez-Zabala, M. (2020, January 22\u201324). Evaluating Open Government Data Programs: A Systematic Mapping Study. Proceedings of the 2020 Seventh International Conference on eDemocracy & eGovernment, ICEDEG, Buenos Aires, Argentina.","DOI":"10.1109\/ICEDEG48599.2020.9096755"},{"key":"ref_54","unstructured":"Ferencek, A. (2021, January 27\u201330). Impact assessment of open government data. Proceedings of the 34th Bled eConference Digital Support from Crisis to Progressive Change, Bled, Slovenia."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"664","DOI":"10.1016\/j.ijinfomgt.2017.05.009","article-title":"Liberation of public data: Exploring central themes in open government data and freedom of information research","volume":"37","year":"2017","journal-title":"Int. J. Inf. Manag."},{"key":"ref_56","first-page":"29","article-title":"Evaluating the Impact of Open Data Using Partial Least Squares Structural Equation Modeling","volume":"22","author":"Bilkova","year":"2015","journal-title":"Sci. Pap. Univ. Pardubic. Ser. D"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"62","DOI":"10.29379\/jedem.v8i1.412","article-title":"Modelling E-Government Development through the Years Using Cluster Analysis","volume":"8","author":"Machova","year":"2016","journal-title":"JeDEM"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Yerden, X., and Luna-Reyes, L. (2021, January 9\u201311). Promoting Government Impacts through Open Data: Key Influential Factors. Proceedings of the 22nd Annual International Conference on Digital Government Research, dg.o \u201921, Omaha, NE, USA.","DOI":"10.1145\/3463677.3463711"},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Buteau, S., Rao, P., Mehta, A., and Kadirvell, V. (2018, January 22\u201324). Developing a Framework to Assess Socio-Economic Value of Open Data in India. Proceedings of the 14th International Symposium on Open Collaboration, OpenSym \u201918, Paris, France.","DOI":"10.1145\/3233391.3233532"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"498","DOI":"10.3390\/fi6030498","article-title":"Assessing Social Value in Open Data Initiatives: A Framework","volume":"6","author":"Viscusi","year":"2014","journal-title":"Future Internet"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"dos Santos Brito, K., da Silva Costa, M., Cardoso Garcia, V., and de Lemos Meira, S. (2015, January 27\u201330). Assessing the benefits of open government data: The case of Meu Congresso Nacional in Brazilian elections 2014. Proceedings of the 16th Annual International Conference on Digital Government Research, dg.o \u201915, Phoenix, AZ, USA.","DOI":"10.1145\/2757401.2757422"},{"key":"ref_62","unstructured":"Tinati, R., Carr, L., Halford, S., and Pope, C. (2012, January 23\u201325). Exploring the impact of adopting open data in the UK government. Proceedings of the Digital Futures 2012, Aberdeen, UK."},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"1","DOI":"10.29379\/jedem.v6i1.288","article-title":"Investigating the Roots of Open Data\u2019s Social Impact","volume":"6","author":"Meng","year":"2014","journal-title":"JeDEM"},{"key":"ref_64","first-page":"225","article-title":"DGABr: Metric for Evaluating Brazilian Open Government Data","volume":"28","author":"Silva","year":"2018","journal-title":"Inf. Soc."},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Vracic, T., Varga, M., and Curko, K. (June, January 30). Effects and evaluation of open government data initiative in Croatia. Proceedings of the 2016 39th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO, Opatija, Croatia.","DOI":"10.1109\/MIPRO.2016.7522380"},{"key":"ref_66","unstructured":"Silva, P.N., and Pinheiro, M.M.K. (2018, January 22\u201326). Metricas para Dados Governamentais Abertos. Proceedings of the Encontro Nacional De Pesquisa Em Ci\u00eancia Da Informa\u00e7\u00e3o, ENANCIB, Sao Paulo, Brasil."},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"101526","DOI":"10.1016\/j.tele.2020.101526","article-title":"Beyond the supply side: Use and impact of municipal open data in the U.S","volume":"58","author":"Wilson","year":"2021","journal-title":"Telemat. Inform."},{"key":"ref_68","first-page":"2493","article-title":"Natural Language Processing (Almost) from Scratch","volume":"12","author":"Collobert","year":"2011","journal-title":"J. Mach. Learn."},{"key":"ref_69","doi-asserted-by":"crossref","first-page":"75","DOI":"10.2307\/25148625","article-title":"Design Science in Information Systems Research","volume":"28","author":"Hevner","year":"2004","journal-title":"MIS Q."},{"key":"ref_70","unstructured":"Azevedo, A., and Santos, M. (2008, January 24\u201326). KDD, SEMMA and CRISP-DM: A parallel overview. Proceedings of the IADIS European Conference on Data Mining, Amsterdam, The Netherlands."},{"key":"ref_71","doi-asserted-by":"crossref","first-page":"336","DOI":"10.1057\/ejis.2012.26","article-title":"A method for taxonomy development and its application in information systems","volume":"22","author":"Nickerson","year":"2013","journal-title":"Eur. J. Inf. Syst."},{"key":"ref_72","unstructured":"Miner, G.D., Elder, J.F., and Nisbet, R. (2012). Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications, Academic Press. [1st ed.]."},{"key":"ref_73","unstructured":"(2024, September 24). Textract. Available online: https:\/\/textract.readthedocs.io\/en\/stable\/."},{"key":"ref_74","unstructured":"Bird, S., Loper, E., and Klein, E. (2009). Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit, O\u2019Reilly Media. [1st ed.]."},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Barde, B.V., and Bainwad, A.M. (2017, January 15\u201316). An overview of topic modeling methods and tools. Proceedings of the 2017 International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India.","DOI":"10.1109\/ICCONS.2017.8250563"},{"key":"ref_76","unstructured":"OpenAI (2024, March 05). ChatGPT. Available online: https:\/\/openai.com\/chatgpt."},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1108\/LHTN-01-2023-0009","article-title":"Chatting about ChatGPT: How may AI and GPT impact academia and libraries?","volume":"40","author":"Lund","year":"2023","journal-title":"Libr. Hi Tech News"},{"key":"ref_78","doi-asserted-by":"crossref","unstructured":"Roumeliotis, K.I., and Tselikas, N.D. (2023). ChatGPT and Open-AI Models: A Preliminary Review. Future Internet, 15.","DOI":"10.3390\/fi15060192"},{"key":"ref_79","unstructured":"Vaswani, A., Shazeer, N.M., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., and Polosukhin, I. (2017, January 4\u20139). Attention is All you Need. Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS\u201917), Long Beach, CA, USA."},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Ambartsoumian, A., and Popowich, F. (2018, January 31). Self-attention: A better building block for sentiment analysis neural network classifiers. Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Brussels, Belgium.","DOI":"10.18653\/v1\/W18-6219"},{"key":"ref_81","unstructured":"Bahdanau, D., Cho, K., and Bengio, Y. (2015, January 7\u20139). Neural Machine Translation by Jointly Learning to Align and Translate. Proceedings of the 3rd International Conference on Learning Representations (ICLR 2015), San Diego, CA, USA."},{"key":"ref_82","unstructured":"OpenAI (2024, March 05). API Reference Introduction. Available online: https:\/\/platform.openai.com\/docs\/api-reference\/introduction."},{"key":"ref_83","doi-asserted-by":"crossref","first-page":"788","DOI":"10.1038\/44565","article-title":"Learning the parts of objects by non-negative matrix factorization","volume":"401","author":"Lee","year":"1999","journal-title":"Nature"},{"key":"ref_84","unstructured":"Carbonetto, P., Sarkar, A.K., Wang, Z., and Stephens, M. (2021). Non-negative matrix factorization algorithms greatly improve topic model fits. arXiv."},{"key":"ref_85","unstructured":"Purpura, A. (2018, January 28\u201331). Non-negative Matrix Factorization for Topic Modeling. Proceedings of the Biennial Conference on Design of Experimental Search & Information Retrieval Systems, Bertinoro, Italy."},{"key":"ref_86","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_87","doi-asserted-by":"crossref","unstructured":"Ostrowski, D. (2015, January 7\u20139). Using latent dirichlet allocation for topic modelling in twitter. Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing, Anaheim, CA, USA.","DOI":"10.1109\/ICOSC.2015.7050858"},{"key":"ref_88","doi-asserted-by":"crossref","first-page":"3367","DOI":"10.1166\/jctn.2019.8234","article-title":"A Hybrid Model for Topic Modeling Using Latent Dirichlet Allocation and Feature Selection Method","volume":"16","author":"Christy","year":"2019","journal-title":"J. Comput. Theor. Nanosci."},{"key":"ref_89","doi-asserted-by":"crossref","unstructured":"Muchene, L., and Safari, W. (2021). Two-stage topic modelling of scientific publications: A case study of University of Nairobi, Kenya. PLoS ONE, 16.","DOI":"10.1371\/journal.pone.0243208"},{"key":"ref_90","doi-asserted-by":"crossref","first-page":"1566","DOI":"10.1198\/016214506000000302","article-title":"Hierarchical Dirichlet processes","volume":"101","author":"Teh","year":"2006","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_91","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1007\/978-3-642-45185-0_39","article-title":"Incorporating Hierarchical Dirichlet Process into Tag Topic Model","volume":"8229","author":"Zhang","year":"2013","journal-title":"Chin. Lex. Semant."},{"key":"ref_92","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1109\/TPAMI.2014.2315802","article-title":"The Supervised Hierarchical Dirichlet Process","volume":"37","author":"Dai","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_93","doi-asserted-by":"crossref","unstructured":"Fan, W., and Bouguila, N. (2014, January 21\u201324). Online Data Clustering Using Variational Learning of a Hierarchical Dirichlet Process Mixture of Dirichlet Distributions. Proceedings of the 19th International Conference, DASFAA 2014, International Workshops: BDMA, DaMEN, SIM3, UnCrowd, Bali, Indonesia.","DOI":"10.1007\/978-3-662-43984-5_2"},{"key":"ref_94","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9","article-title":"Indexing by latent semantic analysis","volume":"41","author":"Deerwester","year":"1990","journal-title":"J. Am. Soc. Inf. Sci."},{"key":"ref_95","doi-asserted-by":"crossref","first-page":"1665","DOI":"10.1111\/ssqu.12528","article-title":"Topic Modeling: Latent Semantic Analysis for the Social Sciences","volume":"99","author":"Valdez","year":"2018","journal-title":"Soc. Sci. Q."},{"key":"ref_96","doi-asserted-by":"crossref","unstructured":"Gupta, I., Chatterjee, I., and Gupta, N. (2022, January 23\u201325). Latent Semantic Analysis based Real-world Application of Topic Modeling: A Review Study. Proceedings of the 2022 Second International Conference on Artificial Intelligence and Smart Energy (ICAIS), Coimbatore, India.","DOI":"10.1109\/ICAIS53314.2022.9742848"},{"key":"ref_97","first-page":"147","article-title":"A Survey of Topic Modeling in Text Mining","volume":"6","author":"Alghamdi","year":"2015","journal-title":"Int. J. Adv. Comput. Sci. Appl."},{"key":"ref_98","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1109\/52.391837","article-title":"Contemporary application-domain taxonomies","volume":"12","author":"Glass","year":"1995","journal-title":"IEEE Softw."},{"key":"ref_99","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1287\/mnsc.40.3.285","article-title":"A taxonomy of manufacturing strategies","volume":"40","author":"Miller","year":"1994","journal-title":"Manag. Sci."},{"key":"ref_100","doi-asserted-by":"crossref","unstructured":"Bailey, K. (1994). Typologies and Taxonomies\u2014An Introduction to Classification Techniques, Sage.","DOI":"10.4135\/9781412986397"},{"key":"ref_101","unstructured":"Devlin, J., Chang, M., Lee, K., and Toutanova, K. (2019, January 2\u20137). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, MN, USA."},{"key":"ref_102","doi-asserted-by":"crossref","first-page":"012120","DOI":"10.1088\/1742-6596\/978\/1\/012120","article-title":"The implementation of cosine similarity to calculate text relevance between two documents","volume":"978","author":"Gunawan","year":"2018","journal-title":"J. Phys. Conf. Ser."},{"key":"ref_103","doi-asserted-by":"crossref","unstructured":"Li, B., and Han, L. (2013, January 20\u201323). Distance weighted cosine similarity measure for text classification. Proceedings of the Intelligent Data Engineering and Automated Learning\u2013IDEAL 2013: 14th International Conference, Hefei, China.","DOI":"10.1007\/978-3-642-41278-3_74"},{"key":"ref_104","doi-asserted-by":"crossref","unstructured":"Muflikhah, L., and Baharudin, B. (2009, January 13\u201315). Document clustering using concept space and cosine similarity measurement. Proceedings of the 2009 International Conference on Computer Technology and Development, Kota Kinabalu, Malaysia.","DOI":"10.1109\/ICCTD.2009.206"}],"container-title":["Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2079-8954\/13\/4\/242\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:07:02Z","timestamp":1760029622000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2079-8954\/13\/4\/242"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,31]]},"references-count":104,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2025,4]]}},"alternative-id":["systems13040242"],"URL":"https:\/\/doi.org\/10.3390\/systems13040242","relation":{},"ISSN":["2079-8954"],"issn-type":[{"value":"2079-8954","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,3,31]]}}}