{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,25]],"date-time":"2026-04-25T14:44:28Z","timestamp":1777128268891,"version":"3.51.4"},"reference-count":53,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2023,10,20]],"date-time":"2023-10-20T00:00:00Z","timestamp":1697760000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>This literature review explores the existing work and practices in applying thematic analysis natural language processing techniques to financial data in cloud environments. This work aims to improve two of the five Vs of the big data system. We used the PRISMA approach (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) for the review. We analyzed the research papers published over the last 10 years about the topic in question using a keyword-based search and bibliometric analysis. The systematic literature review was conducted in multiple phases, and filters were applied to exclude papers based on the title and abstract initially, then based on the methodology\/conclusion, and, finally, after reading the full text. The remaining papers were then considered and are discussed here. We found that automated data discovery methods can be augmented by applying an NLP-based thematic analysis on the financial data in cloud environments. This can help identify the correct classification\/categorization and measure data quality for a sentiment analysis.<\/jats:p>","DOI":"10.3390\/info14100577","type":"journal-article","created":{"date-parts":[[2023,10,20]],"date-time":"2023-10-20T11:53:56Z","timestamp":1697802836000},"page":"577","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["Thematic Analysis of Big Data in Financial Institutions Using NLP Techniques with a Cloud Computing Perspective: A Systematic Literature Review"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9139-2120","authenticated-orcid":false,"given":"Ratnesh Kumar","family":"Sharma","sequence":"first","affiliation":[{"name":"School of Computer Science, Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney 2007, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8384-9509","authenticated-orcid":false,"given":"Gnana","family":"Bharathy","sequence":"additional","affiliation":[{"name":"School of Computer Science, Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney 2007, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Faezeh","family":"Karimi","sequence":"additional","affiliation":[{"name":"School of Computer Science, Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney 2007, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anil V.","family":"Mishra","sequence":"additional","affiliation":[{"name":"School of Business, Western Sydney University, Sydney 2150, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7745-9667","authenticated-orcid":false,"given":"Mukesh","family":"Prasad","sequence":"additional","affiliation":[{"name":"School of Computer Science, Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney 2007, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,20]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/j.dss.2016.10.006","article-title":"Sentiment analysis in financial texts","volume":"94","author":"Chan","year":"2017","journal-title":"Decis. Support. Syst."},{"key":"ref_2","unstructured":"Lima, L., Portela, F., Santos, M.F., Abelha, A., and Machado, J. (2015). Advances in Intelligent Systems and Computing, Springer."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1186\/s40537-019-0221-4","article-title":"The anatomy of the data-driven smart sustainable city: Instrumentation, datafication, computerization and related applications","volume":"6","author":"Bibri","year":"2019","journal-title":"J. Big Data"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1108\/IJAIM-12-2018-0154","article-title":"Conceptualizing big data practices","volume":"28","author":"Lin","year":"2020","journal-title":"Int. J. Account. Inf. Manag."},{"key":"ref_5","first-page":"94","article-title":"Customer relationship management and big data enabled: Personalization & customization of services","volume":"15","author":"Anshari","year":"2019","journal-title":"Appl. Comput. Inf."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.procs.2019.12.147","article-title":"Capabilities and Readiness for Big Data Analytics","volume":"164","author":"Pedro","year":"2019","journal-title":"Proc. Comput. Sci."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Bibri, S.E. (2018). Smart Sustainable Cities of the Future, Elsevier.","DOI":"10.1007\/978-3-319-73981-6"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1016\/j.scs.2017.04.012","article-title":"ICT of the new wave of computing for sustainable urban forms: Their big data and context-aware augmented typologies and design concepts","volume":"32","author":"Bibri","year":"2017","journal-title":"Sustain. Cities Soc."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1109\/TKDE.2017.2763144","article-title":"Web Media and Stock Markets: A Survey and Future Directions from a Big Data Perspective","volume":"30","author":"Li","year":"2018","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1186\/s40537-019-0206-3","article-title":"Uncertainty in big data analytics: Survey, opportunities, and challenges","volume":"6","author":"Hariri","year":"2019","journal-title":"J. Big Data"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"O\u2019Halloran, S., Maskey, S., McAllister, G., Park, D.K., and Chen, K. (2015, January 25\u201328). Big data and the regulation of financial markets. Proceedings of the 2015 IEEE\/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Paris, France.","DOI":"10.1145\/2808797.2808841"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.procs.2015.10.043","article-title":"Sentiment Analysis for Indian Stock Market Prediction Using Sensex and Nifty","volume":"70","author":"Bhardwaj","year":"2015","journal-title":"Proc. Comput. Sci."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1109\/MITP.2018.021921649","article-title":"The Use of Big Data Analytics to Predict the Foreign Exchange Rate Based on Public Media: A Machine-Learning Experiment","volume":"20","author":"Tsaih","year":"2018","journal-title":"IT Prof."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1007\/s40171-019-00228-3","article-title":"Configuration of Data Monetization: A Review of Literature with Thematic Analysis","volume":"21","author":"Hanafizadeh","year":"2020","journal-title":"Glob. J. Flex. Syst. Manag."},{"key":"ref_15","first-page":"2247","article-title":"Predicting Carpark Prices Indices in Hong Kong Using AutoML. CMES Comput","volume":"134","author":"Li","year":"2023","journal-title":"Model. Eng. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"416","DOI":"10.1016\/j.tre.2017.04.001","article-title":"Understanding big data analytics capabilities in supply chain management: Unravelling the issues, challenges and implications for practice","volume":"114","author":"Arunachalam","year":"2018","journal-title":"Transp. Res. Part. E Logist. Transp. Rev."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1080\/17439760.2016.1262613","article-title":"Thematic analysis","volume":"12","author":"Clarke","year":"2017","journal-title":"J. Posit. Psychol."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1191\/1478088706qp063oa","article-title":"Using thematic analysis in psychology","volume":"3","author":"Braun","year":"2006","journal-title":"Qual. Res. Psychol."},{"key":"ref_19","unstructured":"Boyatzis, R. (1998). Transforming Qualitative Information: Thematic Analysis and Code Development, Sage."},{"key":"ref_20","unstructured":"Braun, V., and Clarke, V. (2013). Successful Qualitative Research: A Practical Guide for Beginners. Successful Qualitative Research: A Practical Guide for Beginners, Sage."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1016\/j.healthpol.2021.01.001","article-title":"#COVID-19: An exploratory investigation of hashtag usage on Twitter","volume":"125","author":"Petersen","year":"2021","journal-title":"Health Policy"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"e190","DOI":"10.1093\/geronb\/gbaa128","article-title":"Modern Senicide in the Face of a Pandemic: An Examination of Public Discourse and Sentiment About Older Adults and COVID-19 Using Machine Learning","volume":"76","author":"Xiang","year":"2021","journal-title":"J. Gerontol. B Psychol. Sci. Soc. Sci."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"951","DOI":"10.1111\/epi.16507","article-title":"Digital conversations about suicide among teenagers and adults with epilepsy: A big-data, machine learning analysis","volume":"61","author":"Falcone","year":"2020","journal-title":"Epilepsia"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Mondal, B. (2019). Book Artificial Intelligence: State of the Art, Springer.","DOI":"10.1007\/978-3-030-32644-9_32"},{"key":"ref_25","unstructured":"Van Banerveld, M., Le-Khac, N.A., and Kechadi, M.T. (2014). Future Data and Security Engineering, Springer."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"e030355","DOI":"10.1136\/bmjopen-2019-030355","article-title":"Studying expressions of loneliness in individuals using twitter: An observational study","volume":"9","author":"Guntuku","year":"2019","journal-title":"BMJ Open"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"e23957","DOI":"10.2196\/23957","article-title":"Public opinions and concerns regarding the Canadian prime minister\u2019s daily COVID-19 briefing: Longitudinal study of youtube comments using machine learning techniques","volume":"23","author":"Zheng","year":"2021","journal-title":"J. Med. Internet"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1080\/15228835.2019.1616350","article-title":"A computational social science perspective on qualitative data exploration: Using topic models for the descriptive analysis of social media data","volume":"38","author":"Rodriguez","year":"2020","journal-title":"J. Technol. Hum. Serv."},{"key":"ref_29","unstructured":"P\u00e9rez, V., Caro, R., and Rua Vieites, A. Unraveling the Complexities of Climate Change and Environment Migration: A Transformers-Based Topic Modelling Approach; 2023, preprint version."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1177\/15586898211021196","article-title":"Accelerating Mixed Methods Research With Natural Language Processing of Big Text Data","volume":"15","author":"Chang","year":"2021","journal-title":"J. Mix. Methods Res."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1766","DOI":"10.3758\/s13428-019-01202-8","article-title":"Analyzing social media data: A mixed-methods framework combining computational and qualitative text analysis","volume":"51","author":"Andreotta","year":"2019","journal-title":"Behav. Res. Method."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"120180","DOI":"10.1016\/j.techfore.2020.120180","article-title":"Reshaping competitive advantages with analytics capabilities in service systems","volume":"159","author":"Akter","year":"2020","journal-title":"Technol. Forecast. Soc. Chang."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"230","DOI":"10.1016\/j.scs.2017.12.034","article-title":"The IoT for smart sustainable cities of the future: An analytical framework for sensor-based big data applications for environmental sustainability","volume":"38","author":"Bibri","year":"2018","journal-title":"Sustain. Cities Soc."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.ijinfomgt.2019.01.020","article-title":"Analytics-based decision-making for service systems: A qualitative study and agenda for future research","volume":"48","author":"Akter","year":"2019","journal-title":"Int. J. Inf. Manag."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Che, S., Zhu, W., and Li, X. (2020). Anticipating Corporate Financial Performance from CEO Letters Utilizing Sentiment Analysis. Math. Probl. Eng., 4.","DOI":"10.1155\/2020\/5609272"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Mbah, R.B.K., Rege, M., and Misra, B. (2019, January 14\u201317). Using spark and scala for discovering latent trends in job markets. Proceedings of the ICCDA 2019: Proceedings of the 2019 3rd International Conference on Compute and Data Analysis, New York, NY, USA.","DOI":"10.1145\/3314545.3314566"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Gu, Y., Storey, V.C., and Woo, C.C. (2015). Conceptual Modeling for Financial Investment with Text Mining, Springer. Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-319-25264-3_39"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1015","DOI":"10.1007\/s10270-012-0290-8","article-title":"Strategic business modeling: Representation and reasoning","volume":"13","author":"Horkoff","year":"2014","journal-title":"Softw. Syst. Model."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"103965","DOI":"10.1016\/j.compedu.2020.103965","article-title":"Improving the quality of teaching by utilising written student feedback: A streamlined process","volume":"157","author":"Hujala","year":"2020","journal-title":"Comput. Educ."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"i130","DOI":"10.1093\/llc\/fqv052","article-title":"Exploratory thematic analysis for digitized archival collections","volume":"30","author":"Klein","year":"2015","journal-title":"Digit. Scholarsh. Humanit."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"e10262","DOI":"10.2196\/10262","article-title":"How twitter can support the HIV\/AIDS response to achieve the 2030 eradication goal: In-depth thematic analysis of world AIDS day tweets","volume":"4","author":"Odlum","year":"2018","journal-title":"JMIR Public Health Surv."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Tang, S., Liu, Q., and Tan, W.A. (2019). Intention Classification based on Transfer Learning: A Case Study on Insurance Data, Springer.","DOI":"10.1007\/978-3-030-37429-7_36"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1016\/j.procs.2019.01.212","article-title":"A novel stock evaluation index based on public opinion analysis","volume":"147","author":"Ni","year":"2019","journal-title":"Proc. Comput. Sci."},{"key":"ref_44","first-page":"692","article-title":"Modeling public mood and emotion: Blog and news sentiment and socio-economic phenomena. Future Gen","volume":"96","author":"Chen","year":"2019","journal-title":"Comput. Syst."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Esichaikul, V., and Phumdontree, C. (2018, January 18\u201320). Sentiment analysis of Thai financial news. Proceedings of the ICSEB\u201918: Proceedings of the 2018 2nd International Conference on Software and e-Business, New York, NY, USA.","DOI":"10.1145\/3301761.3301773"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1016\/j.eswa.2017.11.010","article-title":"Mining social lending motivations for loan project recommendations","volume":"111","author":"Yan","year":"2018","journal-title":"Expert. Syst. Appl."},{"key":"ref_47","unstructured":"Konstantinidis, A., Scalzodees, B., Calvi, G.G., and Mandic, D.P. (2018). Text Mining\u2014A Key Lynchpin in the Investment Process: A Survey, IOS Press. Series Frontiers in Artificial Intelligence and Applications, Applications of Intelligent Systems."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"e37350","DOI":"10.2196\/37350","article-title":"Integrating Natural Language Processing and Interpretive Thematic Analyses to Gain Human-Centered Design Insights on HIV Mobile Health: Proof-of-Concept Analysis","volume":"9","author":"Skeen","year":"2022","journal-title":"JMIR Hum. Factors"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Sallam, M. (2023). ChatGPT Utility in Health Care Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns. Healthcare, 11.","DOI":"10.3390\/healthcare11060887"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Watkins, R. (2023). Guidance for researchers and peer-reviewers on the ethical use of Large Language Models (LLMs) in scientific research workflows. AI Ethics, 6\u20137.","DOI":"10.1007\/s43681-023-00294-5"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Sallam, M. (2023). The Utility of ChatGPT as an Example of Large Language Models in Healthcare Education, Research and Practice: Systematic Review on the Future Perspectives and Potential Limitations. medRxiv.","DOI":"10.1101\/2023.02.19.23286155"},{"key":"ref_52","first-page":"7596094","article-title":"Financial Big Data Management and Control and Artificial Intelligence Analysis Method Based on Data Mining Technology","volume":"2022","author":"Yang","year":"2022","journal-title":"Wirel. Commun. Mob. Comput."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Suciu, G., Suciu, V., Halunga, S., and Fratu, O. (2015). Book Big Data, Internet of Things and Cloud Convergence for E-Health Applications, Springer.","DOI":"10.1007\/978-3-319-16486-1_15"}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/14\/10\/577\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:09:12Z","timestamp":1760130552000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/14\/10\/577"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,20]]},"references-count":53,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2023,10]]}},"alternative-id":["info14100577"],"URL":"https:\/\/doi.org\/10.3390\/info14100577","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,20]]}}}