{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,28]],"date-time":"2026-03-28T17:34:19Z","timestamp":1774719259703,"version":"3.50.1"},"reference-count":40,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2022,9,27]],"date-time":"2022-09-27T00:00:00Z","timestamp":1664236800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>According to the natural language perspective, UGC has been significantly used for the screening of key nodes in knowledge discovery and strategic investment. This article presents a new research framework that is proposed for the decomposition of UGC knowledge feature extraction into topic recognition and language analysis, mainly. For visual analysis of associated topics, the LDAvis approach is utilized. Then, risk features of UGC knowledge are assigned according to language attribution. Based on previous studies, the risk attribute lexicon is further updated by judging semantic distance through word vectors. This research uses platform data and individual stock data as samples for subject recognition and knowledge feature extraction. A regression model is constructed based on the panel data after natural language processing to verify the feedback effect of the market at strategic risk measurement. It can be found from the conclusion that the change in market behavior is regular and correlates with the change in the UGC risk degree of individual stocks. The purpose of this paper is to examine the value of UGC in investment decision-making from the perspective of knowledge discovery. The research content can provide a reference for data mining, fintech, strategic risk monitoring, and other related works.<\/jats:p>","DOI":"10.3390\/info13100454","type":"journal-article","created":{"date-parts":[[2022,9,28]],"date-time":"2022-09-28T01:51:49Z","timestamp":1664329909000},"page":"454","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["UGC Knowledge Features and Their Influences on the Stock Market: An Empirical Study Based on Topic Modeling"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4581-0002","authenticated-orcid":false,"given":"Ning","family":"Li","sequence":"first","affiliation":[{"name":"School of Communication, Fujian Normal University, Fuzhou 350007, China"}]},{"given":"Kefu","family":"Chen","sequence":"additional","affiliation":[{"name":"Management School, Xiamen University, Xiamen 361000, China"}]},{"given":"Huixin","family":"He","sequence":"additional","affiliation":[{"name":"Computer Science and Engineering School, Huaqiao University, Xiamen 361021, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,9,27]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1007\/s00530-008-0138-9","article-title":"Production and multi-channel distribution of news","volume":"14","author":"Mannens","year":"2008","journal-title":"Multimed. Syst."},{"key":"ref_2","first-page":"53","article-title":"Tracing digital news networks towards an integrated framework of the dynamics of news production, circulation and use","volume":"3","author":"Domingo","year":"2015","journal-title":"Digit. J."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1108\/OIR-06-2020-0258","article-title":"The \u201cso-called\u201d UGC: An updated definition of user-generated content in the age of social media","volume":"46","year":"2022","journal-title":"Online Inf. Rev."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"35","DOI":"10.4018\/IJSWIS.2019070103","article-title":"Social Presence and User-Generated Content of Social Media in China","volume":"15","author":"Sun","year":"2019","journal-title":"Int. J. Semant. Web Inf. Syst."},{"key":"ref_5","first-page":"101","article-title":"The Impact of Graphic and Text Matching on Consumer Perceived Usefulness of User Generated Content","volume":"31","author":"Wang","year":"2018","journal-title":"Manag. Sci."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3072591","article-title":"Learning to Align Comments to News Topics","volume":"36","author":"Hou","year":"2017","journal-title":"ACM Trans. Inf. Syst."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/j.is.2018.02.011","article-title":"Investment recommendation by discovering high-quality opinions in investor based social networks","volume":"78","author":"Tu","year":"2018","journal-title":"Inf. Syst."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1016\/j.chaos.2018.12.007","article-title":"Investor behavior, information disclosure strategy and counterparty credit risk contagion","volume":"119","author":"Wang","year":"2019","journal-title":"Chaos Solitons Fractals"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"18569","DOI":"10.1007\/s11042-016-4159-7","article-title":"Stock prediction using deep learning","volume":"76","author":"Singh","year":"2017","journal-title":"Multimed. Tools Appl."},{"key":"ref_10","first-page":"993","article-title":"Latent Dirichlet Allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"113073","DOI":"10.1016\/j.eswa.2019.113073","article-title":"Drawing openness to experience from user generated contents: The An interpretable data\u2014Driven topic modeling approach","volume":"144","author":"Zhang","year":"2020","journal-title":"J. Expert Syst. Appl."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"103070","DOI":"10.1016\/j.im.2018.05.003","article-title":"Business analytics for strategic management: Identifying and assessing corporate challenges via topic modeling","volume":"57","author":"Prollochs","year":"2020","journal-title":"Inf. Manag."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1509\/jm.16.0044","article-title":"Harvesting brand information from social tags","volume":"81","author":"Nam","year":"2017","journal-title":"J. Mark."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/15252019.2008.10722137","article-title":"Note from special issue editors","volume":"8","author":"Krishnamurthy","year":"2008","journal-title":"J. Interact. Advert."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Hofmann, T. (1999, January 15\u201319). Probabilistic Latent Semantic Indexing. Proceedings of the Sigir\u201999: Proceedings of 22nd International Conference on Research and Development in Information Retrieval, Berkeley, CA, USA.","DOI":"10.1145\/312624.312649"},{"key":"ref_16","first-page":"112","article-title":"Analysis of Topic Extraction Effect of Scientific Literature Based on LDA Topic Model in Different Corpus","volume":"60","author":"Peng","year":"2016","journal-title":"Libr. Inf. Serv."},{"key":"ref_17","first-page":"73","article-title":"Research on Core Technology Theme Recognition Method Based on Chunk-LDAVIS","volume":"63","author":"Liu","year":"2019","journal-title":"Libr. Inf. Sci."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"6386","DOI":"10.1109\/ACCESS.2018.2887314","article-title":"Mining dynamics of research topics based on the combined LDA and Wordnet","volume":"7","author":"Li","year":"2019","journal-title":"IEEE Access"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1007\/s10844-019-00586-5","article-title":"Topic-sentiment evolution over time: A manifold learning-based model for online news","volume":"55","author":"Xu","year":"2020","journal-title":"J. Intell. Inf. Syst."},{"key":"ref_20","first-page":"73","article-title":"Learning author-topic models from text corpora","volume":"28","author":"Chemudugunta","year":"2010","journal-title":"ACM Trans. Inf. Syst."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"718","DOI":"10.1109\/TNNLS.2016.2626379","article-title":"Identifying objective and subjective words via topic modeling","volume":"29","author":"Wang","year":"2018","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_22","first-page":"12","article-title":"From Data Mining to Knowledge Discovery: On Overview","volume":"1","author":"Fayyad","year":"1996","journal-title":"Adv. Knowl. Discov. Data Min."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"698","DOI":"10.1016\/j.jbusres.2007.01.026","article-title":"Knowledge transfer between and within alliance partners: Private versus collective benefits of social capital","volume":"60","author":"Walter","year":"2007","journal-title":"J. Bus. Res."},{"key":"ref_24","unstructured":"Rennolls, K., and Society, I.C. (2005, January 22\u201326). An intelligent framework (O-SS-E) for data mining, knowledge discovery and business intelligence. Proceedings of the 16th International Workshop on Database and Expert Systems Applications (DEXA\u201905), Copenhagen, Denmark."},{"key":"ref_25","unstructured":"Cazzella, S., and Dragone, L. (2004, January 18\u201321). The Role of Domain Knowledge in KDD-Based Strategic Marketing Applications. Proceedings of the 8th World Multi-Conference on Systemics, Cybernetics and Informatics, Orlando, FL, USA."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1162\/coli.2006.32.1.13","article-title":"Evaluating WordNet-based measures of lexical semantic relatedness","volume":"32","author":"Budanitsky","year":"2006","journal-title":"Comput. Lin-Guistics"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1145\/219717.219748","article-title":"Wordnet\u2014A lexical database for English","volume":"38","author":"Miller","year":"1995","journal-title":"Commun. ACM"},{"key":"ref_28","first-page":"147","article-title":"Using corpus statistics and WordNet relations for sense identification","volume":"24","author":"Leacock","year":"1998","journal-title":"Comput.-Tional Linguist."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Schmitt, X., Kubler, S., Robert, J., Papadakis, M., and LeTraon, Y. (2019, January 22\u201325). A Replicable Comparison Study of NER Software: StanfordNLP, NLTK, OpenNLP, SpaCy, Gate. Proceedings of the 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS), Social Networks Analysis, Management and Security (SNAMS), Granada, Spain.","DOI":"10.1109\/SNAMS.2019.8931850"},{"key":"ref_30","unstructured":"Omran, F.A., and Treude, C. (2017, January 20\u201321). Choosing an NLP library for analyzing software documentation: A systematic literature review and a series of experiments. Proceedings of the 14th International Conference on Mining Software Repositories, Buenos Aires, Argentina."},{"key":"ref_31","first-page":"78","article-title":"Evolution of Prophet Prediction-Correction Topic Strength Model\u2014An Empirical Study in Stem Cell Field","volume":"64","author":"Zhang","year":"2020","journal-title":"Libr. Inf. Serv."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Arun, R., Suresh, V., Madhavan, C.V., and Narasimha Murthy, M.N. (2010, January 21\u201324). On finding the natural number of topics with latent Dirichlet allocation: Some observations. Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, Hyderabad, India.","DOI":"10.1007\/978-3-642-13657-3_43"},{"key":"ref_33","unstructured":"Mimno, D., Wallach, H.M., Talley, E., and McCallum, A. (2011, January 27\u201331). Optimizing semantic coherence in topic models. Proceedings of the Conference on Empirical Methods in Natural Language Processing, Scotland, UK."},{"key":"ref_34","unstructured":"Zhang, H. (2019). Research on Technology Prediction Method from the Perspective of Data Fusion, Jilin University."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"1578","DOI":"10.1080\/1369183X.2020.1762552","article-title":"Language has a home: How case officers make use of language analysis in asylum decisions","volume":"47","author":"Hedlund","year":"2020","journal-title":"J. Ethn. Migr. Stud."},{"key":"ref_36","first-page":"81","article-title":"Social network analysis of informal academic communication in digital age: A case study of the small wood vermin life science forum","volume":"33","author":"Zou","year":"2015","journal-title":"Inf. Sci."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"999","DOI":"10.1080\/14697688.2012.672762","article-title":"Predicting abnormal returns from news using text classification","volume":"15","author":"Luss","year":"2015","journal-title":"Quant. Financ."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.dss.2009.05.009","article-title":"Are online auction markets efficient? An empirical study of market liquidity and abnormal returns","volume":"48","author":"Kauffman","year":"2009","journal-title":"Decis. Support Syst."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1080\/10696679.2017.1389239","article-title":"User-generated content as word-of-mouth","volume":"26","author":"Ramirez","year":"2018","journal-title":"J. Mark. Theory Pract."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1287\/mksc.1110.0682","article-title":"Does chatter really matter? Dynamics of user-generated content and stock performance","volume":"31","author":"Tirunillai","year":"2012","journal-title":"Mark. Sci."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/13\/10\/454\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:40:16Z","timestamp":1760143216000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/13\/10\/454"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,27]]},"references-count":40,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2022,10]]}},"alternative-id":["info13100454"],"URL":"https:\/\/doi.org\/10.3390\/info13100454","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,9,27]]}}}