{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T21:54:24Z","timestamp":1740174864943,"version":"3.37.3"},"reference-count":24,"publisher":"Wiley","license":[{"start":{"date-parts":[[2021,2,11]],"date-time":"2021-02-11T00:00:00Z","timestamp":1613001600000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Shenzhen Technology Projects","award":["JCYJ20160226201347750","ZDSYS201707280904031","51577043"],"award-info":[{"award-number":["JCYJ20160226201347750","ZDSYS201707280904031","51577043"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["JCYJ20160226201347750","ZDSYS201707280904031","51577043"],"award-info":[{"award-number":["JCYJ20160226201347750","ZDSYS201707280904031","51577043"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Mobile Information Systems"],"published-print":{"date-parts":[[2021,2,11]]},"abstract":"<jats:p>False information on the Internet is being heralded as serious social harm to our society. To recognize false text information, in this paper, an effective method for mining text features is proposed in the field of false drug advertisements. Firstly, the data of false drug advertisements and real drug advertisements were collected from the official websites to build a database of false and real drug advertisements. Secondly, by performing feature extraction on the text of drug advertisements, this work built a characteristic matrix based on the effective features and assigned positive or negative labels to the feature vector of the matrix according to whether it is a fake medical advertisement or not. Thirdly, this study trained and tested several different classifiers, selected the classification model with the best performance in identifying false drug advertisements, and found the key characteristics that can determine the classification. Finally, the model with the best performance was used to predict new false drug advertisements collected from Sina Weibo. In the case of identifying false drug advertisements, the classification effect of the support vector machine (SVM) classifier established on the feature set after feature selection was the most effective. The findings of this study can provide an effective method for the government to identify and combat false advertisements. This study has a certain reference significance in demonstrating the use of text data mining technology to identify and detect information fraud behavior.<\/jats:p>","DOI":"10.1155\/2021\/4206424","type":"journal-article","created":{"date-parts":[[2021,2,12]],"date-time":"2021-02-12T03:21:07Z","timestamp":1613100067000},"page":"1-13","source":"Crossref","is-referenced-by-count":3,"title":["Data Mining Technology Application in False Text Information Recognition"],"prefix":"10.1155","volume":"2021","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4789-6049","authenticated-orcid":true,"given":"Jie","family":"Wan","sequence":"first","affiliation":[{"name":"Fundamental Space Science Research Center, Harbin Institute of Technology, Harbin, Heilongjiang Province 150001, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xue","family":"Cao","sequence":"additional","affiliation":[{"name":"School of Economics and Management, Southeast University, Nanjing, Jiangsu Province 211189, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6136-7741","authenticated-orcid":true,"given":"Kun","family":"Yao","sequence":"additional","affiliation":[{"name":"Department of Mechanical Engineering & Automation, Harbin Institute of Technology, Shenzhen Graduate School, Shenzhen, Guangdong Province 518055, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9447-3161","authenticated-orcid":true,"given":"Donghui","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Economics and Management, Southeast University, Nanjing, Jiangsu Province 211189, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"E.","family":"Peng","sequence":"additional","affiliation":[{"name":"Fundamental Space Science Research Center, Harbin Institute of Technology, Harbin, Heilongjiang Province 150001, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3619-4380","authenticated-orcid":true,"given":"Yong","family":"Cao","sequence":"additional","affiliation":[{"name":"Department of Mechanical Engineering & Automation, Harbin Institute of Technology, Shenzhen Graduate School, Shenzhen, Guangdong Province 518055, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","reference":[{"key":"1","first-page":"265","article-title":"The false advertising of specialty medical products under the lanham act","volume":"44","author":"T. C. Morrison","year":"1989","journal-title":"Food Drug and Cosmetic Law Journal"},{"key":"2","doi-asserted-by":"publisher","DOI":"10.1016\/s0140-6736(16)30797-8"},{"key":"3","doi-asserted-by":"publisher","DOI":"10.1038\/nature14539"},{"key":"4","doi-asserted-by":"publisher","DOI":"10.1016\/j.media.2017.07.005"},{"key":"5","doi-asserted-by":"publisher","DOI":"10.1007\/s11192-016-2144-6"},{"key":"6","doi-asserted-by":"publisher","DOI":"10.1142\/s0218001416500117"},{"issue":"4","key":"7","doi-asserted-by":"crossref","first-page":"1937","DOI":"10.1007\/s10586-016-0673-7","article-title":"Energy efficiency evaluation method based on multi-model fusion strategy","volume":"19","author":"F. Meng","year":"2016","journal-title":"Cluster Computing"},{"key":"8","doi-asserted-by":"publisher","DOI":"10.1111\/insr.12247"},{"key":"9","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2017.08.043"},{"key":"10","doi-asserted-by":"publisher","DOI":"10.1016\/j.jocs.2018.06.009"},{"volume-title":"Discriminant Analysis and Statistical Pattern Recognition","year":"2004","author":"G. McLachlan","key":"11"},{"issue":"4","key":"12","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1016\/S0957-4174(97)00045-6","article-title":"Application of neural networks to detection of medical fraud","volume":"13","author":"H. He","year":"1996","journal-title":"Expert Systems with Applications"},{"key":"13","doi-asserted-by":"publisher","DOI":"10.1016\/0169-7439(95)80010-7"},{"key":"14","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1984.10477109"},{"key":"15","doi-asserted-by":"publisher","DOI":"10.1016\/s0950-7051(00)00050-2"},{"first-page":"409","article-title":"Discovery of fraud rules for telecommunications\u2014challenges and solutions","author":"S. Rosset","key":"16"},{"key":"17","doi-asserted-by":"publisher","DOI":"10.1007\/s11606-013-2604-0"},{"key":"18","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"},{"key":"19","doi-asserted-by":"publisher","DOI":"10.1145\/278459.258537"},{"key":"20","doi-asserted-by":"publisher","DOI":"10.1145\/1007730.1007741"},{"first-page":"280","article-title":"Term-frequency based feature selection methods for text categorization","author":"Y. Xu","key":"21"},{"issue":"3","key":"22","first-page":"235","article-title":"An automatic recognition method of journal impact factor manipulation","volume":"37","author":"D. H. Yang","year":"2015","journal-title":"Journal of Information Science"},{"key":"23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.neunet.2014.05.003","article-title":"Noise model based \u03bd-support vector regression with its application to short-term wind speed forecasting","volume":"57","author":"Q. Hu","year":"2014","journal-title":"Neural Networks"},{"key":"24","doi-asserted-by":"crossref","DOI":"10.1533\/9780857099440","volume-title":"Machine Learning and Data Mining","author":"I. Kononenko","year":"2007"}],"container-title":["Mobile Information Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/misy\/2021\/4206424.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/misy\/2021\/4206424.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/misy\/2021\/4206424.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,2,12]],"date-time":"2021-02-12T03:21:18Z","timestamp":1613100078000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.hindawi.com\/journals\/misy\/2021\/4206424\/"}},"subtitle":[],"editor":[{"given":"Alessandro","family":"Bazzi","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2021,2,11]]},"references-count":24,"alternative-id":["4206424","4206424"],"URL":"https:\/\/doi.org\/10.1155\/2021\/4206424","relation":{},"ISSN":["1875-905X","1574-017X"],"issn-type":[{"type":"electronic","value":"1875-905X"},{"type":"print","value":"1574-017X"}],"subject":[],"published":{"date-parts":[[2021,2,11]]}}}