{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T05:05:24Z","timestamp":1755839124438,"version":"3.41.2"},"reference-count":24,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2021,4,16]],"date-time":"2021-04-16T00:00:00Z","timestamp":1618531200000},"content-version":"vor","delay-in-days":105,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Complexity"],"published-print":{"date-parts":[[2021,1]]},"abstract":"<jats:p>In order to shorten the time for users to query news on the Internet, this paper studies and designs a network news data extraction technology, which can obtain the main news information through the extraction of news text keywords. Firstly, the TF\u2010IDF keyword extraction algorithm, TextRank keyword extraction algorithm, and LDA keyword extraction algorithm are analyzed to understand the keyword extraction process, and the TF\u2010IDF algorithm is optimized by Zipf\u2019s law. By introducing the idea of model fusion, five schemes based on waterfall fusion and parallel combination fusion are designed, and the effects of the five schemes are verified by experiments. It is found that the designed extraction technology has a good effect on network news data extraction. News keyword extraction has a great application prospect, which can provide the basis for the research fields of news key phrases, news abstracts, and so on.<\/jats:p>","DOI":"10.1155\/2021\/5529447","type":"journal-article","created":{"date-parts":[[2021,4,16]],"date-time":"2021-04-16T23:21:19Z","timestamp":1618615279000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Web News Data Extraction Technology Based on Text Keywords"],"prefix":"10.1155","volume":"2021","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7321-2288","authenticated-orcid":false,"given":"Kun","family":"Zhang","sequence":"first","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2021,4,16]]},"reference":[{"key":"e_1_2_7_1_2","doi-asserted-by":"publisher","DOI":"10.31341\/jios.44.1.8"},{"key":"e_1_2_7_2_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2020.01.072"},{"key":"e_1_2_7_3_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11257-020-09261-9"},{"key":"e_1_2_7_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cogsys.2019.12.005"},{"key":"e_1_2_7_5_2","doi-asserted-by":"publisher","DOI":"10.14716\/ijtech.v10i4.2339"},{"key":"e_1_2_7_6_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2019.09.013"},{"key":"e_1_2_7_7_2","first-page":"97","article-title":"Unsupervised keyword extraction from microblog posts via hashtags","volume":"17","author":"Li L.","year":"2018","journal-title":"Journal of Web Engineering"},{"key":"e_1_2_7_8_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2016.03.045"},{"key":"e_1_2_7_9_2","doi-asserted-by":"publisher","DOI":"10.35595\/2414-9179-2020-1-26-375-384"},{"key":"e_1_2_7_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/tvt.2019.2906799"},{"key":"e_1_2_7_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.techfore.2019.02.009"},{"key":"e_1_2_7_12_2","doi-asserted-by":"publisher","DOI":"10.18287\/1613-0073-2019-2416-219-226"},{"key":"e_1_2_7_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/tkde.2017.2690421"},{"key":"e_1_2_7_14_2","first-page":"1","article-title":"Automatic text classification using BPLion-neural network and semantic word processing","volume":"66","author":"Ranjan N. M.","year":"2017","journal-title":"Imaging Science Journal the"},{"key":"e_1_2_7_15_2","doi-asserted-by":"publisher","DOI":"10.1108\/el-09-2016-0192"},{"key":"e_1_2_7_16_2","doi-asserted-by":"publisher","DOI":"10.21184\/jkeia.2019.4.13.3.43"},{"key":"e_1_2_7_17_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-017-5513-0"},{"key":"e_1_2_7_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/s41870-019-00367-x"},{"key":"e_1_2_7_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.anbehav.2019.11.017"},{"key":"e_1_2_7_20_2","doi-asserted-by":"publisher","DOI":"10.5539\/cis.v11n4p77"},{"key":"e_1_2_7_21_2","doi-asserted-by":"publisher","DOI":"10.5194\/ica-abs-1-399-2019"},{"key":"e_1_2_7_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cose.2018.11.003"},{"key":"e_1_2_7_23_2","doi-asserted-by":"publisher","DOI":"10.1155\/2021\/6616158"},{"key":"e_1_2_7_24_2","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/8842297"}],"container-title":["Complexity"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/complexity\/2021\/5529447.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/complexity\/2021\/5529447.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/2021\/5529447","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,9]],"date-time":"2024-08-09T22:29:30Z","timestamp":1723242570000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1155\/2021\/5529447"}},"subtitle":[],"editor":[{"given":"Abd E.I.-Baset","family":"Hassanien","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,1]]},"references-count":24,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,1]]}},"alternative-id":["10.1155\/2021\/5529447"],"URL":"https:\/\/doi.org\/10.1155\/2021\/5529447","archive":["Portico"],"relation":{},"ISSN":["1076-2787","1099-0526"],"issn-type":[{"type":"print","value":"1076-2787"},{"type":"electronic","value":"1099-0526"}],"subject":[],"published":{"date-parts":[[2021,1]]},"assertion":[{"value":"2021-01-18","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-04-01","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-04-16","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"5529447"}}