{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T22:42:03Z","timestamp":1776811323902,"version":"3.51.2"},"reference-count":14,"publisher":"European Society of Computational Methods in Sciences and Engineering","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JCM"],"published-print":{"date-parts":[[2021,1,19]]},"abstract":"<jats:p>In order to solve the problem of low efficiency of traditional theme crawlers in searching theme pages, the crawling algorithm based on Context Graph was discussed. After analyzing the working principle and process of the algorithm, we introduced a new algorithm idea named feature selection algorithm. This new algorithm improved the original TF-IDF formula accordingly and solved the algorithm problems.<\/jats:p>","DOI":"10.3233\/jcm-194169","type":"journal-article","created":{"date-parts":[[2020,2,18]],"date-time":"2020-02-18T12:49:51Z","timestamp":1582030191000},"page":"1043-1051","source":"Crossref","is-referenced-by-count":0,"title":["Improved algorithm of Context Graph based on feature selection"],"prefix":"10.66113","volume":"20","author":[{"given":"Wei","family":"Liu","sequence":"first","affiliation":[]},{"given":"Jian","family":"Zhao","sequence":"additional","affiliation":[]},{"given":"Yongji","family":"Yang","sequence":"additional","affiliation":[]}],"member":"55691","reference":[{"key":"10.3233\/JCM-194169_ref2","unstructured":"J. Cheng, Design and implementation of metaserch engine based on suffix tree clustering algorithm, Jilin University, 2017."},{"issue":"2","key":"10.3233\/JCM-194169_ref3","first-page":"45","article-title":"Overview of the subject web crawler research","author":"Yu","year":"2015","journal-title":"Computer Engineering and Science"},{"issue":"8","key":"10.3233\/JCM-194169_ref4","first-page":"1721","article-title":"An optimized path focusing crawler crawling strategy","volume":"8","author":"Xu","year":"2016","journal-title":"Minicomputer System"},{"issue":"1","key":"10.3233\/JCM-194169_ref5","first-page":"17","article-title":"The design and implementation of the customized theme focused crawler","volume":"36","author":"Min","year":"2015","journal-title":"Computer Engineering and Design"},{"issue":"38","key":"10.3233\/JCM-194169_ref6","first-page":"195","article-title":"Fusion link structure of the subject crawler aalgorithm","volume":"2","author":"Liu","year":"2017","journal-title":"Journal of Huaqiao University (Natural Science Edition)"},{"key":"10.3233\/JCM-194169_ref7","unstructured":"H. Wu, Research on key technology of vertical search engine and distributed implementation, Southeast University, 2017."},{"key":"10.3233\/JCM-194169_ref8","unstructured":"H. Wu, Binary network community partition based on PageRank algorithm, Shenyang University of Aeronautics and Astronautics, 2016."},{"key":"10.3233\/JCM-194169_ref9","unstructured":"B. Novak, A survey of focused web crawling algorithms, Proceedings of SIKDD at Multiconference IS. Slovenia: ACM Press, 2004, pp. 55\u201358."},{"key":"10.3233\/JCM-194169_ref10","doi-asserted-by":"crossref","unstructured":"R. Chen and C.B. Desai, An enhanced web robot for the CINDI system, Proceedings of the C3S2E Conference. Canadia: ACM Press, 2008, pp. 133\u2013135.","DOI":"10.1145\/1370256.1370278"},{"issue":"4","key":"10.3233\/JCM-194169_ref12","doi-asserted-by":"crossref","first-page":"450","DOI":"10.3844\/jcssp.2010.450.456","article-title":"An adaptive updating topic specific web system using T-graph","volume":"6","author":"Patel","year":"2010","journal-title":"Journal of Computer Science"},{"key":"10.3233\/JCM-194169_ref13","unstructured":"F. Bussche and K. Weiand, Not so creepy crawler: Easy crawler generation with standard XML queries, Proceeding of the 19th international conference on World Wide Web, Raleigh, North Carolina, USA, 2010, pp. 1305\u20131308."},{"issue":"9","key":"10.3233\/JCM-194169_ref14","first-page":"149","article-title":"The optimized background value of the GM(1,1) model which based on non-homogenous index series","author":"Li","year":"2010","journal-title":"Journal of Systems Science and Information"},{"key":"10.3233\/JCM-194169_ref15","first-page":"68","article-title":"Entity linking for queries by searching wikipedia sentences","author":"Tan","year":"2017","journal-title":"EMNLP"},{"key":"10.3233\/JCM-194169_ref16","doi-asserted-by":"crossref","first-page":"15174","DOI":"10.1109\/ACCESS.2017.2731761","article-title":"Entity search based on the representation learning model with different embedding strategies","volume":"5","author":"Shijia","year":"2017","journal-title":"IEEE Access"}],"container-title":["Journal of Computational Methods in Sciences and Engineering"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/JCM-194169","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T22:06:02Z","timestamp":1776809162000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/JCM-194169"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,19]]},"references-count":14,"journal-issue":{"issue":"4"},"URL":"https:\/\/doi.org\/10.3233\/jcm-194169","relation":{},"ISSN":["1472-7978","1875-8983"],"issn-type":[{"value":"1472-7978","type":"print"},{"value":"1875-8983","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,19]]}}}