{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T16:13:24Z","timestamp":1781108004917,"version":"3.54.1"},"reference-count":18,"publisher":"IGI Global Scientific Publishing","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,4,1]]},"abstract":"<p>Search engines use several criteria to rank webpages and choose which pages to display when answering a request. Those criteria can be separated into two notions, relevance and popularity. The notion of popularity is calculated by the search engine and is related to links made to the webpage. Malicious webmasters want to artificially increase their popularity; the techniques they use are often referred to as Webspam. It can take many forms and is in constant evolution, but Webspam usually consists of building a specific dedicated structure of spam pages around a given target page. It is important for a search engine to address the issue of Webspam; otherwise, it cannot provide users with fair and reliable results. In this paper, the authors propose a technique to identify Webspam through the frequency language associated with random walks among those dedicated structures. The authors identify the language by calculating the frequency of appearance of k-grams on random walks launched from every node.<\/p>","DOI":"10.4018\/joci.2011040103","type":"journal-article","created":{"date-parts":[[2011,10,19]],"date-time":"2011-10-19T12:45:32Z","timestamp":1319028332000},"page":"36-48","source":"Crossref","is-referenced-by-count":0,"title":["Detecting Webspam Beneficiaries Using Information Collected by the Random Surfer"],"prefix":"10.4018","volume":"2","author":[{"given":"Thomas","family":"Largillier","sequence":"first","affiliation":[{"name":"LRI, Universit\u00e9 Paris-Sud, F-91405, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sylvain","family":"Peyronnet","sequence":"additional","affiliation":[{"name":"LRI, Universit\u00e9 Paris-Sud, F-91405, France"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"2432","reference":[{"key":"joci.2011040103-0","unstructured":"Alpert, J., & Hajaj, N. (2008). We knew the web was big. Retrieved from http:\/\/googleblog.blogspot.com\/2008\/07\/we-knew-web-was-big.html"},{"key":"joci.2011040103-1","doi-asserted-by":"crossref","unstructured":"Andersen, R., Borgs, C., Chayes, J., Hopcroft, J., Jain, K., Mirrokni, V., et al. (2008). Robust pagerank and locally computable spam detection features. In Proceedings of the 4th International Workshop on Adversarial Information Retrieval on the Web (pp. 69-76).","DOI":"10.1145\/1451983.1452000"},{"key":"joci.2011040103-2","unstructured":"Benczur, A. A., Csalogany, K., Sarlos, T., Uher, M., & Uher, M. (2005). Spamrank - fully automatic link spam detection. In Proceedings of the First International Workshop on Adversarial Information Retrieval on the Web."},{"key":"joci.2011040103-3","doi-asserted-by":"crossref","unstructured":"Boldi, P., & Vigna, S. (2003). The webgraph framework I: Compression techniques. In Proceedings of the Thirteenth International World Wide Web Conference (pp. 595-601).","DOI":"10.1145\/988672.988752"},{"key":"joci.2011040103-4","doi-asserted-by":"crossref","unstructured":"Chung, Y. J., Toyoda, M., & Kitsuregawa, M. (2009). A study of link farm distribution and evolution using a time series of web snapshots. In Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web (pp. 9-16).","DOI":"10.1145\/1531914.1531917"},{"key":"joci.2011040103-5","doi-asserted-by":"publisher","DOI":"10.1016\/j.laa.2008.01.023"},{"key":"joci.2011040103-6","doi-asserted-by":"crossref","unstructured":"Fischer, E., Magniez, F., & de Rougemont, M. (2006). Approximate satisfiability and equivalence. In Proceedings of the 21st IEEE Annual Symposium on Logic in Computer Science (pp. 421-430).","DOI":"10.1109\/LICS.2006.12"},{"key":"joci.2011040103-7","unstructured":"Gyongyi, Z., Berkhin, P., Garcia-Molina, H., & Pedersen, J. (2006). Link spam detection based on mass estimation. In Proceedings of the 32nd International Conference on Very Large Data Bases (pp. 439-450)."},{"key":"joci.2011040103-8","unstructured":"Gyongyi, Z., & Garcia-Molina, H. (2005). Web spam taxonomy. In Proceedings of the First International Workshop on Adversarial Information Retrieval."},{"key":"joci.2011040103-9","doi-asserted-by":"crossref","unstructured":"Gyongyi, Z., Garcia-Molina, H., & Pedersen, J. (2004). Combating webspam with trustrank. In Proceedings of the Thirtieth International Conference on Very Large Data Bases (pp. 576-587).","DOI":"10.1016\/B978-012088469-8\/50052-8"},{"issue":"5","key":"joci.2011040103-10","first-page":"604","article-title":"Authoritative sources in a hyperlinked environment.","volume":"46","author":"J. M.Kleinberg","year":"1999","journal-title":"Communications of the ACM"},{"key":"joci.2011040103-11","unstructured":"Krishnan, V., & Raj, R. (2006). Web spam detection with anti-trust rank. In Proceedings of AirWEB (pp. 37-40)."},{"key":"joci.2011040103-12","unstructured":"Laboratory for Web Algorithmics. (2002). Introduction to law. Retrieved from http:\/\/law.dsi.unimi.it\/"},{"key":"joci.2011040103-13","doi-asserted-by":"crossref","unstructured":"Largillier, T., & Peyronnet, S. (2010). Lightweight clustering methods for webspam demotion. In Proceedings of the Ninth International Conference on Web Intelligence.","DOI":"10.1109\/WI-IAT.2010.243"},{"key":"joci.2011040103-14","doi-asserted-by":"crossref","unstructured":"Ntoulas, A., Najork, M., Manasse, M., & Fetterly, D. (2006). Detecting spam web pages through content analysis. In Proceedings of the 15th International Conference on World Wide Web (pp. 83-92).","DOI":"10.1145\/1135777.1135794"},{"key":"joci.2011040103-15","unstructured":"Page, L., Brin, S., Motwani, R., & Winograd, T. (1999). The pagerank citation ranking: Bringing order to the web (Tech. Rep. No. SIDL-WP-1999-0120). Stanford, CA: Stanford University."},{"key":"joci.2011040103-16","doi-asserted-by":"crossref","unstructured":"Wu, B., Goel, V., & Davison, B. D. (2006). Topical trustrank: Using topicality to combat web spam. In Proceedings of the 15th International Conference on World Wide Web (pp. 63-72).","DOI":"10.1145\/1135777.1135792"},{"key":"joci.2011040103-17","unstructured":"Yahoo. Research. (2007). Datasets for research on web spam detection. Retrieved from http:\/\/barcelona.research.yahoo.net\/webspam\/datasets\/"}],"container-title":["International Journal of Organizational and Collective Intelligence"],"original-title":[],"language":"ng","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=54070","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,6,1]],"date-time":"2022-06-01T21:31:45Z","timestamp":1654119105000},"score":1,"resource":{"primary":{"URL":"https:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/joci.2011040103"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2011,4,1]]},"references-count":18,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2011,4]]}},"URL":"https:\/\/doi.org\/10.4018\/joci.2011040103","relation":{},"ISSN":["1947-9344","1947-9352"],"issn-type":[{"value":"1947-9344","type":"print"},{"value":"1947-9352","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,4,1]]}}}