{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T01:55:05Z","timestamp":1775181305128,"version":"3.50.1"},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2011,1,30]],"date-time":"2011-01-30T00:00:00Z","timestamp":1296345600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"},{"start":{"date-parts":[[2011,1,30]],"date-time":"2011-01-30T00:00:00Z","timestamp":1296345600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Inf Retrieval"],"published-print":{"date-parts":[[2011,10]]},"DOI":"10.1007\/s10791-011-9162-z","type":"journal-article","created":{"date-parts":[[2011,1,29]],"date-time":"2011-01-29T03:46:17Z","timestamp":1296272777000},"page":"441-465","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":167,"title":["Efficient and effective spam filtering and re-ranking for large web datasets"],"prefix":"10.1007","volume":"14","author":[{"given":"Gordon V.","family":"Cormack","sequence":"first","affiliation":[]},{"given":"Mark D.","family":"Smucker","sequence":"additional","affiliation":[]},{"given":"Charles L. A.","family":"Clarke","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,1,30]]},"reference":[{"issue":"1","key":"9162_CR1","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/1326561.1326563","volume":"2","author":"L. Becchetti","year":"2008","unstructured":"Becchetti, L., Castillo, C., Donato, D., Baeza-Yates, R., & Leonardi, S. (2008). Link analysis for web spam detection. ACM Transactions on the Web, 2(1), 1\u201342.","journal-title":"ACM Transactions on the Web"},{"key":"9162_CR2","unstructured":"B\u00fcttcher, S., Clarke, C. L. A., & Soboroff, I. (2006). The TREC 2006 terabyte track. In Proceedings of the 15th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR3","doi-asserted-by":"crossref","unstructured":"Carterette, B., Pavlu, V., Kanoulas, E., Aslam, J. A., & Allan, J. (2008). Evaluation over thousands of queries. In Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval (pp. 651\u2013658), Singapore.","DOI":"10.1145\/1390334.1390445"},{"key":"9162_CR4","unstructured":"Chandar, P., Kailasam, A., Muppaneni, D., Lekha, T., & Carterette, B. (2009). Ad hoc and diversity retrieval at the University of Delaware. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR5","unstructured":"Clarke, C. L. A., Craswell, N., & Soboroff, I. (2009). Overview of the TREC 2009 web track. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR6","unstructured":"Clarke, C. L. A., Craswell, N., Soboroff, I., Cormack, G. V. (2010). Overview of the TREC 2010 web track. In Proceedings of the 19th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR7","unstructured":"Cormack, G. (2007). Content-based web spam detection. In Proceedings of the 3rd international workshop on adversarial information retrieval on the web (AIRWeb)."},{"key":"9162_CR8","unstructured":"Cormack, G. V., Lynam, T. R. (2005). TREC 2005 spam track overview. In Proceedings of the 14th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR9","unstructured":"Cormack, G. V., & Mojdeh, M. (2009). Machine learning for information retrieval: TREC 2009 web, relevance feedback and legal tracks. In: Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR10","unstructured":"Cormack, G. V. (2007). University of Waterloo participation in the TREC 2007 spam track. In Proceedings of the 16th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR11","unstructured":"Dou, Z., Cheny, K., Song, R., Ma, Y., Shi, S., & Wen, J.-R. (2009). Microsoft research Asia at the web track of TREC 2009. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR12","unstructured":"Goodman, J., & Yih, W. T. (2006). Online discriminative spam filter training. In Proceedings of the 3rd conference on email and anti-spam (CEAS)."},{"key":"9162_CR13","unstructured":"Guan, F., Yu, X., Peng, Z., Xu, H., Liu, Y., Song, L., & Cheng, X. (2009). ICTNET at   web track 2009 ad-hoc task. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"issue":"10","key":"9162_CR14","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1109\/MC.2005.352","volume":"38","author":"Z. Gy\u00f6ngyi","year":"2005","unstructured":"Gy\u00f6ngyi, Z., Garcia-Molina, H. (2005). Spam: It\u2019s not just for inboxes anymore. IEEE Computer, 38(10):28\u201334.","journal-title":"IEEE Computer"},{"key":"9162_CR15","unstructured":"Hauff, C., & Hiemstra, D. (2009). University of Twente@TREC 2009: Indexing half a billion web pages. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"issue":"1","key":"9162_CR16","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1023\/A:1022904715765","volume":"6","author":"D. Hawking","year":"2003","unstructured":"Hawking, D., & Robertson, S. (2003). On collection size and retrieval effectiveness. Information Retrieval, 6(1):99\u2013105.","journal-title":"Information Retrieval"},{"key":"9162_CR17","unstructured":"He, J., Balog, K., Hofmann, K., Meij, E., de Rijke, M., Tsagkias, M., & Weerkamp, W. (2009). Heuristic ranking and diversification of web documents. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR18","unstructured":"Jones, T., Hawking, D., & Sankaranarayana, R. (2007). A framework for measuring the impact of web spam. In Proceedings of the 12th Australasian document computing symposium (ADCS), Melbourne, Australia."},{"key":"9162_CR19","doi-asserted-by":"crossref","unstructured":"Jones, T., Sankaranarayana, R., Hawking, D., & Craswell, N. (2009). Nullification test collections for web spam and SEO. In Proceedings of the 5th international workshop on adversarial information retrieval on the web (AIRWeb) (pp. 53\u201360), Madrid, Spain.","DOI":"10.1145\/1531914.1531927"},{"key":"9162_CR20","unstructured":"Kaptein, R., Koolen, M., & Kamps, J. (2009). Result diversity and entity ranking experiments: Anchors, links, text and Wikipedia. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR22","unstructured":"Lin, J., Metzler, D., Elsayed, T., & Wang, L. (2009). Of ivory and smurfs: Loxodontan MapReduce experiments for web search. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR23","doi-asserted-by":"crossref","unstructured":"Lynam, T. R., & Cormack, G. V. (2006). On-line spam filter fusion. In Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval (pp. 123\u2013130), Seattle, Washington.","DOI":"10.1145\/1148170.1148195"},{"key":"9162_CR24","unstructured":"Macdonald, C., Ounis, I., & Soboroff, I. (2007). Overview of the TREC 2007 blog track. In Proceedings of the 16th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR25","doi-asserted-by":"crossref","unstructured":"Macdonald, C., Ounis, I., & Soboroff, I. (2009). Is spam an issue for opinionated blog post search? In Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval (pp. 710\u2013711), Boston.","DOI":"10.1145\/1571941.1572090"},{"key":"9162_CR26","unstructured":"McCreadie, R., Macdonald, C., Ounis, I., Peng, J., & Santos, R. L. T. (2009). University of Glasgow at TREC 2009: Experiments with Terrier. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR27","doi-asserted-by":"crossref","unstructured":"Richardson, M., Prakash, A., & Brill, E. (2006). Beyond PageRank: Machine learning for static ranking. In Proceedings of the 15th international world wide web conference (pp. 707\u2013715). Edinburgh, Scotland","DOI":"10.1145\/1135777.1135881"},{"issue":"5","key":"9162_CR28","doi-asserted-by":"publisher","first-page":"447","DOI":"10.1007\/s10791-008-9059-7","volume":"11","author":"T. Sakai","year":"2008","unstructured":"Sakai, T., & Kando, N. (2008). On information retrieval metrics designed for evaluation with incomplete relevance assessments. Information Retrieval, 11(5), 447\u2013470.","journal-title":"Information Retrieval"},{"key":"9162_CR29","unstructured":"Smucker, M. D., Clarke, C. L. A., & Cormack, G. V. (2009). Experiments with ClueWeb09: Relevance feedback and web tracks. In Proceedings of the 18th text retrieval conference, Gaithersburg, Maryland."},{"key":"9162_CR30","unstructured":"Strohman, T., Metzler, D., Turtle, H., & Croft W. B. (2005). Indri: A language-model based search engine for complex queries (extended version). Technical Report IR-407, CIIR, CS Dept., U. of Mass. Amherst."},{"key":"9162_CR31","unstructured":"Tomlinson, S., Oard, D. W., Baron, J. R., & Thompson, P. (2007). Overview of the TREC 2007 legal track. In Proceedings of the 16th text retrieval conference, Gaithersburg, Maryland."}],"container-title":["Information Retrieval"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10791-011-9162-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10791-011-9162-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10791-011-9162-z","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10791-011-9162-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,1,2]],"date-time":"2024-01-02T15:00:08Z","timestamp":1704207608000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10791-011-9162-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,1,30]]},"references-count":30,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2011,10]]}},"alternative-id":["9162"],"URL":"https:\/\/doi.org\/10.1007\/s10791-011-9162-z","relation":{},"ISSN":["1386-4564","1573-7659"],"issn-type":[{"value":"1386-4564","type":"print"},{"value":"1573-7659","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,1,30]]},"assertion":[{"value":"28 April 2010","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 January 2011","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 January 2011","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"This content has been made available to all.","name":"free","label":"Free to read"}]}}