{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T14:06:20Z","timestamp":1773669980655,"version":"3.50.1"},"reference-count":38,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2009,4,1]],"date-time":"2009-04-01T00:00:00Z","timestamp":1238544000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000121","name":"Division of Mathematical Sciences","doi-asserted-by":"publisher","award":["DMS-0706805"],"award-info":[{"award-number":["DMS-0706805"]}],"id":[{"id":"10.13039\/100000121","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Web"],"published-print":{"date-parts":[[2009,4]]},"abstract":"<jats:p>We propose a methodology for building a robust query classification system that can identify thousands of query classes, while dealing in real time with the query volume of a commercial Web search engine. We use a pseudo relevance feedback technique: given a query, we determine its topic by classifying the Web search results retrieved by the query. Motivated by the needs of search advertising, we primarily focus on rare queries, which are the hardest from the point of view of machine learning, yet in aggregate account for a considerable fraction of search engine traffic. Empirical evaluation confirms that our methodology yields a considerably higher classification accuracy than previously reported. 
We believe that the proposed methodology will lead to better matching of online ads to rare queries and overall to a better user experience.<\/jats:p>","DOI":"10.1145\/1513876.1513877","type":"journal-article","created":{"date-parts":[[2009,4,28]],"date-time":"2009-04-28T14:58:07Z","timestamp":1240930687000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":39,"title":["Classifying search queries using the Web as a source of knowledge"],"prefix":"10.1145","volume":"3","author":[{"given":"Evgeniy","family":"Gabrilovich","sequence":"first","affiliation":[{"name":"Yahoo Research, Santa Clara, CA"}]},{"given":"Andrei","family":"Broder","sequence":"additional","affiliation":[{"name":"Yahoo Research, Santa Clara, CA"}]},{"given":"Marcus","family":"Fontoura","sequence":"additional","affiliation":[{"name":"PUC-Rio, Rio de Janeiro, Brazil"}]},{"given":"Amruta","family":"Joshi","sequence":"additional","affiliation":[{"name":"UCLA, Los Angeles, CA"}]},{"given":"Vanja","family":"Josifovski","sequence":"additional","affiliation":[{"name":"Yahoo Research, Santa Clara, CA"}]},{"given":"Lance","family":"Riedel","sequence":"additional","affiliation":[{"name":"Yahoo Research, Santa Clara, CA"}]},{"given":"Tong","family":"Zhang","sequence":"additional","affiliation":[{"name":"Rutgers University, Piscataway, 
NJ"}]}],"member":"320","published-online":{"date-parts":[[2009,4,30]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1008992.1009048"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1076034.1076138"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2005.80"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1229179.1229183"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1458082.1458217"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1526709.1526778"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277783"},{"key":"e_1_2_1_8_1","unstructured":"Duda, R. and Hart, P. 1973. Pattern Classification and Scene Analysis. John Wiley and Sons, New York, NY."},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the Text REtrieval Conference (TREC-2). National Institute of Standards and Technology (NIST).","author":"Efthimiadis E.","unstructured":"Efthimiadis, E. and Biron, P. 1994. UCLA-Okapi at TREC-2: Query expansion experiments. In Proceedings of the Text REtrieval Conference (TREC-2). National Institute of Standards and Technology (NIST)."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/1314498.1314573"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1099554.1099703"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/956863.956925"},
{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the 4th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD). Springer-Verlag.","author":"Han E.","unstructured":"Han, E. and Karypis, G. 2000. Centroid-based document classification: Analysis and experimental results. In Proceedings of the 4th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD). Springer-Verlag."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/345508.345545"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/645326.649721"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1117454.1117468"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-30549-1_48"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1117454.1117466"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1183614.1183711"},{"key":"e_1_2_1_20_1","doi-asserted-by":"crossref","unstructured":"Manning, C. D., Raghavan, P., and Schuetze, H. 2008. Introduction to Information Retrieval. Cambridge University Press.","DOI":"10.1017\/CBO9780511809071"},{"key":"e_1_2_1_21_1","volume-title":"AAAI\/ICML Workshop on Learning for Text Categorization. 41--48","author":"McCallum A.","unstructured":"McCallum, A. and Nigam, K. 1998. A comparison of event models for naive Bayes text classification. In AAAI\/ICML Workshop on Learning for Text Categorization. 41--48."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/290941.290995"},
{"key":"e_1_2_1_23_1","unstructured":"Moran, M. and Hunt, B. 2005. Search Engine Marketing, Inc.: Driving Search Traffic to Your Company's Web Site. Prentice Hall, Upper Saddle River, NJ."},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of the Text REtrieval Conference (TREC-3). NIST","author":"Robertson S.","unstructured":"Robertson, S., Walker, S., Jones, S., Hancock-Beaulieu, M., and Gatford, M. 1995. Okapi at TREC-3. In Proceedings of the Text REtrieval Conference (TREC-3). NIST, Gaithersburg, MD."},{"key":"e_1_2_1_25_1","volume-title":"The SMART Retrieval System: Experiments in Automatic Document Processing","author":"Rocchio J.","unstructured":"Rocchio, J. 1971. Relevance feedback in information retrieval. In The SMART Retrieval System: Experiments in Automatic Document Processing. Prentice Hall, Englewood Cliffs, NJ, 313--323."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the 8th Pacific Rim International Conference on Artificial Intelligence. Springer-Verlag.","author":"Sahami M.","unstructured":"Sahami, M., Mittal, V., Baluja, S., and Rowley, H. 2004. The happy searcher: Challenges in web information retrieval. In Proceedings of the 8th Pacific Rim International Conference on Artificial Intelligence. Springer-Verlag."},
{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199006)41:4<288::AID-ASI8>3.0.CO;2-H"},{"key":"e_1_2_1_29_1","doi-asserted-by":"crossref","unstructured":"Santner, T. and Duffy, D. 1989. The Statistical Analysis of Discrete Data. Springer-Verlag.","DOI":"10.1007\/978-1-4612-1017-7"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1117454.1117467"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1165774.1165776"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148196"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1117454.1117469"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.5555\/188490.188508"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/333135.333138"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009982220290"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1011441423217"}],"container-title":["ACM Transactions on the 
Web"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1513876.1513877","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1513876.1513877","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T13:57:58Z","timestamp":1750255078000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1513876.1513877"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,4]]},"references-count":38,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2009,4]]}},"alternative-id":["10.1145\/1513876.1513877"],"URL":"https:\/\/doi.org\/10.1145\/1513876.1513877","relation":{},"ISSN":["1559-1131","1559-114X"],"issn-type":[{"value":"1559-1131","type":"print"},{"value":"1559-114X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,4]]},"assertion":[{"value":"2008-02-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2009-02-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2009-04-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}