{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:29:16Z","timestamp":1750307356320,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":19,"publisher":"ACM","license":[{"start":{"date-parts":[[2011,6,12]],"date-time":"2011-06-12T00:00:00Z","timestamp":1307836800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2011,6,12]]},"DOI":"10.1145\/1989323.1989406","type":"proceedings-article","created":{"date-parts":[[2011,6,14]],"date-time":"2011-06-14T14:45:32Z","timestamp":1308062732000},"page":"793-804","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Mining a search engine's corpus"],"prefix":"10.1145","author":[{"given":"Mingyang","family":"Zhang","sequence":"first","affiliation":[{"name":"George Washington University, Washington, DC, USA"}]},{"given":"Nan","family":"Zhang","sequence":"additional","affiliation":[{"name":"George Washington University, Washington, DC, USA"}]},{"given":"Gautam","family":"Das","sequence":"additional","affiliation":[{"name":"University of Texas at Arlington, Arlington, TX, USA"}]}],"member":"320","published-online":{"date-parts":[[2011,6,12]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"WebDB","author":"Agichtein E.","year":"2003","unstructured":"E. Agichtein , P. G. Ipeirotis , and L. Gravano . Modeling query-based access to text databases . In WebDB , 2003 . E. Agichtein, P. G. Ipeirotis, and L. Gravano. Modeling query-based access to text databases. In WebDB, 2003."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1135777.1135833"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242627"},{"key":"e_1_3_2_1_4_1","volume-title":"Mining search engine query logs via suggestion sampling","author":"Bar-Yossef Z.","year":"2008","unstructured":"Z. Bar-Yossef and M. Gurevich . Mining search engine query logs via suggestion sampling . 2008 . Z. Bar-Yossef and M. Gurevich. Mining search engine query logs via suggestion sampling. 2008."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1411509.1411514"},{"key":"e_1_3_2_1_6_1","volume-title":"WWW","author":"Bharat K.","year":"1998","unstructured":"K. Bharat and A. Broder . A technique for measuring the relative size and overlap of public web search engines . In WWW , 1998 . K. Bharat and A. Broder. A technique for measuring the relative size and overlap of public web search engines. In WWW, 1998."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/382979.383040"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/1394399"},{"key":"e_1_3_2_1_9_1","volume-title":"SSDBM","author":"Das G.","year":"2003","unstructured":"G. Das . Survey of approximate query processing techniques(tutorial) . In SSDBM , 2003 . G. Das. Survey of approximate query processing techniques(tutorial). In SSDBM, 2003."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1247480.1247550"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807259"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2009.112"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1739041.1739051"},{"key":"e_1_3_2_1_14_1","volume-title":"VLDB","author":"Garofalakis M. N.","year":"2001","unstructured":"M. N. Garofalakis and P. B. Gibbons . Approximate query processing: Taming the terabytes . In VLDB , 2001 . M. N. Garofalakis and P. B. Gibbons. Approximate query processing: Taming the terabytes. In VLDB, 2001."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1065385.1065407"},{"key":"e_1_3_2_1_16_1","unstructured":"Open Directory Project. http:\/\/www.dmoz.org\/.  Open Directory Project. http:\/\/www.dmoz.org\/."},{"key":"e_1_3_2_1_17_1","volume-title":"VLDB","author":"Ipeirotis L. G.","year":"2002","unstructured":"L. G. Panagiotis G. Ipeirotis . Distributed search over the hidden web: Hierarchical database sampling and selection . In VLDB , 2002 . L. G. Panagiotis G. Ipeirotis. Distributed search over the hidden web: Hierarchical database sampling and selection. In VLDB, 2002."},{"key":"e_1_3_2_1_18_1","volume-title":"John Wiley and Sons","author":"Thompson S. K.","year":"1992","unstructured":"S. K. Thompson . Sampling. John Wiley and Sons , 1992 . S. K. Thompson. Sampling. John Wiley and Sons, 1992."},{"key":"e_1_3_2_1_19_1","volume-title":"Sampling methods for applied research : text and cases","author":"Tryfos P.","year":"1966","unstructured":"P. Tryfos . Sampling methods for applied research : text and cases . John Wiley and Sons , 1966 . P. Tryfos. Sampling methods for applied research : text and cases. John Wiley and Sons, 1966."}],"event":{"name":"SIGMOD\/PODS '11: International Conference on Management of Data","sponsor":["SIGMOD ACM Special Interest Group on Management of Data"],"location":"Athens Greece","acronym":"SIGMOD\/PODS '11"},"container-title":["Proceedings of the 2011 ACM SIGMOD International Conference on Management of data"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1989323.1989406","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1989323.1989406","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T11:22:21Z","timestamp":1750245741000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1989323.1989406"}},"subtitle":["efficient yet unbiased sampling and aggregate estimation"],"short-title":[],"issued":{"date-parts":[[2011,6,12]]},"references-count":19,"alternative-id":["10.1145\/1989323.1989406","10.1145\/1989323"],"URL":"https:\/\/doi.org\/10.1145\/1989323.1989406","relation":{},"subject":[],"published":{"date-parts":[[2011,6,12]]},"assertion":[{"value":"2011-06-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}