{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T06:07:11Z","timestamp":1775282831198,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":30,"publisher":"ACM","license":[{"start":{"date-parts":[[2009,11,2]],"date-time":"2009-11-02T00:00:00Z","timestamp":1257120000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2009,11,2]]},"DOI":"10.1145\/1645953.1646043","type":"proceedings-article","created":{"date-parts":[[2009,11,10]],"date-time":"2009-11-10T18:36:45Z","timestamp":1257878205000},"page":"701-710","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["Automatic retrieval of similar content using search engine query interface"],"prefix":"10.1145","author":[{"given":"Ali","family":"Dasdan","sequence":"first","affiliation":[{"name":"Yahoo! Inc., Sunnyvale, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Paolo","family":"D'Alberto","sequence":"additional","affiliation":[{"name":"Yahoo! Inc., Sunnyvale, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Santanu","family":"Kolay","sequence":"additional","affiliation":[{"name":"Yahoo! Inc., Sunnyvale, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chris","family":"Drome","sequence":"additional","affiliation":[{"name":"Yahoo! Inc., Sunnyvale, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2009,11,2]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1411509.1411514"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.24.0354"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0169-7552(98)00127-5"},{"key":"e_1_3_2_1_4_1","first-page":"858","volume-title":"Proc. Joint Conf. Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)","author":"Brants T.","year":"2007","unstructured":"T. Brants , A. C. Popat , P. Xu , F. J. Och , and J. Dean . Large language models in machine translation . In Proc. Joint Conf. Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL) , pp. 858 -- 867 . ACL, 2007 . T. Brants, A. C. Popat, P. Xu, F. J. Och, and J. Dean. Large language models in machine translation. In Proc. Joint Conf. Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 858--867. ACL, 2007."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/223784.223855"},{"key":"e_1_3_2_1_6_1","first-page":"21","volume-title":"Proc. Compression and Complexity of Sequences (SEQUENCES)","author":"Broder A.","unstructured":"A. Broder . On the resemblance and containment of documents . In Proc. Compression and Complexity of Sequences (SEQUENCES) , page 21 . IEEE, 1997. A. Broder. On the resemblance and containment of documents. In Proc. Compression and Complexity of Sequences (SEQUENCES), page 21. IEEE, 1997."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1183614.1183699"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4613-9323-8_11"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/647819.736184"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/276698.276781"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0169-7552(97)00031-7"},{"key":"e_1_3_2_1_12_1","volume-title":"Morgan Kaufmann","author":"Chakrabarti S.","year":"2003","unstructured":"S. Chakrabarti . Mining the Web . Morgan Kaufmann , 2003 . S. Chakrabarti. Mining the Web. Morgan Kaufmann, 2003."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/509907.509965"},{"key":"e_1_3_2_1_14_1","volume-title":"Tutorial in Int. Conf. World Wide Web (WWW), ACM","author":"Dasdan A.","year":"2009","unstructured":"A. Dasdan , K. Tsioutsiouliklis , and E. Velipasaoglu . Web search engine metrics . Tutorial in Int. Conf. World Wide Web (WWW), ACM , 2009 . A. Dasdan, K. Tsioutsiouliklis, and E. Velipasaoglu. Web search engine metrics. Tutorial in Int. Conf. World Wide Web (WWW), ACM, 2009."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/383952.384072"},{"key":"e_1_3_2_1_16_1","volume-title":"Concrete Mathematics","author":"Graham R. L.","year":"1994","unstructured":"R. L. Graham , D. E. Knuth , and O. Patashnik . Concrete Mathematics . Addison-Wesley , 1994 . R. L. Graham, D. E. Knuth, and O. Patashnik. Concrete Mathematics. Addison-Wesley, 1994."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148222"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/582415.582418"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.14.0309"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.22.0159"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242592"},{"key":"e_1_3_2_1_22_1","volume-title":"Patent Appl.","author":"McSherry F. D.","year":"2008","unstructured":"F. D. McSherry , K. Talwar , and M. D. Manasse . Consistent weighted sampling of multisets and distributions. U.S . Patent Appl. , Sep 2008 . F. D. McSherry, K. Talwar, and M. D. Manasse. Consistent weighted sampling of multisets and distributions. U.S. Patent Appl., Sep 2008."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/211390"},{"key":"e_1_3_2_1_24_1","first-page":"57","volume-title":"Proc. of Conf. on Research and Dev. in Info. Retrieval (SIGIR)","author":"Noreault T.","year":"1981","unstructured":"T. Noreault , M. McGill , and M. Koll . A performance evaluation of similarity measures, document term weighting schemes and representations in a boolean environment . In Proc. of Conf. on Research and Dev. in Info. Retrieval (SIGIR) , pp. 57 -- 76 . ACM, 1981 . T. Noreault, M. McGill, and M. Koll. A performance evaluation of similarity measures, document term weighting schemes and representations in a boolean environment. In Proc. of Conf. on Research and Dev. in Info. Retrieval (SIGIR), pp. 57--76. ACM, 1981."},{"issue":"4","key":"e_1_3_2_1_25_1","first-page":"247","article-title":"Retrieving similar documents from the Web","volume":"2","author":"Pereira A.","year":"2004","unstructured":"A. Pereira Jr . and N. Ziviani . Retrieving similar documents from the Web . J. Web Engineering , 2 ( 4 ): 247 -- 261 , 2004 . A. Pereira Jr. and N. Ziviani. Retrieving similar documents from the Web. J. Web Engineering, 2(4):247--261, 2004.","journal-title":"J. Web Engineering"},{"key":"e_1_3_2_1_26_1","volume-title":"Proc. Int. Conf. Digital Libraries (DL). ACM","author":"Shivakumar N.","year":"1995","unstructured":"N. Shivakumar and H. Garc\u00eda-Molina . SCAM: A copy detection mechanism for digital documents . In Proc. Int. Conf. Digital Libraries (DL). ACM , 1995 . N. Shivakumar and H. Garc\u00eda-Molina. SCAM: A copy detection mechanism for digital documents. In Proc. Int. Conf. Digital Libraries (DL). ACM, 1995."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/226931.226961"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(198803)39:2<92::AID-ASI4>3.0.CO;2-P"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242738"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1498759.1498806"}],"event":{"name":"CIKM '09: Conference on Information and Knowledge Management","location":"Hong Kong China","acronym":"CIKM '09","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGIR ACM Special Interest Group on Information Retrieval"]},"container-title":["Proceedings of the 18th ACM conference on Information and knowledge management"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1645953.1646043","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1645953.1646043","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T12:41:10Z","timestamp":1750250470000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1645953.1646043"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,11,2]]},"references-count":30,"alternative-id":["10.1145\/1645953.1646043","10.1145\/1645953"],"URL":"https:\/\/doi.org\/10.1145\/1645953.1646043","relation":{},"subject":[],"published":{"date-parts":[[2009,11,2]]},"assertion":[{"value":"2009-11-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}