{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,12]],"date-time":"2025-12-12T13:01:06Z","timestamp":1765544466099,"version":"3.41.0"},"reference-count":27,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2014,4,1]],"date-time":"2014-04-01T00:00:00Z","timestamp":1396310400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2014,4]]},"abstract":"<jats:p>\n            Ranked result lists with query-dependent snippets have become state of the art in text search. They are typically implemented by searching, at query time, for occurrences of the query words in the top-ranked documents. This\n            <jats:italic>document-based<\/jats:italic>\n            approach has three inherent problems: (i) when a document is indexed by terms which it does not contain literally (e.g., related words or spelling variants), localization of the corresponding snippets becomes problematic; (ii) each query operator (e.g., phrase or proximity search) has to be implemented twice, on the index side in order to compute the correct result set, and on the snippet-generation side to generate the appropriate snippets; and (iii) in a worst case, the whole document needs to be scanned for occurrences of the query words, which could be problematic for very long documents.\n          <\/jats:p>\n          <jats:p>\n            We present a new\n            <jats:italic>index-based<\/jats:italic>\n            method that localizes snippets by information solely computed from the index and that overcomes all three problems. 
Unlike previous index-based methods, we show how to achieve this at essentially no extra cost in query processing time, by a technique we call\n            <jats:italic>operator inversion<\/jats:italic>. We also show how our index-based method allows the caching of individual segments instead of complete documents, which enables a significantly larger cache hit-ratio as compared to the document-based approach. We have fully integrated our implementation with the CompleteSearch engine.\n          <\/jats:p>","DOI":"10.1145\/2590972","type":"journal-article","created":{"date-parts":[[2014,4,23]],"date-time":"2014-04-23T13:52:04Z","timestamp":1398261124000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Efficient Index-Based Snippet Generation"],"prefix":"10.1145","volume":"32","author":[{"given":"Hannah","family":"Bast","sequence":"first","affiliation":[{"name":"University of Freiburg"}]},{"given":"Marjan","family":"Celikik","sequence":"additional","affiliation":[{"name":"University of Freiburg"}]}],"member":"320","published-online":{"date-parts":[[2014,4]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Almeida V. Bestavros A. Crovella M. and deOliveira A. 1996. Characterizing reference locality in the WWW. Tech. rep. Boston University."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148235"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1148170.1148234"},{"volume-title":"Proceedings of the 3rd Conference on Innovative Data Systems Research (CIDR). VLDB Endowment, 88--95","author":"Bast H.","key":"e_1_2_1_4_1"},{"volume-title":"Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB). 
VLDB Endowment, 475--486","author":"Bast H.","key":"e_1_2_1_5_1"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277856"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1321440.1321560"},{"volume-title":"Proceedings of the 18th Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM). IEEE Computer Society Press","author":"Breslau L.","key":"e_1_2_1_8_1"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1008992.1009132"},{"key":"e_1_2_1_10_1","unstructured":"Buettcher S. 2007. The Wumpus search engine. http:\/\/www.wumpus-search.org\/."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1951365.1951379"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1529282.1529669"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1093\/comjnl\/38.1.43"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/333135.333137"},{"key":"e_1_2_1_15_1","unstructured":"Cutting D. 2004. Lucene. http:\/\/lucene.apache.org\/."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/312624.312665"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.14778\/2078331.2078341"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277871"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1458082.1458139"},{"key":"e_1_2_1_20_1","unstructured":"Manolache G. 2008. Index-based snippet generation. M.S. 
thesis Saarland University."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/290941.290947"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-00958-7_45"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277766"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1183614.1183703"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390334.1390387"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/564376.564389"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4573(02)00033-X"}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2590972","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2590972","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T20:13:52Z","timestamp":1750277632000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2590972"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,4]]},"references-count":27,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2014,4]]}},"alternative-id":["10.1145\/2590972"],"URL":"https:\/\/doi.org\/10.1145\/2590972","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"type":"print","value":"1046-8188"},{"type":"electronic","value":"1558-2868"}],"subject":[],"published":{"date-parts":[[2014,4]]},"assertion":[{"value":"2012-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-12-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication 
History"}},{"value":"2014-04-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}