{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,6]],"date-time":"2025-12-06T04:59:03Z","timestamp":1764997143769,"version":"3.41.0"},"reference-count":14,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2017,8,2]],"date-time":"2017-08-02T00:00:00Z","timestamp":1501632000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGIR Forum"],"published-print":{"date-parts":[[2017,8,2]]},"abstract":"<jats:p>We propose a new probabilistic approach to information retrieval based upon the ideas and methods of statistical machine translation. The central ingredient in this approach is a statistical model of how a user might distill or \"translate\" a given document into a query. To assess the relevance of a document to a user's query, we estimate the probability that the query would have been generated as a translation of the document, and factor in the user's general preferences in the form of a prior distribution over documents. We propose a simple, well motivated model of the document-to-query translation process, and describe an algorithm for learning the parameters of this model in an unsupervised manner from a collection of documents. As we show, one can view this approach as a generalization and justification of the \"language modeling\" strategy recently proposed by Ponte and Croft. In a series of experiments on TREC data, a simple translation-based retrieval system performs well in comparison to conventional retrieval techniques. This prototype system only begins to tap the full potential of translation-based retrieval.<\/jats:p>","DOI":"10.1145\/3130348.3130371","type":"journal-article","created":{"date-parts":[[2017,8,2]],"date-time":"2017-08-02T19:36:12Z","timestamp":1501702572000},"page":"219-226","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":43,"title":["Information Retrieval as Statistical Translation"],"prefix":"10.1145","volume":"51","author":[{"given":"Adam","family":"Berger","sequence":"first","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, PA"}]},{"given":"John","family":"Lafferty","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, PA"}]}],"member":"320","published-online":{"date-parts":[[2017,8,2]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.4630250505"},{"key":"e_1_2_1_2_1","volume-title":"Information retrieval on the web: Tools and algorithmic issues,\" Invited tutorial at Foundations of Computer Sci- ence (FOCS)","author":"Broder A.","year":"1998","unstructured":"A. Broder and M. Henzinger ( 1998 ). \" Information retrieval on the web: Tools and algorithmic issues,\" Invited tutorial at Foundations of Computer Sci- ence (FOCS) . A. Broder and M. Henzinger (1998). \"Information retrieval on the web: Tools and algorithmic issues,\" Invited tutorial at Foundations of Computer Sci- ence (FOCS)."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.5555\/92858.92860"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/972470.972474"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.3115\/1075671.1075716"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1108\/eb026683"},{"key":"e_1_2_1_7_1","first-page":"1","article-title":"Maximum likelihood from incomplete data via the EM algorithm","author":"Dempster A.","year":"1977","unstructured":"A. Dempster , N. Laird , and D. Rubin ( 1977 ). \" Maximum likelihood from incomplete data via the EM algorithm ,\" Journal of the Royal Statistical Society, 39(B) , pp. 1 -- 38 . A. Dempster, N. Laird, and D. Rubin (1977). \"Maximum likelihood from incomplete data via the EM algorithm,\" Journal of the Royal Statistical Society, 39(B), pp. 1--38.","journal-title":"Journal of the Royal Statistical Society, 39(B)"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.3115\/112405.112428"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/290941.291008"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.4630270302"},{"key":"e_1_2_1_12_1","volume-title":"Okapi at TREC,\" In Proceedings of the first Text REtrieval Conference (TREC-1)","author":"Robertson S.","year":"1992","unstructured":"S. Robertson , S. Walker , M. Hancock-Beaulieu , A. Gull , and M. Lau ( 1992 ). \" Okapi at TREC,\" In Proceedings of the first Text REtrieval Conference (TREC-1) , Gaithersburg, Maryland . S. Robertson, S. Walker, M. Hancock-Beaulieu, A. Gull, and M. Lau (1992). \"Okapi at TREC,\" In Proceedings of the first Text REtrieval Conference (TREC-1), Gaithersburg, Maryland."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"e_1_2_1_14_1","volume-title":"Efficient probabilistic inference for text retrieval,\" Proceedings of RIAO 3","author":"Turtle H.","year":"1991","unstructured":"H. Turtle and W. B. Croft ( 1991 ). \" Efficient probabilistic inference for text retrieval,\" Proceedings of RIAO 3 H.Turtle and W. B. Croft (1991). \"Efficient probabilistic inference for text retrieval,\" Proceedings of RIAO 3"},{"key":"e_1_2_1_15_1","volume-title":"Translation","author":"Weaver W.","year":"1955","unstructured":"W. Weaver ( 1955 ). \" Translation (1949),\" In Machine Translation of Languages, MIT Press . W. Weaver (1955). \"Translation (1949),\" In Machine Translation of Languages, MIT Press."}],"container-title":["ACM SIGIR Forum"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3130348.3130371","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3130348.3130371","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:26:17Z","timestamp":1750213577000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3130348.3130371"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,8,2]]},"references-count":14,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2017,8,2]]}},"alternative-id":["10.1145\/3130348.3130371"],"URL":"https:\/\/doi.org\/10.1145\/3130348.3130371","relation":{},"ISSN":["0163-5840"],"issn-type":[{"type":"print","value":"0163-5840"}],"subject":[],"published":{"date-parts":[[2017,8,2]]},"assertion":[{"value":"2017-08-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}