{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T17:46:21Z","timestamp":1754156781886,"version":"3.41.2"},"reference-count":16,"publisher":"Emerald","issue":"1","license":[{"start":{"date-parts":[[1982,1,1]],"date-time":"1982-01-01T00:00:00Z","timestamp":378691200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[1982,1,1]]},"abstract":"<jats:p>A new and promising approach to document clustering consists of utilizing previously formed clusters of queries to cluster documents. To employ this approach in practice a similarity measure for queries must be available. This requirement does not cause any problem in the case of information retrieval systems in which both the search request formulations and document representations are sets of weighted or unweighted index terms. However, in most operational retrieval systems search request formulations are Boolean combinations of index terms. Research into similarity measures for search request formulations of this type has already been undertaken by the author and reported elsewhere. The present paper provides further results of investigations in this area. The novelty of the approach discussed is the incorporation within the methodology described earlier of a weighting mechanism to indicate the relative importance of particular attributes of a given Boolean search request formulation. 
A modification suggested is based on the standard probabilistic approach to information retrieval.<\/jats:p>","DOI":"10.1108\/eb026719","type":"journal-article","created":{"date-parts":[[2008,1,19]],"date-time":"2008-01-19T07:47:16Z","timestamp":1200728836000},"page":"14-28","source":"Crossref","is-referenced-by-count":5,"title":["ON A PROBABILISTIC APPROACH TO DETERMINING THE SIMILARITY BETWEEN BOOLEAN SEARCH REQUEST FORMULATIONS"],"prefix":"10.1108","volume":"38","author":[{"given":"TADEUSZ","family":"RADECKI","sequence":"first","affiliation":[]}],"member":"140","reference":[{"volume-title":"New York: McGraw-Hill","year":"1968","author":"SALTON G.","key":"p_1"},{"volume-title":"The SMART retrieval system: experiments in automatic document processing","year":"1971","author":"SALTON G.","key":"p_2"},{"volume-title":"Dynamic information and library processing","year":"1975","author":"SALTON G.","key":"p_3"},{"key":"p_4","volume-title":"Information retrieval","author":"VAN RIJSBERGEN C. J.","year":"1979","edition":"2"},{"volume-title":"Document retrieval systems-optimization and evaluation. Report No. ISR-10 to the National Science Foundation","year":"1966","author":"ROCCHIO J. J.","key":"p_5"},{"volume-title":"The SMART retrieval system: experiments in automatic document processing.","year":"1971","author":"DATTOLA R. T.","key":"p_6"},{"key":"p_7","doi-asserted-by":"publisher","DOI":"10.1002\/asi.4630250403"},{"volume-title":"A modified two-level search algorithm using request clustering. Report No. ISR-11 to the National Science Foundation. Section VII","year":"1966","author":"LESSER V. R.","key":"p_8"},{"key":"p_9","first-page":"92","volume-title":"New Trends in Documentation and Information. Proceedings of the 39th FID Congress. 
London: Aslib","author":"RADECKI T.","year":"1980"},{"volume-title":"Journal of the American Society for Information Science","year":"1981","author":"RADECKI T.","key":"p_10"},{"volume-title":"Proceedings of the Joint BCS and ACM Symposium: Research and Development in Information Retrieval","year":"1981","author":"RADECKI T.","key":"p_11"},{"key":"p_12","doi-asserted-by":"publisher","DOI":"10.1145\/362663.362735"},{"key":"p_13","unstructured":"GILL, A. Applied algebra for the computer sciences. Englewood Cliffs, New Jersey: Prentice-Hall, 1976, pp.173-177."},{"key":"p_14","doi-asserted-by":"publisher","DOI":"10.1002\/asi.4630270302"},{"key":"p_15","doi-asserted-by":"publisher","DOI":"10.1108\/eb026637"},{"key":"p_16","doi-asserted-by":"publisher","DOI":"10.1108\/eb026647"}],"container-title":["Journal of Documentation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/eb026719\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/eb026719\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:11:03Z","timestamp":1753398663000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/jd\/article\/38\/1\/14-28\/217086"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1982,1,1]]},"references-count":16,"journal-issue":{"issue":"1","published-print":{"date-parts":[[1982,1,1]]}},"alternative-id":["10.1108\/eb026719"],"URL":"https:\/\/doi.org\/10.1108\/eb026719","relation":{},"ISSN":["0022-0418"],"issn-type":[{"type":"print","value":"0022-0418"}],"subject":[],"published":{"date-parts":[[1982,1,1]]}}}