{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T21:40:01Z","timestamp":1750282801548,"version":"3.41.0"},"reference-count":46,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2001,7,1]],"date-time":"2001-07-01T00:00:00Z","timestamp":993945600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2001,7]]},"abstract":"<jats:p>A metasearch engine is a system that supports unified access to multiple local search engines. Database selection is one of the main challenges in building a large-scale metasearch engine. The problem is to efficiently and accurately determine a small number of potentially useful local search engines to invoke for each user query. In order to enable accurate selection, metadata that reflect the contents of each search engine need to be collected and used. This article proposes a highly scalable and accurate database selection method. This method has several novel features. First, the metadata for representing the contents of all search engines are organized into a single integrated representative. Such a representative yields both computational efficiency and storage efficiency. Second, the new selection method is based on a theory for ranking search engines optimally. Experimental results indicate that this new method is very effective. An operational prototype system has been built based on the proposed approach.<\/jats:p>","DOI":"10.1145\/502115.502120","type":"journal-article","created":{"date-parts":[[2002,7,27]],"date-time":"2002-07-27T11:29:00Z","timestamp":1027769340000},"page":"310-335","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":24,"title":["A highly scalable and effective method for metasearch"],"prefix":"10.1145","volume":"19","author":[{"given":"Weiyi","family":"Meng","sequence":"first","affiliation":[{"name":"State University of New York at Binghamton"}]},{"given":"Zonghuan","family":"Wu","sequence":"additional","affiliation":[{"name":"State University of New York at Binghamton"}]},{"given":"Clement","family":"Yu","sequence":"additional","affiliation":[{"name":"University of Illinois at Chicago"}]},{"given":"Zhuogang","family":"Li","sequence":"additional","affiliation":[{"name":"State University of New York at Binghamton"}]}],"member":"320","published-online":{"date-parts":[[2001,7]]},"reference":[{"volume-title":"Resource Discovery in a Globally- DistributedDigital Library","author":"ARMS W.","key":"e_1_2_1_1_1","unstructured":"ARMS , W. , BOWMAN , C. , FUHR , N. , GRAVANO , L. , KAPIDAKIS , S. , KOVACS , L. , LAGOZE , C. , LEVAN , B. , PAPAZOGLOU , M. , AND SMEATON , A. 1999. Resource Discovery in a Globally- DistributedDigital Library . Digital Library Collaborative Working Groups Report , http:\/\/www. iei.pi.cnr.it\/DELOS\/NSF\/resourcediscovery.htm. ARMS, W., BOWMAN, C., FUHR, N., GRAVANO, L., KAPIDAKIS, S., KOVACS, L., LAGOZE,C., LEVAN, B., PAPAZOGLOU, M., AND SMEATON, A. 1999. Resource Discovery in a Globally- DistributedDigital Library. Digital Library Collaborative Working Groups Report, http:\/\/www. iei.pi.cnr.it\/DELOS\/NSF\/resourcediscovery.htm."},{"key":"e_1_2_1_2_1","first-page":"258","volume-title":"Proceedings of the ACM SIGIR Conference","author":"BAUMGARTEN C.","year":"1997","unstructured":"BAUMGARTEN , C. 1997 . A probabilistic model for distributed information retrieval . In Proceedings of the ACM SIGIR Conference ( Philadelphia, July) , 258 - 266 . 10.1145\/258525.258585 BAUMGARTEN, C. 1997. A probabilistic model for distributed information retrieval. In Proceedings of the ACM SIGIR Conference (Philadelphia, July), 258-266. 10.1145\/258525.258585"},{"key":"e_1_2_1_3_1","first-page":"246","volume-title":"Proceedings of the ACM SIGIR Conference","author":"BAUMGARTEN C.","year":"1999","unstructured":"BAUMGARTEN , C. 1999 . A probabilistic solution to the selection and fusion problem in distributed information retrieval . In Proceedings of the ACM SIGIR Conference ( Berkeley, Calif., August) , 246 - 253 . 10.1145\/312624.312685 BAUMGARTEN, C. 1999. A probabilistic solution to the selection and fusion problem in distributed information retrieval. In Proceedings of the ACM SIGIR Conference (Berkeley, Calif., August), 246-253. 10.1145\/312624.312685"},{"key":"e_1_2_1_4_1","doi-asserted-by":"crossref","unstructured":"BERGMAN M. 2000. The Deep Web:Surfacing the Hidden Value. BrightPlanet www. completeplanet.com\/Tutorials\/DeepWeb\/index.asp. BERGMAN M. 2000. The Deep Web:Surfacing the Hidden Value. BrightPlanet www. completeplanet.com\/Tutorials\/DeepWeb\/index.asp.","DOI":"10.3998\/3336451.0007.104"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the Seventh World Wide Web Conference (Brisbane, April), 379-388","author":"BHARAT K.","year":"1998","unstructured":"BHARAT , K. AND BRODER , A. 1998 . A technique for measuring the relative size and overlap of public web search engines . In Proceedings of the Seventh World Wide Web Conference (Brisbane, April), 379-388 . BHARAT,K.AND BRODER, A. 1998. A technique for measuring the relative size and overlap of public web search engines. In Proceedings of the Seventh World Wide Web Conference (Brisbane, April), 379-388."},{"key":"e_1_2_1_6_1","first-page":"479","volume-title":"Proceedings of the ACM SIGMOD Conference","author":"CALLAN J.","year":"1999","unstructured":"CALLAN , J. , CONNELL , M. , AND DU , A. 1999 . Automatic discovery of language models for text databases . In Proceedings of the ACM SIGMOD Conference ( Philadelphia, June) , 479 - 490 . 10.1145\/304182.304224 CALLAN, J., CONNELL, M., AND DU, A. 1999. Automatic discovery of language models for text databases. In Proceedings of the ACM SIGMOD Conference (Philadelphia, June), 479-490. 10.1145\/304182.304224"},{"key":"e_1_2_1_7_1","first-page":"21","volume-title":"Proceedings of the ACM SIGIR Conference","author":"CALLAN J.","year":"1995","unstructured":"CALLAN , J. , LU , Z. , AND CROFT , W. 1995 . Searching distributed collections with inference networks . In Proceedings of the ACM SIGIR Conference ( Seattle) , 21 - 28 . 10.1145\/215206.215328 CALLAN, J., LU, Z., AND CROFT, W. 1995. Searching distributed collections with inference networks. In Proceedings of the ACM SIGIR Conference (Seattle), 21-28. 10.1145\/215206.215328"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/256163.256164"},{"key":"e_1_2_1_9_1","first-page":"40","volume-title":"Proceedings of the 1999 AAAI Symposium on Intelligent Agents in Cyberspace","author":"FAN Y.","year":"1999","unstructured":"FAN , Y. AND GAUCH , S. 1999 . Adaptive agents for information gathering from multiple, distributed information sources . In Proceedings of the 1999 AAAI Symposium on Intelligent Agents in Cyberspace ( Stanford University, March) , 40 - 46 . FAN,Y.AND GAUCH, S. 1999. Adaptive agents for information gathering from multiple, distributed information sources. In Proceedings of the 1999 AAAI Symposium on Intelligent Agents in Cyberspace (Stanford University, March), 40-46."},{"key":"e_1_2_1_10_1","first-page":"238","volume-title":"Proceedings of the ACM SIGIR Conference","author":"FRENCH J.","year":"1999","unstructured":"FRENCH , J. , POWELL , A. , CALLAN , J. , VILES , C. , EMMITT , T. , PREY , K. , AND MOU , Y. 1999 . Comparing the performance of database selection algorithms . In Proceedings of the ACM SIGIR Conference ( Berkeley, Calif., August) , 238 - 245 . 10.1145\/312624.312684 FRENCH, J., POWELL, A., CALLAN, J., VILES, C., EMMITT, T., PREY, K., AND MOU, Y. 1999. Comparing the performance of database selection algorithms. In Proceedings of the ACM SIGIR Conference (Berkeley, Calif., August), 238-245. 10.1145\/312624.312684"},{"key":"e_1_2_1_11_1","first-page":"121","volume-title":"Proceedings of the ACM SIGIR Conference","author":"FRENCH J.","year":"1998","unstructured":"FRENCH , J. , POWELL , A. , AND VILES , C. 1998 . Evaluating database selection techniques: A testbed and experiment . In Proceedings of the ACM SIGIR Conference ( Melbourne, August) , 121 - 129 . 10.1145\/290941.290976 FRENCH, J., POWELL, A., AND VILES, C. 1998. Evaluating database selection techniques: A testbed and experiment. In Proceedings of the ACM SIGIR Conference (Melbourne, August), 121-129. 10.1145\/290941.290976"},{"issue":"9","key":"e_1_2_1_12_1","first-page":"637","article-title":"Profusion: Intelligent fusion from multiple, distributed search engines","volume":"2","author":"GAUCH S.","year":"1996","unstructured":"GAUCH , S. , WANG , G. , AND GOMEZ , M. 1996 . Profusion: Intelligent fusion from multiple, distributed search engines . J. Universal Comput. Sci. 2 , 9 , 637 - 649 . GAUCH, S., WANG,G.,AND GOMEZ, M. 1996. Profusion: Intelligent fusion from multiple, distributed search engines. J. Universal Comput. Sci. 2, 9, 637-649.","journal-title":"J. Universal Comput. Sci."},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the International Conferences on Very Large Data Bases (Zurich, September), 78-89","author":"GRAVANO L.","year":"1995","unstructured":"GRAVANO , L. AND GARCIA-MOLINA , H. 1995 . Generalizing gloss to vector-space databases and broker hierarchies . In Proceedings of the International Conferences on Very Large Data Bases (Zurich, September), 78-89 . GRAVANO,L.AND GARCIA-MOLINA, H. 1995. Generalizing gloss to vector-space databases and broker hierarchies. In Proceedings of the International Conferences on Very Large Data Bases (Zurich, September), 78-89."},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the International Conferences on Very Large Data Bases (Athens, August), 196-205","author":"GRAVANO L.","year":"1997","unstructured":"GRAVANO , L. AND GARCIA-MOLINA , H. 1997 . Merging ranks from heterogeneous internet sources . In Proceedings of the International Conferences on Very Large Data Bases (Athens, August), 196-205 . GRAVANO,L.AND GARCIA-MOLINA, H. 1997. Merging ranks from heterogeneous internet sources. In Proceedings of the International Conferences on Very Large Data Bases (Athens, August), 196-205."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/297117.297123"},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of the ACM SIGMOD Conference (Santa Barbara, Calif.), 67-78","author":"IPEIROTIS P.","year":"2001","unstructured":"IPEIROTIS , P. , GRAVANO , L. , AND SAHAMI , M. 2001 . Probe, count, and classify: Categorizing hiddenweb databases . In Proceedings of the ACM SIGMOD Conference (Santa Barbara, Calif.), 67-78 . 10.1145\/375663.375671 IPEIROTIS, P., GRAVANO, L., AND SAHAMI, M. 2001. Probe, count, and classify: Categorizing hiddenweb databases. In Proceedings of the ACM SIGMOD Conference (Santa Barbara, Calif.), 67-78. 10.1145\/375663.375671"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/281250.281253"},{"key":"e_1_2_1_18_1","volume-title":"AAAI Spring Symposium on Information Gathering in Distributed Heterogeneous Environments.","author":"KIRK T.","year":"1995","unstructured":"KIRK , T. , LEVY , A. , SAGIV , Y. , AND SRIVASTAVA , D. 1995 . The information manifold . In AAAI Spring Symposium on Information Gathering in Distributed Heterogeneous Environments. KIRK, T., LEVY, A., SAGIV,Y.,AND SRIVASTAVA, D. 1995. The information manifold. In AAAI Spring Symposium on Information Gathering in Distributed Heterogeneous Environments."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/305110.305112"},{"volume-title":"Proceedings of the Seventh International World Wide Web Conference (Brisbane, April), 95-105","author":"LAWRENCE S.","key":"e_1_2_1_20_1","unstructured":"LAWRENCE , S. AND LEE GILES, C. 1998a. Inquirus, the neci meta search engine . In Proceedings of the Seventh International World Wide Web Conference (Brisbane, April), 95-105 . LAWRENCE,S.AND LEE GILES, C. 1998a. Inquirus, the neci meta search engine. In Proceedings of the Seventh International World Wide Web Conference (Brisbane, April), 95-105."},{"key":"e_1_2_1_21_1","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1126\/science.280.5360.98","article-title":"Searching the world wide web","volume":"280","author":"LAWRENCE S.","year":"1998","unstructured":"LAWRENCE , S. AND LEE GILES , C. 1998 b. Searching the world wide web . Science 280 , 98 - 100 . LAWRENCE,S.AND LEE GILES, C. 1998b. Searching the world wide web. Science 280, 98-100.","journal-title":"Science"},{"key":"e_1_2_1_22_1","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1038\/21987","article-title":"Accessibility of information on the web","volume":"400","author":"LAWRENCE S.","year":"1999","unstructured":"LAWRENCE , S. AND LEE GILES , C. 1999 . Accessibility of information on the web . Nature 400 , 107 - 109 . LAWRENCE,S.AND LEE GILES, C. 1999. Accessibility of information on the web. Nature 400, 107- 109.","journal-title":"Nature"},{"key":"e_1_2_1_23_1","first-page":"145","volume-title":"Proceedings of the ACM SIGIR Conference","author":"LIMA E.","year":"1999","unstructured":"LIMA , E. AND PEDERSEN , J. 1999 . Phrases recognition and expansion for short, precision-biased queries based on a query log . In Proceedings of the ACM SIGIR Conference ( Berkeley, Calif. August) , 145 - 152 . 10.1145\/312624.312669 LIMA,E.AND PEDERSEN, J. 1999. Phrases recognition and expansion for short, precision-biased queries based on a query log. In Proceedings of the ACM SIGIR Conference (Berkeley, Calif. August), 145-152. 10.1145\/312624.312669"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2002.1047777"},{"key":"e_1_2_1_26_1","first-page":"154","volume-title":"Proceedings of the IEEE International Conference on Data Engineering.","author":"LIU L.","year":"1999","unstructured":"LIU , L. 1999 . Query routing in large-scale digital library systems . In Proceedings of the IEEE International Conference on Data Engineering. ( Sydney, March) , 154 - 163 . LIU, L. 1999. Query routing in large-scale digital library systems. In Proceedings of the IEEE International Conference on Data Engineering. (Sydney, March), 154-163."},{"key":"e_1_2_1_27_1","volume-title":"Proceedings of the USENIX Symposium on Internet Technologies and Systems (Monterey, Calif., December), 231-239","author":"MANBER U.","year":"1997","unstructured":"MANBER , U. AND BIGOT , P. 1997 . The search broker . In Proceedings of the USENIX Symposium on Internet Technologies and Systems (Monterey, Calif., December), 231-239 . MANBER,U.AND BIGOT, P. 1997. The search broker. In Proceedings of the USENIX Symposium on Internet Technologies and Systems (Monterey, Calif., December), 231-239."},{"key":"e_1_2_1_28_1","first-page":"14","volume-title":"Proceedings of the International Conferences on Very Large Data Bases","author":"MENG M.","year":"1998","unstructured":"MENG , M. , LIU , K. , YU , C. , WANG , X. , CHANG , Y. , AND RISHE , N. 1998 . Determine text databases to search in the internet . In Proceedings of the International Conferences on Very Large Data Bases , ( New York, August) , 14 - 25 . MENG, M., LIU, K., YU, C., WANG, X., CHANG,Y.,AND RISHE, N. 1998. Determine text databases to search in the internet. In Proceedings of the International Conferences on Very Large Data Bases, (New York, August), 14-25."},{"key":"e_1_2_1_29_1","first-page":"146","volume-title":"Proceedings of the IEEE International Conference on Data Engineering","author":"MENG M.","year":"1999","unstructured":"MENG , M. , LIU , K. , YU , C. , WU , W. , AND RISHE , N. 1999 a. Estimating the usefulness of search engines . In Proceedings of the IEEE International Conference on Data Engineering ( Sydney, March) , 146 - 153 . MENG, M., LIU, K., YU, C., WU,W.,AND RISHE, N. 1999a. Estimating the usefulness of search engines. In Proceedings of the IEEE International Conference on Data Engineering (Sydney, March), 146-153."},{"key":"e_1_2_1_30_1","article-title":"Concept hierarchy based text database categorization. Int","author":"MENG W.","year":"2001","unstructured":"MENG , W. , WANG , W. , SUN , H. , AND YU , C. 2001 a. Concept hierarchy based text database categorization. Int . J. Knowl. Inf. Syst. (to appear). MENG, W., WANG, W., SUN, H., AND YU, C. 2001a. Concept hierarchy based text database categorization. Int. J. Knowl. Inf. Syst. (to appear).","journal-title":"J. Knowl. Inf. Syst. (to appear)."},{"key":"e_1_2_1_31_1","unstructured":"MENG W. YU C. AND LIU K. 2001b. Building effective and efficient metasearch engines. ACM Comput. Surv. (to appear). 10.1145\/505282.505284 MENG W. YU C. AND LIU K. 2001b. Building effective and efficient metasearch engines. ACM Comput. Surv. (to appear). 10.1145\/505282.505284"},{"key":"e_1_2_1_32_1","first-page":"22","volume-title":"Proceedings of the Fourth IFCIS Conference on Cooperative Information Systems","author":"MENG W.","year":"1999","unstructured":"MENG , W. , YU , C. , AND LIU , K. 1999 b. Detection of heterogeneities in a multiple text database environment . In Proceedings of the Fourth IFCIS Conference on Cooperative Information Systems ( Edinburgh, September) , 22 - 33 . MENG, W., YU,C.,AND LIU, K. 1999b. Detection of heterogeneities in a multiple text database environment. In Proceedings of the Fourth IFCIS Conference on Cooperative Information Systems (Edinburgh, September), 22-33."},{"volume-title":"Introduction to Modern Information Retrieval","author":"SALTON G.","key":"e_1_2_1_34_1","unstructured":"SALTON , G. AND MCGILL , M. 1983. Introduction to Modern Information Retrieval . McGraw-Hill , New York . SALTON,G.AND MCGILL, M. 1983. Introduction to Modern Information Retrieval. McGraw-Hill, New York."},{"key":"e_1_2_1_35_1","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1145\/3592626.3592641","volume-title":"Proceedings of the Fourth World Wide Web Conference","author":"SELBERG E.","year":"1995","unstructured":"SELBERG , E. AND ETZIONI , O. 1995 . Multi-service search and comparison using the metacrawler . In Proceedings of the Fourth World Wide Web Conference ( Boston, December) , 195 - 208 . SELBERG,E.AND ETZIONI, O. 1995. Multi-service search and comparison using the metacrawler. In Proceedings of the Fourth World Wide Web Conference (Boston, December), 195-208."},{"issue":"1","key":"e_1_2_1_36_1","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/64.577468","article-title":"The metacrawler architecture for resource aggregation on the Web","volume":"12","author":"SELBERG E.","year":"1997","unstructured":"SELBERG , E. AND ETZIONI , O. 1997 . The metacrawler architecture for resource aggregation on the Web . IEEE Expert 12 , 1 , 8 - 14 . SELBERG,E.AND ETZIONI, O. 1997. The metacrawler architecture for resource aggregation on the Web. IEEE Expert 12, 1, 8-14.","journal-title":"IEEE Expert"},{"issue":"1","key":"e_1_2_1_37_1","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1108\/eb026526","article-title":"Statistical interpretation of term specificity and its application in retrieval","volume":"28","author":"SPARCK JONES K.","year":"1972","unstructured":"SPARCK JONES , K. 1972 . Statistical interpretation of term specificity and its application in retrieval . J. Doc. 28 , 1 , 11 - 20 . SPARCK JONES, K. 1972. Statistical interpretation of term specificity and its application in retrieval. J. Doc. 28, 1, 11-20.","journal-title":"J. Doc."},{"key":"e_1_2_1_38_1","first-page":"417","volume-title":"Proceedings of the Ninth World Wide Web Conference","author":"SUGIURA A.","year":"2000","unstructured":"SUGIURA , A. AND ETZIONI , O. 2000 . Query routing for web search engines: Architecture and experiments . In Proceedings of the Ninth World Wide Web Conference ( Amsterdam, May) , 417 - 429 . SUGIURA,A.AND ETZIONI, O. 2000. Query routing for web search engines: Architecture and experiments. In Proceedings of the Ninth World Wide Web Conference (Amsterdam, May), 417-429."},{"key":"e_1_2_1_39_1","first-page":"172","volume-title":"Proceedings of the ACM SIGIR Conference","author":"VOORHEES E.","year":"1995","unstructured":"VOORHEES , E. , GUPTA , N. , AND JOHNSON-LAIRD , B. 1995 . Learning collection fusion strategies . In Proceedings of the ACM SIGIR Conference ( Seattle, July) , 172 - 179 . 10.1145\/215206.215357 VOORHEES, E., GUPTA,N.,AND JOHNSON-LAIRD, B. 1995. Learning collection fusion strategies. In Proceedings of the ACM SIGIR Conference (Seattle, July), 172-179. 10.1145\/215206.215357"},{"key":"e_1_2_1_40_1","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1109\/WISE.2000.882403","volume-title":"Proceedings of the First International Conference on Web Information Systems Engineering","author":"WANG W.","year":"2000","unstructured":"WANG , W. , MENG , W. , AND YU , C. 2000 . Concept hierarchy based text database categorization in a metasearch engine environment . In Proceedings of the First International Conference on Web Information Systems Engineering ( Hong Kong, June) , 283 - 290 . WANG, W., MENG,W.,AND YU, C. 2000. Concept hierarchy based text database categorization in a metasearch engine environment. In Proceedings of the First International Conference on Web Information Systems Engineering (Hong Kong, June), 283-290."},{"key":"e_1_2_1_41_1","first-page":"112","volume-title":"Proceedings of the ACM SIGIR Conference","author":"XU J.","year":"1998","unstructured":"XU , J. AND CALLAN , J. 1998 . Effective retrieval with distributed collections . In Proceedings of the ACM SIGIR Conference ( Melbourne, Australia) , 112 - 120 . 10.1145\/290941.290974 XU,J.AND CALLAN, J. 1998. Effective retrieval with distributed collections. In Proceedings of the ACM SIGIR Conference (Melbourne, Australia), 112-120. 10.1145\/290941.290974"},{"key":"e_1_2_1_42_1","first-page":"254","volume-title":"Proceedings of the ACM SIGIR Conference","author":"XU J.","year":"1999","unstructured":"XU , J. AND CROFT , B. 1999 . Cluster-based language models for distributed retrieval . In Proceedings of the ACM SIGIR Conference ( Berkeley, Calif., August) , 254 - 261 . 10.1145\/312624.312687 XU,J.AND CROFT, B. 1999. Cluster-based language models for distributed retrieval. In Proceedings of the ACM SIGIR Conference (Berkeley, Calif., August), 254-261. 10.1145\/312624.312687"},{"volume-title":"Principles of Database Query Processing for Advanced Applications","author":"YU C.","key":"e_1_2_1_43_1","unstructured":"YU , C. AND MENG , W. 1998. Principles of Database Query Processing for Advanced Applications . Kaufmann , San Francisco . YU,C.AND MENG, W. 1998. Principles of Database Query Processing for Advanced Applications. Kaufmann, San Francisco."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2002.1047772"},{"key":"e_1_2_1_45_1","first-page":"150","volume-title":"Proceedings of the IEEE Conference on Advances in Digital Libraries","author":"YU C.","year":"1999","unstructured":"YU , C. , LIU , K. , WU , M., W. , W., AND RISHE , N. 1999 a. Finding the most similar documents across multiple text databases . In Proceedings of the IEEE Conference on Advances in Digital Libraries ( Baltimore, May) , 150 - 162 . YU, C., LIU, K., WU, M., W., W., AND RISHE, N. 1999a. Finding the most similar documents across multiple text databases. In Proceedings of the IEEE Conference on Advances in Digital Libraries (Baltimore, May), 150-162."},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of the Eighth ACM International Conference on Information and Knowledge Management (Kansas City, November), 217-224","author":"YU C.","year":"1999","unstructured":"YU , C. , MENG , W. , LIU , K. , WU , W. , AND RISHE , N. 1999 b. Efficient and effective metasearch for a large number of text databases . In Proceedings of the Eighth ACM International Conference on Information and Knowledge Management (Kansas City, November), 217-224 . 10.1145\/319950.320005 YU, C., MENG, W., LIU, K., WU,W.,AND RISHE, N. 1999b. Efficient and effective metasearch for a large number of text databases. In Proceedings of the Eighth ACM International Conference on Information and Knowledge Management (Kansas City, November), 217-224. 10.1145\/319950.320005"},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of the ACMSIGMOD Conference (Santa Barbara, Calif., May), 187-198","author":"YU C.","year":"2001","unstructured":"YU , C. , MENG , W. , WU , W. , AND LIU , K. 2001 a. Efficient and effective metasearch for text databases incorporating linkages among documents . In Proceedings of the ACMSIGMOD Conference (Santa Barbara, Calif., May), 187-198 . 10.1145\/375663.375684 YU, C., MENG, W., WU,W.,AND LIU, K. 2001a. Efficient and effective metasearch for text databases incorporating linkages among documents. In Proceedings of the ACMSIGMOD Conference (Santa Barbara, Calif., May), 187-198. 10.1145\/375663.375684"},{"key":"e_1_2_1_48_1","first-page":"391","volume-title":"Proceedings of the fifth International Conference On Database Systems For Advanced Applications","author":"YUWONO B.","year":"1997","unstructured":"YUWONO , B. AND LEE , D. 1997 . Server ranking for distributed text resource systems on the internet . In Proceedings of the fifth International Conference On Database Systems For Advanced Applications ( Melbourne, Australia, April) , 391 - 400 . YUWONO,B.AND LEE, D. 1997. Server ranking for distributed text resource systems on the internet. In Proceedings of the fifth International Conference On Database Systems For Advanced Applications (Melbourne, Australia, April), 391-400."}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/502115.502120","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/502115.502120","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T21:15:13Z","timestamp":1750281313000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/502115.502120"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2001,7]]},"references-count":46,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2001,7]]}},"alternative-id":["10.1145\/502115.502120"],"URL":"https:\/\/doi.org\/10.1145\/502115.502120","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"type":"print","value":"1046-8188"},{"type":"electronic","value":"1558-2868"}],"subject":[],"published":{"date-parts":[[2001,7]]},"assertion":[{"value":"2001-07-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}