{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T23:20:43Z","timestamp":1775604043530,"version":"3.50.1"},"reference-count":290,"publisher":"Emerald","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,3,7]]},"abstract":"<jats:p>Federated search (federated information retrieval or distributed information retrieval) is a technique for searching multiple text collections simultaneously. Queries are submitted to a subset of collections that are most likely to return relevant answers. The results returned by selected collections are integrated and merged into a single list. Federated search is preferred over centralized search alternatives in many environments. For example, commercial search engines such as Google cannot easily index uncrawlable hidden web collections while federated search systems can search the contents of hidden web collections without crawling. In enterprise environments, where each organization maintains an independent search engine, federated search techniques can provide parallel search over multiple collections.<\/jats:p>\n                  <jats:p>There are three major challenges in federated search. For each query, a subset of collections that are most likely to return relevant documents are selected. This creates the collection selection problem. To be able to select suitable collections, federated search systems need to acquire some knowledge about the contents of each collection, creating the collection representation problem. The results returned from the selected collections are merged before the final presentation to the user. This final step is the result merging problem.<\/jats:p>\n                  <jats:p>The goal of this work, is to provide a comprehensive summary of the previous research on the federated search challenges described above.<\/jats:p>","DOI":"10.1561\/1500000010","type":"journal-article","created":{"date-parts":[[2011,3,25]],"date-time":"2011-03-25T05:47:53Z","timestamp":1301032073000},"page":"1-102","source":"Crossref","is-referenced-by-count":115,"title":["Federated Search"],"prefix":"10.1561","volume":"5","author":[{"given":"Milad","family":"Shokouhi","sequence":"first","affiliation":[{"name":"Microsoft Research , 7 JJ Thomson Avenue, Cambridge, CB30FB,","place":["UK"]}]},{"given":"Luo","family":"Si","sequence":"additional","affiliation":[{"name":"Purdue University , 250N University Street, West Lafayette, IN 47907-2066,","place":["USA"]}]}],"member":"140","published-online":{"date-parts":[[2011,3,7]]},"reference":[{"key":"2026040314280861500_ref001","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1109\/ITCC.2002.1000443","volume-title":"Proceedings of the International Conference on Information Technology: Coding and Computing","author":"Abbaci","year":"2002"},{"issue":"4","key":"2026040314280861500_ref002","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1145\/1107499.1107500","article-title":"Information source selection for resource constrained environments","volume":"34","author":"Aksoy","year":"2005","journal-title":"SIGMOD Record"},{"key":"2026040314280861500_ref003","author":"Allan","year":"2009"},{"key":"2026040314280861500_ref004","first-page":"551","volume-title":"Proceedings of the Ninth Text Retrieval Conference","author":"Allan","year":"2000"},{"key":"2026040314280861500_ref005","first-page":"245","volume-title":"Ellis and Hagino [88]","author":"Anagnostopoulos"},{"key":"2026040314280861500_ref006","volume-title":"Proceedings of the 27th International Conference on Very Large Data Bases","author":"Apers","year":"2001"},{"key":"2026040314280861500_ref007","first-page":"337","article-title":"Extracting structured data from web pages","volume-title":"Proceedings ACM SIGMOD International Conference on Management of Data","author":"Arasu","year":"2003."},{"key":"2026040314280861500_ref008","first-page":"1277","article-title":"Classification-based resource selection","volume-title":"Cheung et al. [54]","author":"Arguello"},{"key":"2026040314280861500_ref009","first-page":"315","volume":"3","author":"Arguello","journal-title":"Sources of evidence for vertical selection"},{"key":"2026040314280861500_ref010","volume-title":"Proceedings of the Seventh International Conference on World Wide Web","author":"Ashman","year":"1998"},{"key":"2026040314280861500_ref011","first-page":"276","volume":"72","author":"Aslam","journal-title":"Models for metasearch"},{"key":"2026040314280861500_ref012","first-page":"484","volume":"152","author":"Aslam","journal-title":"A unified model for metasearch, pooling, and system evaluation"},{"issue":"3","key":"2026040314280861500_ref013","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1002\/asi.20283","article-title":"The FedLemur: federated search in the real world","volume":"57","author":"Avrahami","journal-title":"Journal of the American Society for Information Science and Technology"},{"key":"2026040314280861500_ref014","volume-title":"Information retrieval: data structures and algorithms","author":"Baeza-Yates","year":"1992"},{"key":"2026040314280861500_ref015","volume-title":"Modern Information Retrieval","author":"Baeza-Yates","year":"1999"},{"issue":"6","key":"2026040314280861500_ref016","doi-asserted-by":"crossref","first-page":"853","DOI":"10.1016\/S0306-4573(02)00084-5","article-title":"Engineering a multi-purpose test collection for web retrieval experiments","volume":"39","author":"Bailey","journal-title":"Information Processing and Management"},{"key":"2026040314280861500_ref017","first-page":"316","volume":"69","author":"Baillie","year":"2006","journal-title":"Adaptive query-based sampling of distributed collections"},{"key":"2026040314280861500_ref018","first-page":"1110","article-title":"An evaluation of resource description quality measures","author":"Baillie","year":"2006"},{"key":"2026040314280861500_ref019","volume-title":"Proceedings of the First International Conference on Scalable Information systems","author":"Baillie","year":"2006"},{"key":"2026040314280861500_ref020","first-page":"485","volume-title":"Proceedings of the 31st European Conference on Information Retrieval Research, vol. 5478 of Lecture Notes in Computer Science","author":"Baillie","year":"2009"},{"key":"2026040314280861500_ref021","first-page":"401","volume":"269","author":"Bar-Yossef","journal-title":"Efficient search engine measurements"},{"key":"2026040314280861500_ref022","first-page":"367","volume-title":"in Proceedings of the 15th International Conference on World Wide Web","author":"Bar-Yossef","year":"2006"},{"key":"2026040314280861500_ref023","volume":"269","author":"Barbosa","journal-title":"Combining classifiers to identify online databases"},{"key":"2026040314280861500_ref024","first-page":"258","volume":"27","author":"Baumgarten","journal-title":"probabilistic model for distributed information retrieval"},{"key":"2026040314280861500_ref025","first-page":"246","volume":"106","author":"Baumgarten","journal-title":"A probabilistic solution to the selection and fusion problem in distributed information retrieval"},{"key":"2026040314280861500_ref026","author":"Belkin","year":"2000"},{"key":"2026040314280861500_ref027","author":"Belkin","year":"1997"},{"issue":"1","key":"2026040314280861500_ref028","doi-asserted-by":"crossref","DOI":"10.3998\/3336451.0007.104","article-title":"The deep web: Surfacing hidden value","volume":"7","author":"Bergman","journal-title":"Journal of Electronic Publishing"},{"key":"2026040314280861500_ref029","volume-title":"Proceedings of the 28th International Conference on Very Large Data Bases","author":"Bernstein","year":"2002"},{"key":"2026040314280861500_ref030","first-page":"110","volume":"69","author":"Bernstein","journal-title":"Compact features for detection of near-duplicates in distributed retrieval"},{"key":"2026040314280861500_ref031","first-page":"55","volume-title":"Proceedings of the 11th International String Processing and Information Retrieval Conference, vol. 3246 of Lecture Notes in Computer Science","author":"Bernstein","year":"2004"},{"key":"2026040314280861500_ref032","first-page":"465","volume":"56","author":"Berretti","journal-title":"MIND: resource selection and data fusion in multimedia distributed digital libraries"},{"key":"2026040314280861500_ref033","first-page":"379","volume":"10","author":"Bharat","year":"1998","journal-title":"A technique for measuring the relative size and overlap of public web search engines"},{"issue":"1\u20137","key":"2026040314280861500_ref034","first-page":"379","article-title":"A technique for measuring the relative size and overlap of public web search engines","volume":"30","author":"Bharat","journal-title":"Computer Networks and ISDN Systems"},{"key":"2026040314280861500_ref035","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"2026040314280861500_ref036","first-page":"398","volume-title":"Copy detection mechanisms for digital documents","author":"Brin","year":"1995"},{"key":"2026040314280861500_ref037","first-page":"107","volume":"10","author":"Brin","journal-title":"The anatomy of a large-scale hypertextual web search engine"},{"key":"2026040314280861500_ref038","first-page":"594","volume-title":"Estimating corpus size via queries","author":"Broder","year":"2006"},{"issue":"8\u201313","key":"2026040314280861500_ref039","first-page":"1157","article-title":"Syntactic clustering of the web","volume":"29","author":"Broder","journal-title":"Computer Networks and ISDN System"},{"key":"2026040314280861500_ref040","first-page":"361","volume":"224","author":"Buttler","journal-title":"A fully automated object extraction system for the World Wide Web"},{"key":"2026040314280861500_ref041","first-page":"127","volume-title":"Advances in information retrieval, Chapter 5, vol. 7 of The Information Retrieval Series","author":"Callan","year":"2000"},{"issue":"2","key":"2026040314280861500_ref042","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1145\/382979.383040","article-title":"Query-based sampling of text databases","volume":"19","author":"Callan","year":"2001","journal-title":"ACM Transactions on Information Systems"},{"key":"2026040314280861500_ref043","first-page":"479","article-title":"Automatic discovery of language models for text databases","author":"Callan","year":"1999"},{"key":"2026040314280861500_ref044","volume-title":"Distributed Multimedia Information Retrieval, SIGIR 2003 Workshop on Distributed Information Retrieval, Revised Selected and Invited Papers, volume 2924 of Lecture Notes in Computer Science","author":"Callan","year":"2004"},{"key":"2026040314280861500_ref045","first-page":"78","volume-title":"The INQUERY retrieval system","author":"Callan","year":"1992"},{"key":"2026040314280861500_ref046","first-page":"21","volume":"91","author":"Callan","journal-title":"Searching distributed collections with inference networks"},{"key":"2026040314280861500_ref047","volume-title":"The effects of query-based sampling on automatic database selection algorithms","author":"Callan","year":"2000"},{"issue":"3","key":"2026040314280861500_ref048","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1016\/S0306-4573(99)00036-9","article-title":"Database merging strategy based on logistic regression","volume":"36","author":"Calv\u00e9","year":"2000","journal-title":"Information Processing and Management"},{"key":"2026040314280861500_ref049","first-page":"719","volume":"192","author":"Carman","journal-title":"Towards personalized distributed information retrieval"},{"key":"2026040314280861500_ref050","first-page":"340","volume":"87","author":"Caverlee","journal-title":"Distributed query sampling: A qualityconscious approach"},{"key":"2026040314280861500_ref051","first-page":"1867","volume":"54","author":"Cetinta","journal-title":"Learning from past queries for resource selection"},{"key":"2026040314280861500_ref052","first-page":"4","volume":"91","author":"Chakravarthy","journal-title":"NetSerf: using semantic knowledge to find internet information archives"},{"issue":"10","key":"2026040314280861500_ref053","doi-asserted-by":"crossref","first-page":"1411","DOI":"10.1109\/TKDE.2006.152","article-title":"A survey of web information extraction systems","volume":"18","author":"Chang","year":"2006","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"2026040314280861500_ref054","author":"Cheung","year":"2009"},{"issue":"4","key":"2026040314280861500_ref055","doi-asserted-by":"crossref","first-page":"390","DOI":"10.1145\/958942.958945","article-title":"Effective page refresh policies for web crawlers","volume":"28","author":"Cho","year":"2003","journal-title":"ACM Transactions on Database Systems"},{"key":"2026040314280861500_ref056","author":"Clarke","year":"2003"},{"issue":"1","key":"2026040314280861500_ref057","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1145\/1067268.1067274","article-title":"The TREC terabyte retrieval track","volume":"39","author":"Clarke","year":"2005","journal-title":"SIGIR Forum"},{"key":"2026040314280861500_ref058","first-page":"709","volume-title":"Learning trees and rules with set-valued features","author":"Cohen","year":"1996"},{"issue":"1","key":"2026040314280861500_ref059","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1145\/635484.635488","article-title":"Early user \u2014 system interaction for database selection in massive domain-specific online environments","volume":"21","author":"Conrad","year":"2003","journal-title":"ACM Transactions on Information Systems"},{"key":"2026040314280861500_ref060","first-page":"71","volume":"29","author":"Conrad","journal-title":"Database selection using actual physical and acquired logical collection resources in a massive domainspecific operational environment"},{"key":"2026040314280861500_ref061","volume":"142","author":"Conrad","journal-title":"Effective collection metasearch in a hierarchical environment: global vs. localized retrieval performance"},{"key":"2026040314280861500_ref062","first-page":"189","volume-title":"Proceedings of the 14th Australasian database conference","author":"Cope","year":"2003"},{"key":"2026040314280861500_ref063","volume-title":"PhD thesis","author":"Craswell","year":"2000"},{"key":"2026040314280861500_ref064","first-page":"37","volume-title":"Server selection on the World Wide Web","author":"Craswell","year":"2000"},{"key":"2026040314280861500_ref065","first-page":"86","volume-title":"Proceedings of the 11th Text REtrieval Conference","author":"Craswell","year":"2002"},{"key":"2026040314280861500_ref066","first-page":"250","volume":"72","author":"Craswell","journal-title":"Effective site finding using link anchor information"},{"key":"2026040314280861500_ref067","first-page":"189","volume-title":"Proceedings of the 10th Australasian Database Conference","author":"Craswell","year":"1999"},{"key":"2026040314280861500_ref068","first-page":"109","volume":"6","author":"Crescenzi","journal-title":"Roadrunner: Towards automatic data extraction from large web sites"},{"key":"2026040314280861500_ref069","volume-title":"Proceedings of the 13th International String Processing and Information Retrieval Conference, vol. 4209 of Lecture Notes in Computer Science","author":"Crestani","year":"2006"},{"key":"2026040314280861500_ref070","author":"Crestani","year":"2010"},{"key":"2026040314280861500_ref071","first-page":"1","volume-title":"Advances in Information Retrieval, Chapter 1, volume 7 of The Information Retrieval Series","author":"Croft","year":"2000"},{"key":"2026040314280861500_ref072","author":"Croft","year":"2001"},{"key":"2026040314280861500_ref073","author":"Croft","year":"1998"},{"key":"2026040314280861500_ref074","first-page":"66","volume-title":"Methodologies for distributed information retrieval","author":"de Kretser","year":"1998"},{"issue":"6","key":"2026040314280861500_ref075","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9","article-title":"Indexing by latent semantic analysis","volume":"41","author":"Deerwester","year":"1990","journal-title":"Journal of the American Society for Information Sciences"},{"key":"2026040314280861500_ref076","doi-asserted-by":"crossref","DOI":"10.1002\/0471729000","volume-title":"Optimal Statistical Decisions (Wiley Classics Library)","author":"DeGroot","year":"2004"},{"key":"2026040314280861500_ref077","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1145\/1498759.1498825","volume-title":"Proceedings of the Second ACM International Conference on Web Search and Data Mining","author":"Diaz","year":"2009"},{"key":"2026040314280861500_ref078","first-page":"323","volume":"3","author":"Diaz","journal-title":"Adaptation of offline vertical selection predictions in the presence of user feedback"},{"key":"2026040314280861500_ref079","first-page":"323","volume":"70","author":"Diaz","journal-title":"Adaptation of offline vertical selection predictions in the presence of user feedback"},{"key":"2026040314280861500_ref080","first-page":"154","volume":"87","author":"Diaz","journal-title":"Improving the estimation of relevance models using large external corpora"},{"issue":"3","key":"2026040314280861500_ref081","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1145\/256163.256164","article-title":"Experiences with selecting search engines using metasearch","volume":"15","author":"Dreilinger","year":"1997","journal-title":"ACM Transaction on Information Systems"},{"key":"2026040314280861500_ref082","volume-title":"PhD thesis","author":"D\u2019Souza","year":"2005"},{"key":"2026040314280861500_ref083","first-page":"52","volume-title":"Proceedings of the Second International Symposium on Cooperative Database Systems for Advanced Applications (CODAS\u201999)","author":"D\u2019Souza","year":"1999"},{"key":"2026040314280861500_ref084","first-page":"28","volume-title":"Proceedings of the Australasian Database Conference","author":"D\u2019Souza","year":"2000"},{"issue":"3","key":"2026040314280861500_ref085","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1016\/S0306-4573(03)00008-6","article-title":"Collection selection for managed distributed document databases","volume":"40","author":"D\u2019Souza","year":"2004","journal-title":"Information Processing and Management"},{"key":"2026040314280861500_ref086","first-page":"41","volume-title":"Is CORI effective for collection selection? An exploration of parameters, queries, and data","author":"D\u2019Souza","year":"2004"},{"key":"2026040314280861500_ref087","author":"Efthimiadis","year":"2006"},{"key":"2026040314280861500_ref088","doi-asserted-by":"crossref","DOI":"10.1145\/1060745","volume-title":"Proceedings of the 14th International Conference on World Wide Web","author":"Ellis","year":"2005"},{"key":"2026040314280861500_ref089","first-page":"347","volume":"192","author":"Elsas","journal-title":"Retrieval and feedback models for blog feed search"},{"key":"2026040314280861500_ref090","first-page":"37","volume-title":"Proceedings of the First Conference on Latin American Web Congress","author":"Fetterly","year":"2003"},{"key":"2026040314280861500_ref091","author":"Fox","year":"1995"},{"key":"2026040314280861500_ref092","first-page":"243","volume-title":"Proceedings of the Second Text REtrieval Conference","author":"Fox","year":"1993"},{"key":"2026040314280861500_ref093","first-page":"105","volume-title":"Proceedings of the Third Text REtrieval Conference","author":"Fox","year":"1994"},{"issue":"3","key":"2026040314280861500_ref094","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1023\/A:1019241915635","article-title":"Metrics for evaluating database selection techniques","volume":"3","author":"French","year":"2000","journal-title":"World Wide Web"},{"key":"2026040314280861500_ref095","first-page":"238","volume":"106","author":"French","journal-title":"Comparing the performance of database selection algorithms"},{"key":"2026040314280861500_ref096","first-page":"199","volume":"200","author":"French","journal-title":"Exploiting a controlled vocabulary to improve collection selection and retrieval effectiveness"},{"key":"2026040314280861500_ref097","first-page":"121","volume":"73","author":"French","journal-title":"Evaluating database selection techniques: A testbed and experiment"},{"key":"2026040314280861500_ref098","volume-title":"Optimum database selection in networked IR","author":"Fuhr","year":"1996"},{"issue":"3","key":"2026040314280861500_ref099","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1145\/314516.314517","article-title":"A decision-theoretic approach to database selection in networked IR","volume":"17","author":"Fuhr","year":"1999","journal-title":"ACM Transactions on Information Systems"},{"key":"2026040314280861500_ref100","first-page":"35","volume-title":"Resource discovery in distributed digital libraries","author":"Fuhr","year":"1999"},{"key":"2026040314280861500_ref101","first-page":"7","volume-title":"Proceedings of the 27th Australasian Computer Science Conference","author":"Garcia","year":"2004"},{"key":"2026040314280861500_ref102","author":"Gauch","year":"1999"},{"key":"2026040314280861500_ref103","first-page":"174","volume-title":"Information fusion with ProFusion","author":"Gauch","year":"1996"},{"issue":"9","key":"2026040314280861500_ref104","first-page":"637","article-title":"ProFusion: Intelligent fusion from multiple distributed search engines","volume":"2","author":"Gauch","year":"1996","journal-title":"Journal of Universal Computer Science"},{"issue":"9","key":"2026040314280861500_ref105","first-page":"637","article-title":"ProFusion: Intelligent fusion from multiple, distributed search engines","volume":"2","author":"Gauch","year":"1996","journal-title":"Journal of Universal Computer Science"},{"key":"2026040314280861500_ref106","author":"Gey","year":"1999"},{"key":"2026040314280861500_ref107","volume-title":"Selective Retrieval Metasearch Engine (United States Patent 2002\/0165860 a1)","author":"Glover","year":"2001"},{"key":"2026040314280861500_ref108","first-page":"210","volume":"102","author":"Glover","year":"1999","journal-title":"Architecture of a metasearch engine that supports user information needs"},{"key":"2026040314280861500_ref109","first-page":"258","volume-title":"Proceedings of the Seventh International Tools with Artificial Intelligence","author":"Goldberg","year":"1995"},{"key":"2026040314280861500_ref110","volume-title":"PhD thesis","author":"Gravano","year":"1997"},{"key":"2026040314280861500_ref111","first-page":"207","volume-title":"STARTS: Stanford proposal for Internet meta-searching","author":"Gravano","year":"1997"},{"key":"2026040314280861500_ref112","first-page":"78","volume-title":"Proceedings of the 21st International Conference on Very Large Data Bases","author":"Gravano","year":"1995"},{"key":"2026040314280861500_ref113","first-page":"126","volume-title":"The effectiveness of GlOSS for the text database discovery problem","author":"Gravano","year":"1994"},{"key":"2026040314280861500_ref114","first-page":"103","volume-title":"Proceedings of the Third International Conference on Parallel and Distributed Information Systems","author":"Gravano","year":"1994"},{"issue":"2","key":"2026040314280861500_ref115","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1145\/320248.320252","article-title":"GlOSS: text-source discovery over the Internet","volume":"24","author":"Gravano","year":"1999","journal-title":"ACM Transactions on Database Systems"},{"issue":"1","key":"2026040314280861500_ref116","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/635484.635485","article-title":"Qprober: A system for automatic classification of hidden web databases","volume":"21","author":"Gravano","year":"2003","journal-title":"ACM Transactions on Information Systems"},{"key":"2026040314280861500_ref117","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1145\/379437.379496","volume-title":"Proceedings of the ACM\/IEEE Joint Conference on Digital Libraries","author":"Green","year":"2001"},{"key":"2026040314280861500_ref118","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-55864-1","volume-title":"Linear Regression","author":"Gross","year":"2003"},{"key":"2026040314280861500_ref119","first-page":"902","volume":"88","author":"Gulli","journal-title":"The indexable web is more than 11.5 billion pages"},{"key":"2026040314280861500_ref120","first-page":"492","volume":"152","author":"Han","year":"2003","journal-title":"Intelligent metasearch engine for knowledge management"},{"key":"2026040314280861500_ref121","first-page":"1","volume-title":"Proceedings of the Third Text REtrieval Conference","author":"Harman","year":"1994"},{"key":"2026040314280861500_ref122","first-page":"1","volume-title":"Proceedings of the Fourth Text REtrieval Conference","author":"Harman","year":"1995"},{"key":"2026040314280861500_ref123","first-page":"93","volume-title":"Proceedings of the Seventh Text REtrieval Conference","author":"Hawking","year":"1997"},{"issue":"1","key":"2026040314280861500_ref124","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1145\/297117.297123","article-title":"Methods for information server selection","volume":"17","author":"Hawking","year":"1999","journal-title":"ACM Transactions on Information Systems"},{"key":"2026040314280861500_ref125","first-page":"75","volume":"181","author":"Hawking","year":"2005","journal-title":"Server selection methods in hybrid portal search"},{"key":"2026040314280861500_ref126","first-page":"131","volume-title":"Proceedings of the Eight Text REtrieval Conference","author":"Hawking","year":"2000"},{"key":"2026040314280861500_ref127","first-page":"627","volume-title":"Information extraction from template-generated hidden web documents","author":"Hedley","year":"2004"},{"key":"2026040314280861500_ref128","first-page":"558","volume-title":"Query-related data extraction of hidden web documents","author":"Hedley","year":"2004"},{"key":"2026040314280861500_ref129","first-page":"1","volume-title":"A two-phase sampling technique for information extraction from hidden web databases","author":"Hedley","year":"2004"},{"key":"2026040314280861500_ref130","first-page":"516","volume-title":"Proceedings of the Fifth International Conference on Web Information Systems Engineering, vol. 3306 of Lecture Notes in Computer Science","author":"Hedley","year":"2004"},{"key":"2026040314280861500_ref131","first-page":"295","volume":"132","author":"Henzinger","journal-title":"On nearuniform url sampling"},{"key":"2026040314280861500_ref132","volume-title":"Proceedings of the Ninth International Conference on World Wide Web","author":"Herman","year":"2000"},{"key":"2026040314280861500_ref133","first-page":"1128","volume":"88","author":"Hernandez","journal-title":"Improving text collection selection with coverage and overlap statistics"},{"key":"2026040314280861500_ref134","first-page":"98","volume":"70","author":"Hong","journal-title":"A joint probabilistic classification model for resource selection"},{"key":"2026040314280861500_ref135","volume-title":"Applied Logistic Regression","author":"Hosmer","year":"1989"},{"key":"2026040314280861500_ref136","volume-title":"PhD thesis","author":"Ipeirotis","year":"2004"},{"key":"2026040314280861500_ref137","first-page":"394","volume":"29","author":"Ipeirotis","journal-title":"Distributed search over the hidden web: Hierarchical database sampling and selection"},{"key":"2026040314280861500_ref138","first-page":"767","volume-title":"When one sample is not enough: improving text database selection using shrinkage","author":"Ipeirotis","year":"2004"},{"issue":"2","key":"2026040314280861500_ref139","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1344411.1344412","article-title":"Classification-aware hidden-web text database selection","volume":"26","author":"Ipeirotis","year":"2008","journal-title":"ACM Transactions on Information Systems"},{"key":"2026040314280861500_ref140","first-page":"606","volume-title":"Proceedings of the 21st International Conference on Data Engineering","author":"Ipeirotis","year":"2005"},{"key":"2026040314280861500_ref141","volume-title":"Algorithms for Clustering Data","author":"Jain","year":"1988"},{"key":"2026040314280861500_ref142","author":"J\u00e4rvelin","year":"2002"},{"key":"2026040314280861500_ref143","first-page":"41","volume":"26","author":"J\u00e4rvelin","journal-title":"IR evaluation methods for retrieving highly relevant documents"},{"key":"2026040314280861500_ref144","author":"Kalpakis","year":"2002"},{"key":"2026040314280861500_ref145","first-page":"81","volume-title":"Proceedings of the Second International Workshop on Web-based Support Systems","author":"Karnatapu","year":"2004"},{"key":"2026040314280861500_ref146","first-page":"50","volume":"70","author":"Kim","journal-title":"Ranking using multiple document types in desktop search"},{"key":"2026040314280861500_ref147","first-page":"33","volume-title":"Preliminary investigations into ontologybased collection selection","author":"King","year":"2006"},{"key":"2026040314280861500_ref148","volume-title":"Document retrieval over networks wherein ranking and relevance scores are computed at the client for multiple database documents (United States Patent 5,659,732)","author":"Kirsch","year":"2003"},{"key":"2026040314280861500_ref149","first-page":"347","volume":"3","author":"K\u00f6nig","journal-title":"Click-through prediction for news queries"},{"issue":"2","key":"2026040314280861500_ref150","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1016\/0169-7552(94)90131-7","article-title":"ALIWEB, Archie-like indexing in the web","volume":"27","author":"Koster","year":"1994","journal-title":"Computer Networks and ISDN Systems"},{"key":"2026040314280861500_ref151","author":"Kraaij","year":"2007"},{"key":"2026040314280861500_ref152","author":"Kraft","year":"2003"},{"key":"2026040314280861500_ref153","volume-title":"Information Theory and Statistics","author":"Kullback","year":"1959"},{"key":"2026040314280861500_ref154","first-page":"111","volume":"72","author":"Lafferty","journal-title":"Document language models, query models, and risk minimization for information retrieval"},{"key":"2026040314280861500_ref155","first-page":"282","volume-title":"Collection selection and results merging with topically organized U.S. patents and TREC data","author":"Larkey","year":"2000"},{"key":"2026040314280861500_ref156","first-page":"399","volume":"142","author":"Larson","journal-title":"A logistic regression approach to distributed IR"},{"key":"2026040314280861500_ref157","first-page":"487","volume-title":"Research and Advanced Technology for Digital Libraries, Seventh European Conference, vol. 2769 of Lecture Notes in Computer Science","author":"Larson","year":"2003"},{"key":"2026040314280861500_ref158","first-page":"95","volume":"10","author":"Lawrence","journal-title":"Inquirus, the NECi meta search engine"},{"key":"2026040314280861500_ref159","first-page":"267","volume":"27","author":"Lee","journal-title":"Analyses of multiple evidence combination"},{"key":"2026040314280861500_ref160","first-page":"139","volume":"87","author":"Lillis","journal-title":"ProbFuse: a probabilistic approach to data fusion"},{"key":"2026040314280861500_ref161","first-page":"358","volume-title":"Proceedings of the 30th European Conference on Information Retrieval Research, volume 4956 of Lecture Notes in Computer Science","author":"Lillis","year":"2008"},{"key":"2026040314280861500_ref162","first-page":"332","volume-title":"Proceedings of the International Conference on Information Technology: Coding and Computing","author":"Lin","year":"2002"},{"key":"2026040314280861500_ref163","doi-asserted-by":"crossref","DOI":"10.1145\/956750.956826","volume-title":"Mining data records in web pages","author":"Liu","year":"2003"},{"key":"2026040314280861500_ref164","first-page":"1017","volume-title":"Allinonenews: development and evaluation of a large-scale news metasearch engine","author":"Liu","year":"2007"},{"key":"2026040314280861500_ref165","first-page":"652","volume":"200","author":"Liu","journal-title":"Discovering the representative of a search engine"},{"issue":"3","key":"2026040314280861500_ref166","doi-asserted-by":"crossref","DOI":"10.1109\/TKDE.2009.109","article-title":"Vide: A vision-based approach for deep web data extraction","volume":"22","author":"Liu","year":"2010","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"2026040314280861500_ref167","first-page":"1485","volume":"223","author":"Lu","journal-title":"Efficient estimation of the size of text deep web data source"},{"key":"2026040314280861500_ref168","volume-title":"PhD thesis","author":"Lu","year":"2007"},{"key":"2026040314280861500_ref169","first-page":"332","volume":"144","author":"Lu","journal-title":"Pruning long documents for distributed information retrieval"},{"key":"2026040314280861500_ref170","first-page":"332","volume":"87","author":"Lu","journal-title":"User modeling for full-text federated search in peer-to-peer networks"},{"key":"2026040314280861500_ref171","first-page":"199","volume":"152","author":"Lu","year":"2003","journal-title":"Content-based retrieval in hybrid peer-to-peer networks"},{"key":"2026040314280861500_ref172","first-page":"1","volume-title":"Proceedings of the 2003 Annual national Conference on Digital Government Research","author":"Lu","year":"2003"},{"key":"2026040314280861500_ref173","first-page":"52","volume-title":"Proceedings of the 27th European Conference on IR Research","author":"Lu","year":"2005"},{"key":"2026040314280861500_ref174","article-title":"Estimating deep web data source size by capturerecapture method","volume-title":"Information Retrieval, page to appear","author":"Lu","year":"2009"},{"key":"2026040314280861500_ref175","doi-asserted-by":"crossref","first-page":"718","DOI":"10.1109\/WIIAT.2008.392","article-title":"An approach to deep web crawling by sampling","volume":"1","author":"Lu","year":"2008","journal-title":"IEEE\/WIC\/ACM International Conference on Web Intelligence and Intelligent Agent Technology"},{"key":"2026040314280861500_ref176","first-page":"53","volume-title":"Proceedings of the Sixth International Conference on Web Information Systems Engineering, vol. 3806 of Lecture Notes in Computer Science","author":"Lu","year":"2005"},{"key":"2026040314280861500_ref177","first-page":"342","article-title":"Web-scale data integration: You can only afford to pay as you go","volume-title":"in Proceedings of Conference on Innovative Data Systems Research","author":"Madhavan","year":"2007"},{"issue":"2","key":"2026040314280861500_ref178","doi-asserted-by":"crossref","first-page":"1241","DOI":"10.14778\/1454159.1454163","article-title":"Google\u2019s Deep Web crawl","volume":"1","author":"Madhavan","year":"2008","journal-title":"Proceedings of VLDB"},{"key":"2026040314280861500_ref179","first-page":"1","volume-title":"Finding similar files in a large file system","author":"Manber","year":"1994"},{"key":"2026040314280861500_ref180","volume-title":"The search broker","author":"Manber","year":"1997"},{"key":"2026040314280861500_ref181","author":"Marchionini","year":"2005"},{"issue":"4","key":"2026040314280861500_ref182","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1016\/0306-4573(84)90001-3","article-title":"On a model of distributed information retrieval systems based on thesauri","volume":"20","author":"Mazur","year":"1984","journal-title":"Information Processing and Management"},{"key":"2026040314280861500_ref183","first-page":"359","volume-title":"Proceedings of the 15th International Conference on Machine Learning","author":"McCallum","year":"1998"},{"issue":"3","key":"2026040314280861500_ref184","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1145\/502115.502120","article-title":"A highly scalable and effective method for metasearch","volume":"19","author":"Meng","year":"2001","journal-title":"ACM Transactions on Information Systems"},{"issue":"1","key":"2026040314280861500_ref185","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1145\/505282.505284","article-title":"Building efficient and effective metasearch engines","volume":"34","author":"Meng","year":"2002","journal-title":"ACM Computing Surveys"},{"key":"2026040314280861500_ref186","volume-title":"Advanced Metasearch Engine Technology","author":"Meng","year":"2010"},{"key":"2026040314280861500_ref187","first-page":"311","volume":"151","author":"Metzler","journal-title":"Latent concept expansion using markov random fields"},{"key":"2026040314280861500_ref188","first-page":"85","volume-title":"Proceedings of the Third Text REtrieval Conference","author":"Moffat","year":"1994"},{"key":"2026040314280861500_ref189","doi-asserted-by":"crossref","first-page":"1241","DOI":"10.1109\/HICSS.2002.993982","volume-title":"Proceedings of the 35th Annual Hawaii International Conference on System Sciences (HICSS\u201902)- Volume 3","author":"Monroe","year":"2002"},{"key":"2026040314280861500_ref190","volume-title":"Determining stopping criteria in the generation of web-derived language models","author":"Monroe","year":"2000"},{"issue":"2","key":"2026040314280861500_ref191","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1145\/1480506.1480520","article-title":"Workshop on aggregated search","volume":"42","author":"Murdock","year":"2008","journal-title":"SIGIR Forum"},{"key":"2026040314280861500_ref192","author":"Myaeng","year":"2008"},{"key":"2026040314280861500_ref193","volume-title":"PhD thesis","author":"Ng","year":"1998"},{"key":"2026040314280861500_ref194","first-page":"43","volume":"44","author":"Nottelmann","journal-title":"Decision-theoretic resource selection for different data types in MIND"},{"key":"2026040314280861500_ref195","first-page":"290","volume":"56","author":"Nottelmann","journal-title":"Evaluating different methods of estimating retrieval quality for resource selection"},{"key":"2026040314280861500_ref196","first-page":"112","volume":"44","author":"Nottelmann","journal-title":"The MIND architecture for heterogeneous multimedia federated digital libraries"},{"key":"2026040314280861500_ref197","first-page":"138","volume-title":"Proceedings of the 26th European Conference on IR Research, vol. 2997 of Lecture Notes in Computer Science","author":"Nottelmann","year":"2004"},{"key":"2026040314280861500_ref198","first-page":"183","volume":"200","author":"Ogilvie","journal-title":"The effectiveness of query expansion for distributed information retrieval"},{"key":"2026040314280861500_ref199","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1145\/511446.511490","volume-title":"Proceedings of the 11th International Conference on World Wide Web","author":"Oztekin","year":"2002"},{"key":"2026040314280861500_ref200","author":"Paques","year":"2001"},{"key":"2026040314280861500_ref201","first-page":"275","volume":"73","author":"Ponte","journal-title":"A language modeling approach to information retrieval"},{"key":"2026040314280861500_ref202","volume-title":"PhD thesis","author":"Powell","year":"2001"},{"issue":"4","key":"2026040314280861500_ref203","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1145\/944012.944016","article-title":"Comparing the performance of collection selection algorithms","volume":"21","author":"Powell","year":"2003","journal-title":"ACM Transactions on Information Systems"},{"key":"2026040314280861500_ref204","first-page":"232","volume":"26","author":"Powell","journal-title":"The impact of database selection on distributed searching"},{"key":"2026040314280861500_ref205","volume-title":"Numerical Recipes in C: The Art of Scientific Computing","author":"Press","year":"1988"},{"key":"2026040314280861500_ref206","first-page":"129","volume":"6","author":"Raghavan","journal-title":"Crawling the hidden web"},{"key":"2026040314280861500_ref207","first-page":"191","volume":"200","author":"Rasolofo","journal-title":"Approaches to collection selection and results merging for distributed information retrieval"},{"issue":"4","key":"2026040314280861500_ref208","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1016\/S0306-4573(02)00122-X","article-title":"Result merging strategies for a current news metasearcher","volume":"39","author":"Rasolofo","year":"2003","journal-title":"Information Processing and Management"},{"key":"2026040314280861500_ref209","first-page":"161","volume-title":"Proceedings of the Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications","author":"Ratnasamy","year":"2001"},{"key":"2026040314280861500_ref210","volume-title":"Technical report","author":"Renda","year":"2002"},{"key":"2026040314280861500_ref211","first-page":"841","volume-title":"Web metasearch: Rank vs. score based rank aggregation methods","author":"Renda","year":"2003"},{"issue":"3","key":"2026040314280861500_ref212","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1002\/asi.4630270302","article-title":"Relevance weighting of search terms","volume":"27","author":"Robertson","year":"1976","journal-title":"Journal of the American Society for Information Sciences"},{"key":"2026040314280861500_ref213","first-page":"281","volume-title":"Readings in Information Retrieval","author":"Robertson","year":"1997"},{"key":"2026040314280861500_ref214","first-page":"232","volume-title":"Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval","author":"Robertson","year":"1994"},{"key":"2026040314280861500_ref215","volume-title":"Technical report","author":"Salton","year":"1983"},{"key":"2026040314280861500_ref216","volume-title":"Introduction to Modern Information Retrieval","author":"Salton","year":"1986"},{"key":"2026040314280861500_ref217","first-page":"489","volume-title":"Proceedings of the Fifth Text REtrieval Conference","author":"Savoy","year":"1996"},{"key":"2026040314280861500_ref218","first-page":"228","article-title":"The estimation of fish populations in lakes and ponds","volume":"18","author":"Schumacher","year":"1943","journal-title":"Journal of the Tennesse Academy of Science"},{"key":"2026040314280861500_ref219","volume-title":"Proceedings of the Fourth International Conference on World Wide Web","author":"Selberg","year":"1995"},{"issue":"1","key":"2026040314280861500_ref220","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/64.577468","article-title":"The MetaCrawler architecture for resource aggregation on the web","volume":"12","author":"Selberg","year":"1997","journal-title":"IEEE Expert"},{"key":"2026040314280861500_ref221","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1109\/64.577468","article-title":"The MetaCrawler architecture for resource aggregation on the web","author":"Selberg","year":"1997","journal-title":"IEEE Expert, January\u2013February"},{"key":"2026040314280861500_ref222","first-page":"1053","volume":"223","author":"Seo","journal-title":"Blog site search using resource selection"},{"key":"2026040314280861500_ref223","author":"Shanahan","year":"2008"},{"key":"2026040314280861500_ref224","doi-asserted-by":"crossref","DOI":"10.1145\/371920","volume-title":"Proceedings of the 10th International Conference on World Wide Web","author":"Shen","year":"2001"},{"key":"2026040314280861500_ref225","first-page":"125","volume-title":"Proceedings of the Second International Conference on Web Information Systems Engineering","author":"Shen","year":"2001"},{"key":"2026040314280861500_ref226","first-page":"160","volume-title":"Proceedings of the 29th European Conference on Information Retrieval Research, vol. 4425 of Lecture Notes in Computer Science","author":"Shokouhi","year":"2007"},{"key":"2026040314280861500_ref227","first-page":"185","volume-title":"Proceedings of the 29th European Conference on Information Retrieval Research, vol. 4425 of Lecture Notes in Computer Science","author":"Shokouhi","year":"2007"},{"key":"2026040314280861500_ref228","first-page":"511","volume":"151","author":"Shokouhi","journal-title":"Updating collection representations for federated search"},{"key":"2026040314280861500_ref229","first-page":"63","volume-title":"Sample sizes for query probing in uncooperative distributed information retrieval","author":"Shokouhi","year":"2006"},{"key":"2026040314280861500_ref230","first-page":"427","volume":"3","author":"Shokouhi","journal-title":"Effective query expansion for federated search"},{"key":"2026040314280861500_ref231","first-page":"495","volume":"151","author":"Shokouhi","journal-title":"Federated text retrieval from uncooperative overlapped collections"},{"issue":"3","key":"2026040314280861500_ref232","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1508850.1508852","article-title":"Robust result merging using sample-based score estimates","volume":"27","author":"Shokouhi","year":"2009","journal-title":"ACM Transactions on Information Systems"},{"key":"2026040314280861500_ref233","first-page":"141","volume-title":"Proceedings of the 18th Australasian Database Conference, vol. 63 of CRPIT","author":"Shokouhi","year":"2007"},{"key":"2026040314280861500_ref234","first-page":"141","volume-title":"Distributed text retrieval from overlapping collections","author":"Shokouhi","year":"2007"},{"key":"2026040314280861500_ref235","first-page":"316","volume":"87","author":"Shokouhi","journal-title":"Capturing collection size for distributed non-cooperative retrieval"},{"issue":"1","key":"2026040314280861500_ref236","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1016\/j.ipm.2006.04.003","article-title":"Using query logs to establish vocabularies in distributed information retrieval","volume":"43","author":"Shokouhi","year":"2007","journal-title":"Information Processing and Management"},{"key":"2026040314280861500_ref237","first-page":"413","volume":"142","author":"Shou","journal-title":"Experiments on data fusion using headline information"},{"key":"2026040314280861500_ref238","author":"Shushmita","year":"2010"},{"key":"2026040314280861500_ref239","volume-title":"PhD thesis","author":"Si","year":"2006"},{"key":"2026040314280861500_ref240","first-page":"31","volume":"44","author":"Si","journal-title":"The effect of database size distribution on resource selection algorithms"},{"key":"2026040314280861500_ref241","first-page":"83","volume":"181","author":"Si","journal-title":"Modeling search engine effectiveness for federated search"},{"key":"2026040314280861500_ref242","first-page":"298","volume":"56","author":"Si","journal-title":"Relevant document distribution estimation method for resource selection"},{"key":"2026040314280861500_ref243","first-page":"19","volume":"142","author":"Si","journal-title":"Using sampled data and regression to merge search engine results"},{"issue":"4","key":"2026040314280861500_ref244","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1145\/944012.944017","article-title":"A semisupervised learning method to merge search engine results","volume":"21","author":"Si","year":"2003","journal-title":"ACM Transactions on Information Systems"},{"key":"2026040314280861500_ref245","first-page":"32","volume-title":"Unified utility maximization framework for resource selection","author":"Si","year":"2004"},{"key":"2026040314280861500_ref246","doi-asserted-by":"crossref","unstructured":"L.\n              Si\n             and J.Callan, \u201cCLEF2005: multilingual retrieval by combining multiple multilingual ranked lists,\u201d in The Sixth Workshop of the Cross-Language Evaluation Forum, Vienna, Austria, 2005. URL http:\/\/www.cs.purdue.edu\/homes\/lsi\/publications.htm.","DOI":"10.1007\/11878773_13"},{"issue":"1","key":"2026040314280861500_ref247","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10791-007-9036-6","article-title":"An effective and efficient results merging strategy for multilingual information retrieval in federated search environments","volume":"11","author":"Si","year":"2008","journal-title":"Information Retrieval"},{"key":"2026040314280861500_ref248","first-page":"391","volume":"144","author":"Si","journal-title":"A language modeling framework for resource selection and results merging"},{"key":"2026040314280861500_ref249","volume-title":"Selected papers from the Sixth International Conference on World Wide Web","author":"Smeaton","year":"1997"},{"key":"2026040314280861500_ref250","unstructured":"M.\n              Sogrine\n            , T.Kechadi, and N.Kushmerick, \u201cLatent semantic indexing for text database selection,\u201d in Proceedings of the SIGIR 2005 Workshop on Heterogeneous and Distributed Information Retrieval, pp. 12\u201319, 2005. URL http:\/\/hdir2005.isti.cnr.it\/index.html."},{"issue":"5","key":"2026040314280861500_ref251","doi-asserted-by":"crossref","first-page":"1379","DOI":"10.1016\/j.ipm.2005.11.001","article-title":"A study of results overlap and uniqueness among major web search engines","volume":"42","author":"Spink","year":"2006","journal-title":"Information Processing and Management"},{"issue":"1","key":"2026040314280861500_ref252","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1109\/TNET.2002.808407","article-title":"Chord: A scalable peer-to-peer lookup protocol for internet applications","volume":"11","author":"Stoica","year":"2003","journal-title":"IEEE\/ACM Transactions on Networking"},{"key":"2026040314280861500_ref253","first-page":"417","volume":"132","author":"Sugiura","journal-title":"Query routing for web search engines: Architectures and experiments"},{"key":"2026040314280861500_ref254","first-page":"839","volume":"192","author":"Thomas","journal-title":"Generalising multiple capture-recapture to non-uniform sample sizes"},{"key":"2026040314280861500_ref255","volume-title":"PhD thesis","author":"Thomas","year":"2008"},{"key":"2026040314280861500_ref256","first-page":"503","volume":"15","author":"Thomas","journal-title":"Evaluating sampling methods for uncooperative collections"},{"key":"2026040314280861500_ref257","doi-asserted-by":"crossref","first-page":"136","DOI":"10.1145\/1414694.1414724","volume-title":"Proceedings of the second international Symposium on Information Interaction in Context","author":"Thomas","year":"2008"},{"issue":"5","key":"2026040314280861500_ref258","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1007\/s10791-009-9094-z","article-title":"Server selection methods in personal metasearch: a comparative empirical study","volume":"12","author":"Thomas","year":"2009","journal-title":"Information Retrieval"},{"key":"2026040314280861500_ref259","first-page":"419","volume":"3","author":"Thomas","journal-title":"SUSHI: scoring scaled samples for server selection"},{"key":"2026040314280861500_ref260","first-page":"540","volume-title":"Learning collection fusion strategies for information retrieval","author":"Towell","year":"1995"},{"key":"2026040314280861500_ref261","first-page":"127","volume":"200","author":"Tsikrika","journal-title":"Merging techniques for performing data fusion on the web"},{"key":"2026040314280861500_ref262","volume-title":"PhD thesis","author":"Turtle","year":"1991"},{"key":"2026040314280861500_ref263","first-page":"1","volume-title":"Inference networks for document retrieval","author":"Turtle","year":"1990"},{"key":"2026040314280861500_ref264","volume-title":"PhD thesis","author":"Vogt","year":"1999"},{"issue":"3","key":"2026040314280861500_ref265","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1023\/A:1009980820262","article-title":"Fusion via a linear combination of scores","volume":"1","author":"Vogt","year":"1999","journal-title":"Information Retrieval"},{"key":"2026040314280861500_ref266","first-page":"172","volume":"91","author":"Voorhees","journal-title":"Learning collection fusion strategies"},{"key":"2026040314280861500_ref267","first-page":"93","volume-title":"Multiple search engines in database merging","author":"Voorhees","year":"1997"},{"key":"2026040314280861500_ref268","first-page":"420","volume-title":"Proceedings of the 30th International Conference on Very Large Data Bases","author":"Wang","year":"2004"},{"key":"2026040314280861500_ref269","doi-asserted-by":"crossref","DOI":"10.1145\/1242572","volume-title":"Proceedings of the 16th International Conference on World Wide Web","author":"Williamson","year":"2007"},{"key":"2026040314280861500_ref270","first-page":"1171","volume":"144","author":"Wu","journal-title":"Multi-objective resource selection in distributed information retrieval"},{"issue":"supp01","key":"2026040314280861500_ref271","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1142\/S0218488503002284","article-title":"Distributed information retrieval: A multi-objective resource selection approach","volume":"11","author":"Wu","year":"2003","journal-title":"International Journal of Uncertainty, Fuzziness Knowledge-Based Systems"},{"key":"2026040314280861500_ref272","first-page":"1067","volume-title":"Shadow document methods of resutls merging","author":"Wu","year":"2004"},{"issue":"4","key":"2026040314280861500_ref273","doi-asserted-by":"crossref","first-page":"899","DOI":"10.1016\/j.ipm.2005.08.004","article-title":"Performance prediction of data fusion for information retrieval","volume":"42","author":"Wu","year":"2006","journal-title":"Information Processing and Management"},{"key":"2026040314280861500_ref274","first-page":"386","volume":"224","author":"Wu","journal-title":"Towards a highly-scalable and effective metasearch engine"},{"key":"2026040314280861500_ref275","first-page":"112","volume":"73","author":"Xu","journal-title":"Effective retrieval with distributed collections"},{"key":"2026040314280861500_ref276","first-page":"254","volume":"106","author":"Xu","journal-title":"Cluster-based language models for distributed retrieval"},{"key":"2026040314280861500_ref277","first-page":"4","volume-title":"Query expansion using local and global document analysis","author":"Xu","year":"1996"},{"key":"2026040314280861500_ref278","first-page":"789","volume":"151","author":"Xu","journal-title":"Estimating collection size with logistic regression"},{"key":"2026040314280861500_ref279","first-page":"143","volume-title":"Proceedings of the Third International Conference on Information Technology and Applications, vol. I","author":"Yang","year":"2005"},{"issue":"1","key":"2026040314280861500_ref280","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1007\/s10791-005-5719-z","article-title":"Two-stage statistical language models for text database selection","volume":"9","author":"Yang","year":"2006","journal-title":"Information Retrieval"},{"issue":"6","key":"2026040314280861500_ref281","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1109\/TKDE.2002.1047772","article-title":"A methodology to retrieve text documents from multiple databases","volume":"14","author":"Yu","year":"2002","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"2026040314280861500_ref282","first-page":"217","volume":"102","author":"Yu","journal-title":"Efficient and effective metasearch for a large number of text databases"},{"issue":"2","key":"2026040314280861500_ref283","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1145\/376284.375684","article-title":"Efficient and effective metasearch for text databases incorporating linkages among documents","volume":"30","author":"Yu","year":"2001","journal-title":"SIGMOD Records"},{"issue":"4","key":"2026040314280861500_ref284","doi-asserted-by":"crossref","first-page":"548","DOI":"10.1109\/69.536248","article-title":"WISE: A world wide web resource database system","volume":"8","author":"Yuwono","year":"1996","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"2026040314280861500_ref285","first-page":"41","volume-title":"Proceedings of the Fifth International Conference on Database Systems for Advanced Applications, vol. 6 of Advanced Database Research and Development Series","author":"Yuwono","year":"1997"},{"key":"2026040314280861500_ref286","doi-asserted-by":"crossref","first-page":"1361","DOI":"10.1016\/S1389-1286(99)00054-7","article-title":"Grouper: a dynamic clustering interface to web search results","volume":"31","author":"Zamir","year":"1999","journal-title":"Computer Networks and ISDN Systems"},{"key":"2026040314280861500_ref287","first-page":"66","volume":"88","author":"Zhao","journal-title":"Fully automatic wrapper generation for search engines"},{"key":"2026040314280861500_ref288","first-page":"989","volume-title":"Proceedings of the 30th International Conference on Very Large Data Bases","author":"Zhao","year":"2006"},{"key":"2026040314280861500_ref289","first-page":"884","volume-title":"Proceedings of the 13th Annual International ACM SIGKDD Conference on Knowledge Discovery and Data Mining","author":"Zhao"},{"key":"2026040314280861500_ref290","first-page":"74","volume-title":"Collection selection via lexicon inspection","author":"Zobel","year":"1997"}],"container-title":["Foundations and Trends\u00ae in Information Retrieval"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/ftinr\/article-pdf\/5\/1\/1\/11486507\/1500000010en.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/www.emerald.com\/ftinr\/article-pdf\/5\/1\/1\/11486507\/1500000010en.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T18:28:46Z","timestamp":1775240926000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.emerald.com\/ftinr\/article\/5\/1\/1\/1356736\/Federated-Search"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,3,7]]},"references-count":290,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,3,7]]}},"URL":"https:\/\/doi.org\/10.1561\/1500000010","relation":{},"ISSN":["1554-0669","1554-0677"],"issn-type":[{"value":"1554-0669","type":"print"},{"value":"1554-0677","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,3,7]]}}}