{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,4]],"date-time":"2025-03-04T05:13:07Z","timestamp":1741065187848,"version":"3.38.0"},"reference-count":29,"publisher":"SAGE Publications","issue":"4","license":[{"start":{"date-parts":[[2003,11,1]],"date-time":"2003-11-01T00:00:00Z","timestamp":1067644800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2003,11]]},"abstract":"<jats:p> Distributed query processing (DQP) has been widely used in data intensive applications where data of relevance to users are stored at multiple locations. This paper argues: (i) that DQP can be important in the Grid, as a means of providing high-level, declarative languages for integrating data access and analysis; and (ii) that the Grid provides resource management facilities that are useful to developers of DQP systems. As well as discussing and illustrating how DQP technologies can be deployed within the Grid, the paper describes Polar*, a prototype implementation of a DQP system running over Globus. Polar* can handle complex data by adopting the ODMG object model and its query language OQL, which supports the invocation of user-defined operations. The Globus components are accessed through the MPICH-G interface rather than in a lower level way. A case study from bioinformatics is used throughout the paper, to show the benefits of the approach. <\/jats:p>","DOI":"10.1177\/10943420030174002","type":"journal-article","created":{"date-parts":[[2003,11,11]],"date-time":"2003-11-11T23:39:30Z","timestamp":1068593970000},"page":"353-367","source":"Crossref","is-referenced-by-count":22,"title":["Distributed Query Processing on the Grid"],"prefix":"10.1177","volume":"17","author":[{"given":"Jim","family":"Smith","sequence":"first","affiliation":[]},{"given":"Paul","family":"Watson","sequence":"additional","affiliation":[{"name":"DEPARTMENT OF COMPUTING SCIENCE, UNIVERSITY OF NEWCASTLE UPON TYNE, UK"}]},{"given":"Anastasios","family":"Gounaris","sequence":"additional","affiliation":[]},{"given":"Norman W.","family":"Paton","sequence":"additional","affiliation":[]},{"given":"Alvaro A.A.","family":"Fernandes","sequence":"additional","affiliation":[]},{"given":"Rizos","family":"Sakellariou","sequence":"additional","affiliation":[{"name":"DEPARTMENT OF COMPUTER SCIENCE, UNIVERSITY OF MANCHESTER, UK"}]}],"member":"179","published-online":{"date-parts":[[2003,11,1]]},"reference":[{"key":"atypb1","doi-asserted-by":"publisher","DOI":"10.1016\/S0022-2836(05)80360-2"},{"key":"atypb2","doi-asserted-by":"publisher","DOI":"10.1038\/75556"},{"key":"atypb3","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/28.1.45"},{"key":"atypb4","doi-asserted-by":"crossref","unstructured":"Carey, M., DeWitt, D.J., Franklin, M., Hall, N., McAuliffe, M., Naughton, J., Schuh, D., Solomon, M., Tan, C., Tsatalos, O., White, S., and Zwilling, M., 1994. Shoring up persistent applications . In R. Snodgrass and M. Winslett, editors, Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, Minneapolis, MN, 24\u201327 May 1994, ACM Press, New York, pp. 383\u2013394 .","DOI":"10.1145\/191839.191915"},{"key":"atypb5","unstructured":"Cattell, R.G.G., and Barry, D.K. 2000. The Object Database Standard: ODMG 3.0, Morgan Kaufmann, San Mateo, CA ."},{"key":"atypb6","doi-asserted-by":"crossref","unstructured":"Cornell, M., Paton, N.W., Wu, S., Goble, C.A., Miller, C.J., Kirby, P., Eilbeck, K., Brass, A., Hayes, A., and Oliver, S.G., 2001. GIMS\u2013a data warehouse for storage and analysis of genome sequence and functional data . In Proceedings of the 2nd IEEE Symposium on Bioinformatics and Bioengineering (BIBE),IEEE Press, Piscataway, NJ, pp. 15\u201322 .","DOI":"10.1109\/BIBE.2001.974407"},{"key":"atypb7","doi-asserted-by":"publisher","DOI":"10.1147\/sj.402.0512"},{"key":"atypb8","doi-asserted-by":"crossref","unstructured":"DeWitt, D., and Gray, J. 1992. Parallel database systems: The future of high performance database systems . Communication of ACM (CACM) 35(6): 85\u201398 .","DOI":"10.1145\/129888.129894"},{"key":"atypb9","unstructured":"Dinda, P., and Plale, B., 2001. A unified relational approach to Grid information services. Technical Report GWDGIS-012-1,Global Grid Forum."},{"key":"atypb10","doi-asserted-by":"crossref","unstructured":"Fegaras, L., and Maier, D. 2000. Optimizing object queries using an effective calculus . ACM Transactions on Database Systems 24(4): 457\u2013516 .","DOI":"10.1145\/377674.377676"},{"key":"atypb11","doi-asserted-by":"crossref","unstructured":"Foster, I., and Karonis, N.T., 1998. A Grid-enabled MPI: Message passing in heterogeneous distributed computing systems . In Proceedings of Supercomputing (SC), IEEE Computer Society, Los Alamitos, CA. Online at www.supercomp.org\/sc98\/proceedings\/.","DOI":"10.1109\/SC.1998.10051"},{"key":"atypb12","doi-asserted-by":"crossref","unstructured":"Foster, I., Kesselman, C., Nick, J., and Tuecke, S. 2002. Grid services for distributed system integration . IEEE Computer 35: 37\u201346 .","DOI":"10.1109\/MC.2002.1009167"},{"key":"atypb13","doi-asserted-by":"publisher","DOI":"10.1023\/A:1008683107812"},{"key":"atypb14","doi-asserted-by":"crossref","unstructured":"Graefe, G. 1990. Encapsulation of parallelism in the Volcano query processing system. In ACM SIGMOD, pp. 102\u2013111.","DOI":"10.1145\/93605.98720"},{"key":"atypb15","doi-asserted-by":"publisher","DOI":"10.1145\/152610.152611"},{"key":"atypb16","unstructured":"Haas, L., Kossmann, D., Wimmers, E.L., and Yang, J., 1997. Optimizing queries across diverse data sources . In Proceedings of the VLDB, Morgan-Kaufmann, San Mateo, CA, pp. 276\u2013285 ."},{"key":"atypb17","unstructured":"Hasan, W., and Motwani, R., 1995. Coloring away communication in parallel query optimization . In Umeshwar Dayal, Peter M. D. Gray, and Shojiro Nishio, editors, VLDB\u201995, Proceedings of 21th International Conference on Very Large Data Bases, 11\u201315 September 1995, Zurich, Switzerland, Morgan Kaufmann, San Mateo, CA, pp. 239\u2013250 ."},{"key":"atypb18","doi-asserted-by":"publisher","DOI":"10.1145\/234889.234894"},{"key":"atypb19","doi-asserted-by":"publisher","DOI":"10.1007\/BF01228709"},{"key":"atypb20","doi-asserted-by":"publisher","DOI":"10.1145\/371578.371598"},{"key":"atypb21","unstructured":"Ozsu, M.T., and Valduriez, P., editors, 1999. Principles of Distributed Database Systems, 2nd edition, Prentice-Hall, Englewood Cliffs, NJ ."},{"key":"atypb22","unstructured":"Rahm, E., and Marek, R., 1995. Dynamic multi-resource load balancing in parallel database systems . In Proceedings of the 21st VLDB Conference, pp. 395\u2013406 ."},{"key":"atypb23","doi-asserted-by":"crossref","unstructured":"Sampaio, S.F.M., Paton, N.W., Watson, P., and Smith, J., 1999. A parallel algebra for object databases . In Proceedings of the 10th International Workshop on Database and Expert Systems Applications, Florence, Italy, September, IEEE Computer Society, Los Alamitos, CA, pp. 56\u201360 .","DOI":"10.1109\/DEXA.1999.795145"},{"key":"atypb24","doi-asserted-by":"crossref","unstructured":"Sampaio, S.F.M., Smith, J., Paton, N.W., and Watson, P., 2001. An experimental performance evaluation of join algorithms for parallel object databases . In R. Sakellariou et al., editors, Proceedings of the 7th International Euro-Par Conference, Springer-Verlag, Berlin, pp. 280\u2013291 .","DOI":"10.1007\/3-540-44681-8_41"},{"key":"atypb25","doi-asserted-by":"crossref","unstructured":"Smith, J., Sampaio, S.F.M., Watson, P., and Paton, N.W., 2000. Polar: An architecture for a parallel ODMG compliant object database . In Conference on Information and Knowledge Management (CIKM),ACM press, New York, pp. 352\u2013359 .","DOI":"10.1145\/354756.354840"},{"key":"atypb26","unstructured":"Snir, M., Otto, S., Huss-Lederman, S., Walker, D., and Dongarra, J. 1998. MPI\u2013The Complete Reference, MIT Press, Cambridge, MA ."},{"key":"atypb27","doi-asserted-by":"crossref","unstructured":"Szalay, A., Kunszt, P.Z., Thakar, A., Gray, J., and Slut, D.R., 2000. Designing and mining multi-terabyte astronomy archives: The SLOAN digital sky survey . In Proceedings of ACM SIGMOD, ACM Press, New York, pp. 451\u2013462 .","DOI":"10.1145\/335191.335439"},{"key":"atypb28","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.574"},{"key":"atypb29","unstructured":"Watson, P., 2001. Databases and the Grid. Technical Report CS-TR-755,University of Newcastle."}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420030174002","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/10943420030174002","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,3]],"date-time":"2025-03-03T04:19:51Z","timestamp":1740975591000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/10943420030174002"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2003,11]]},"references-count":29,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2003,11]]}},"alternative-id":["10.1177\/10943420030174002"],"URL":"https:\/\/doi.org\/10.1177\/10943420030174002","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"type":"print","value":"1094-3420"},{"type":"electronic","value":"1741-2846"}],"subject":[],"published":{"date-parts":[[2003,11]]}}}