{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,28]],"date-time":"2025-02-28T05:37:22Z","timestamp":1740721042089,"version":"3.38.0"},"reference-count":50,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2010,11,18]],"date-time":"2010-11-18T00:00:00Z","timestamp":1290038400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J Grid Computing"],"published-print":{"date-parts":[[2011,9]]},"DOI":"10.1007\/s10723-010-9172-x","type":"journal-article","created":{"date-parts":[[2010,11,17]],"date-time":"2010-11-17T10:29:10Z","timestamp":1289989750000},"page":"325-344","source":"Crossref","is-referenced-by-count":5,"title":["A Robust and Efficient Message Passing Library for Volunteer Computing Environments"],"prefix":"10.1007","volume":"9","author":[{"given":"Rakhi","family":"Anand","sequence":"first","affiliation":[]},{"given":"Troy","family":"LeBlanc","sequence":"additional","affiliation":[]},{"given":"Edgar","family":"Gabriel","sequence":"additional","affiliation":[]},{"given":"Jaspal","family":"Subhlok","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2010,11,18]]},"reference":[{"key":"9172_CR1","unstructured":"Top 100 multi-project BOINC participants. http:\/\/boinc.berkeley.edu\/chart_list.php (2010)"},{"key":"9172_CR2","unstructured":"Agbaria, A., Friedman, R.: Starfish: Fault-tolerant dynamic MPI programs on clusters of workstations. In: 8th IEEE International Symposium on High Performance Distributed Computing (1999)"},{"key":"9172_CR3","unstructured":"Amazon webservices: Amazon Elastic Compute Cloud (Amazon EC2). http:\/\/www.amazon.com\/gp\/browse.html?node=201590011 (2008)"},{"key":"9172_CR4","unstructured":"Anand, R., Gabriel, E., Subhlok, J.: Communication target selection for replicated MPI processes. In: Recent Advances in the Message Passing Interface. LNCS, 6305, pp. 198\u2013207. Stuttgart, Germany (2010)"},{"key":"9172_CR5","unstructured":"Anderson, D.: Boinc: a system for public-resource computing and storage. In: Fifth IEEE\/ACM International Workshop on Grid Computing (2004)"},{"key":"9172_CR6","doi-asserted-by":"crossref","unstructured":"Anderson, D., Fedak, G.: The computation and storage potential of volunteer computing. In: Sixth IEEE International Symposium on Cluster Computing and the Grid (2006)","DOI":"10.1109\/CCGRID.2006.101"},{"key":"9172_CR7","unstructured":"Bailey, D.H., Harris, T., Saphir, W.C., van der Wijngaart, R.F., Woo, A.C.: The NAS Parallel Benchmarks 2.0. NAS Technical Report NAS-95-020, NASA Ames Research Center, Moffett Field, CA (1995)"},{"key":"9172_CR8","unstructured":"Barak, A.: Scalable cluster computing with mosix for linux. In: Proceedings of Linux Expo 99, pp. 95\u2013100 (1999)"},{"key":"9172_CR9","unstructured":"Batchu, R., Neelamegam, J., Cui, Z., Beddhua, M., Skjellum, A., Dandass, Y., Apte, M.: Mpi\/ft: Architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing. In: Proceedings of the 1 IEEE International Symposium of Cluster Computing and the Grid (2001)"},{"key":"9172_CR10","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1145\/1048935.1050176","volume-title":"SC \u201903: Proceedings of the 2003 ACM\/IEEE conference on Supercomputing","author":"A Bouteiller","year":"2003","unstructured":"Bouteiller, A., Cappello, F., Herault, T., Krawezik, G., Lemarinier, P., Magniette, F.: Mpich-v2: a fault tolerant mpi for volatile nodes based on pessimistic sender based message logging. In: SC \u201903: Proceedings of the 2003 ACM\/IEEE conference on Supercomputing, p.\u00a025. IEEE Computer Society, Washington, DC, USA (2003)"},{"issue":"5","key":"9172_CR11","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1016\/S0743-7315(03)00006-6","volume":"63","author":"A Chien","year":"2003","unstructured":"Chien, A., Calder, B., Elbert, S., Bhatia, K.: Entropia: architecture and performance of an enterprise desktop Grid system. J. Parallel Distrib. Comput. 63(5), 597\u2013610 (2003)","journal-title":"J. Parallel Distrib. Comput."},{"key":"9172_CR12","unstructured":"Duarte, A., Rexachs, D., Luque, E.: An intelligent management of fault tolerance in cluster using RADICMPI. In: Mohr, B., Tr\u00e4ff, J.L., Worringen, J., Dongarra, J. (eds.) Recent Advances in Parallel Virtual Machine and Message Passing Interface. LNCS, 4192, pp. 150\u2013157, Springer (2006)"},{"key":"9172_CR13","doi-asserted-by":"crossref","unstructured":"Duell, J., Hargrove, P., Roman, E.: The design and implementation of Berkeley Lab\u2019s linux checkpoint\/restart. In: Berkeley Lab Technical Report (publication LBNL-54941) (2002)","DOI":"10.2172\/793773"},{"key":"9172_CR14","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1177\/1094342005056137","volume":"19","author":"GE Fagg","year":"2005","unstructured":"Fagg, G.E., Gabriel, E., Chen, Z., Angskun, T., Bosilca, G., Pjesivac-Grbovic, J., Dongarra, J.J.: Process fault-tolerance: semantics, design and applications f or high performance computing. Int. J. High Perform. Comput. Appl. 19, 465\u2013477 (2005)","journal-title":"Int. J. High Perform. Comput. Appl."},{"key":"9172_CR15","doi-asserted-by":"crossref","unstructured":"Fagg, G.E., London, K.S., Dongarra, J.J.: MPI_Connect: managing heterogeneous MPI applications interoperation and process control. In: Alexandrov, V., Dongarra, J. (eds.) Recent Advances in Parallel Virtual Machine and Message Passing Interface (5th European PVM\/MPI Users\u2019 Group Meeting). Lecture Notes in Computer Science, vol. 1497, pp. 93\u201396. Springer (1998)","DOI":"10.1007\/BFb0056563"},{"key":"9172_CR16","unstructured":"Ferreira, K., Riesen, R., Oldfield, R., Stearly, J., Laros, J., Redretti, K., Kordenbrock, T., Brightwell, R.: Increasing Fault Resiliency in a Message-Passing Environment. Tech. rep., Sandia National Laboratories (2009)"},{"key":"9172_CR17","unstructured":"Gabriel, E.: Influence of the progress engine on the performance of asynchronous communication libraries. In: Technical Report UH-CS-02-10, Dept. of Computer Science, University of Houston (2010)"},{"key":"9172_CR18","doi-asserted-by":"crossref","unstructured":"Gabriel, E., Fagg, G.E., Bosilca, G., Angskun, T., Dongarra, J.J., Squyres, J.M., Sahay, V., Kambadur, P., Barrett, B., Lumsdaine, A., Castain, R.H., Daniel, D.J., Graham, R.L., Woodall, T.S.: Open MPI: goals, concept, and design of a next generation MPI implementation. In: Proceedings, 11th European PVM\/MPI Users\u2019 Group Meeting, pp. 97\u2013104. Budapest, Hungary (2004)","DOI":"10.1007\/978-3-540-30218-6_19"},{"key":"9172_CR19","doi-asserted-by":"crossref","unstructured":"Genaud, S., Rattanapoka, C.: Large-scale experiment of co-allocation strategies for peer-to-peer supercomputing in p2p-mpi. In: Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium, pp. 1\u20138 (2008)","DOI":"10.1109\/IPDPS.2008.4536212"},{"key":"9172_CR20","unstructured":"Google Press Center: Google and IBM Announce University Initiative to Address Internet-Scale Computing Challenges. http:\/\/www.google.com\/intl\/en\/press\/pressrel\/20071008_ibm_univ.html (2007)"},{"key":"9172_CR21","doi-asserted-by":"crossref","unstructured":"Hoefler, T., Gottschling, P., Lumsdaine, A.: Leveraging non-blocking collective communication in high-performance applications. In: Proceedings of the Twentieth Annual Symposium on Parallelism in Algorithms and Architectures, SPAA\u201908, pp. 113\u2013115. Association for Computing Machinery (ACM) (2008)","DOI":"10.1145\/1378533.1378554"},{"key":"9172_CR22","doi-asserted-by":"crossref","unstructured":"Huang, C., Lawlor, O., Kal\u00e9, L.V.: Adaptive MPI. In: Languages and Compilers for Parallel Computing. Lecture Notes in Computer Science, vol. 2958, pp. 306\u2013322. Springer (2004)","DOI":"10.1007\/978-3-540-24644-2_20"},{"key":"9172_CR23","doi-asserted-by":"crossref","unstructured":"Hursey, J., Squyres, J.M., Mattox, T.I., Lumsdaine, A.: The design and implementation of checkpoint\/restart process fault tolerance for open MPI. In: Proceedings of the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE Computer Society (2007)","DOI":"10.1109\/IPDPS.2007.370605"},{"key":"9172_CR24","doi-asserted-by":"crossref","unstructured":"Imamura, T., Tsujita, Y., Koide, H., Takemiya, H.: An architecture of Stampi: MPI library on a cluster of parallel computers. In: Dongarra, J., Kacsuk, P., Podhorszki, N. (eds.) Recent Advances in Parallel Virutal Machine and Message Passing Interface (7th European PVM\/MPI Users\u2019 Group Meeting). Lecture Notes In Computer Science, vol. 1908, pp. 200\u2013207. Springer (2000)","DOI":"10.1007\/3-540-45255-9_29"},{"key":"9172_CR25","doi-asserted-by":"crossref","unstructured":"Iosup, A., Jan, M., Sonmez, O., Epema, D.: On the dynamic resource availability in Grids. In: Proc. of the 8th IEEE\/ACM International Conference on Grid Computing (Grid 2007), pp. 26\u201333. Austin, TX, USA (2007)","DOI":"10.1109\/GRID.2007.4354112"},{"issue":"4","key":"9172_CR26","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1007\/s10723-009-9139-y","volume":"7","author":"P Kacsuk","year":"2009","unstructured":"Kacsuk, P., Kovacs, J., Farkas, Z., Marosi, A.C., Gombas, G., Balaton, Z.: SZTAKI Desktop Grid (SZDG): a flexible and scalable desktop Grid system. J. Grid Computing 7(4), 439\u2013461 (2009)","journal-title":"J. Grid Computing"},{"issue":"5","key":"9172_CR27","doi-asserted-by":"crossref","first-page":"551","DOI":"10.1016\/S0743-7315(03)00002-9","volume":"63","author":"N Karonis","year":"2003","unstructured":"Karonis, N., Toonen, B., Foster, I.: Mpich-g2: A Grid-enabled implementation of the message passing interface. J. Parallel Distrib. Comput. 63(5), 551\u2013563 (2003)","journal-title":"J. Parallel Distrib. Comput."},{"issue":"2","key":"9172_CR28","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1023\/B:GRID.0000024071.12177.91","volume":"1","author":"R Keller","year":"2003","unstructured":"Keller, R., Gabriel, E., Krammer, B., M\u00fcller, M.S., Resch, M.M.: Efficient execution of MPI applications on the Grid: porting and optimization issues. J. Grid Computing 1(2), 133\u2013149 (2003)","journal-title":"J. Grid Computing"},{"key":"9172_CR29","unstructured":"Kerbyson, D., Barker, K.: Automatic identification of application communication patterns via templates. In: Proc. 18th International Conference on Parallel and Distributed Computing Systems (PDCS-2005), Las Vegas, NV (2005)"},{"key":"9172_CR30","doi-asserted-by":"crossref","unstructured":"Khalili, O., He, J., Olschanowsky, C., Snavely, A., Casanova, H.: Measuring the performance and reliability of production computational Grids. In: Proceedings of the 7th IEEE\/ACM International Conference on Grid Computing (2006)","DOI":"10.1109\/ICGRID.2006.311028"},{"issue":"8","key":"9172_CR31","first-page":"131","volume":"34","author":"T Kielmann","year":"1999","unstructured":"Kielmann, T., Hofman, R.F.H., Bal, H.E., Plaat, A., Bhoedjang, R.A.F.: MagPIe: MPI\u2019s collective communication operations for clustered wide area systems. ACM SIGPLAN Symp. Princ. Pract. Par. Prog. (PPoPP\u201999) 34(8), 131\u2013140 (1999)","journal-title":"ACM SIGPLAN Symp. Princ. Pract. Par. Prog. (PPoPP\u201999)"},{"issue":"7","key":"9172_CR32","doi-asserted-by":"crossref","first-page":"888","DOI":"10.1016\/j.future.2006.11.001","volume":"23","author":"D Kondo","year":"2007","unstructured":"Kondo, D., Fedak, G., Cappello, F., Chien, A.A., Casanova, H.: Characterizing resource availability in enterprise desktop Grids. Future Gener. Comput. Syst. 23(7), 888\u2013903 (2007)","journal-title":"Future Gener. Comput. Syst."},{"key":"9172_CR33","unstructured":"Kondo, D., Taufer, M., Brooks, C., Casanova, H., Chien, A.: Characterizing and evaluating desktop Grids: an empirical study. In: International Parallel and Distributed Processing Symposium (IPDPS\u201904), citeseer.ist.psu.edu\/kondo04characterizing.html (2004)"},{"key":"9172_CR34","doi-asserted-by":"crossref","unstructured":"Kondo, D., Taufer, M., Brooks, C., Casanova, H., Chien, A.: Characterizing and evaluating desktop Grids: an Empirical Study, Proceedings of the International Parallel and Distributed Processing Symposium (IPDPS), p. 26 (2004)","DOI":"10.1109\/IPDPS.2004.1302936"},{"key":"9172_CR35","unstructured":"LeBlanc, T., Anand, R., Gabriel, E., Subhlok, J.: VolpexMPI: an MPI Library for execution of parallel applications on volatile nodes. In: Ropo, M., Westerholm, J., Dongarra, J. (eds.) Recent Advances in Parallel Virtual Machine and Message Passing Interface. LNCS, 5759, pp. 124\u2013134. Espoo, Finnland (2009)"},{"key":"9172_CR36","unstructured":"Li, J., Deshpande, A., Srinivasan, J., Ma, X.: Energy and performance impact of aggressive volunteer computing with multi-core computers. In: 2009 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), London, UK (2009)"},{"issue":"5","key":"9172_CR37","doi-asserted-by":"crossref","first-page":"663","DOI":"10.1016\/j.jpdc.2007.09.004","volume":"68","author":"H Ltaief","year":"2008","unstructured":"Ltaief, H., Gabriel, E., Garbey, M.: Fault tolerant algorithms for heat transfer problems. J. Parallel Distrib. Comput 68(5), 663\u2013677 (2008)","journal-title":"J. Parallel Distrib. Comput"},{"key":"9172_CR38","unstructured":"Message Passing Interface Forum: MPI: A Message Passing Interface Standard. http:\/\/www.mpi-forum.org (1995)"},{"key":"9172_CR39","unstructured":"Rao, S., Alvisi, L., Vin, H.M.: Egida: an extensible toolkit for low-overhead fault-tolerance. In: Proceedings of the Twenty-Ninth Annual International Symposium on Fault-Tolerant Computing, FTCS 99. IEEE Computer Society (1999)"},{"key":"9172_CR40","doi-asserted-by":"crossref","unstructured":"Ren, X., Eigenmann, R.: iShare\u2014open internet sharing built on peer-to-peer and web. In: European Grid Conference, Amsterdam, Netherlands (2005)","DOI":"10.1007\/11508380_114"},{"issue":"4","key":"9172_CR41","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1177\/1094342005056139","volume":"19","author":"S Sankaran","year":"2005","unstructured":"Sankaran, S., Squyres, J.M., Barrett, B., Lumsdaine, A., Duell, J., Hargrove, P., Roman, E.: The LAM\/MPI checkpoint\/restart framework: system-initiated check pointing. Int. J. High Perform. Comput. Appl. 19(4), 479\u2013493 (2005)","journal-title":"Int. J. High Perform. Comput. Appl."},{"key":"9172_CR42","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1109\/DSN.2006.5","volume-title":"Proceedings of the International Conference on Dependable Systems and Networks","author":"B Schroeder","year":"2006","unstructured":"Schroeder, B., Gibson, G.A.: A large-scale study of failures in high-performance computing systems. In: Proceedings of the International Conference on Dependable Systems and Networks, pp. 249\u2013258. IEEE Computer Society, Washington, DC, USA (2006)"},{"key":"9172_CR43","unstructured":"Stellner, G.: Co-Check: checkpointing and process migration for MPI. In: Proceedings of the 10th International Parallel Processing Symposium (IPPS \u201996) (1996)"},{"key":"9172_CR44","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1016\/S0009-2614(99)01123-9","volume":"314","author":"Y Sugita","year":"1999","unstructured":"Sugita, Y., Okamoto, Y.: Replica-exchange molecular dynamics method for protein folding. Chem. Phys. Lett. 314, 141\u2013151 (1999). doi: 10.1016\/S0009-2614(99)01123-9","journal-title":"Chem. Phys. Lett."},{"key":"9172_CR45","unstructured":"Tabe, T., Stout, Q.: The Use of the MPI Communication Library in the NAS Parallel Benchmark. Tech. Rep. CSE-TR-386-99, Department of Computer Science, University of Michigan (1999)"},{"key":"9172_CR46","unstructured":"Taufer, M., Anderson, D., Cicotti, P., III, C.B.: Homogeneous technique to ensure integrity of molecular simulation results using public resources. In: Proceedings of the 14th Heterogeneous Computing Workshop (HCW 2005) (2004)"},{"issue":"2\u20134","key":"9172_CR47","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1002\/cpe.938","volume":"17","author":"D Thain","year":"2005","unstructured":"Thain, D., Tannenbaum, T., Livny, M.: Distributed computing in practice: the condor experience. Concurrency-Pract. Ex. 17(2\u20134), 323\u2013356 (2005)","journal-title":"Concurrency-Pract. Ex."},{"key":"9172_CR48","doi-asserted-by":"crossref","unstructured":"Vo, A., Vakkalanka, S., DeLisi, M., Gopalakrishnan, G., Kirby, R.M., Thakur, R.: Formal verification of practical mpi programs. In: Principles and Practices of Parallel Programming (PPoPP), pp. 261\u2013269 (2009)","DOI":"10.1145\/1594835.1504214"},{"key":"9172_CR49","unstructured":"Vyas,D., Subhlok, J.: Volunteer computing on clusters. In: 12th Workshop on Job Scheduling Strategies for Parallel Processing. Saint Malo, France (2006). Published as Springer Verlag Lecture Notes in Computer Science, vol. 4376\/2007, pp. 161\u2013175, May 2007"},{"key":"9172_CR50","unstructured":"Zheng, R., Subhlok, J.: A Quantatitive Comparison of Checkpoint with Restart and Replication in Volatile Environments. Technical report UH-CS-08-06, Dept. of Computer Science, University of Houston (2008)"}],"container-title":["Journal of Grid Computing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10723-010-9172-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10723-010-9172-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10723-010-9172-x","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,27]],"date-time":"2025-02-27T22:00:41Z","timestamp":1740693641000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10723-010-9172-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,11,18]]},"references-count":50,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2011,9]]}},"alternative-id":["9172"],"URL":"https:\/\/doi.org\/10.1007\/s10723-010-9172-x","relation":{},"ISSN":["1570-7873","1572-9184"],"issn-type":[{"type":"print","value":"1570-7873"},{"type":"electronic","value":"1572-9184"}],"subject":[],"published":{"date-parts":[[2010,11,18]]}}}