{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:34:15Z","timestamp":1750221255579,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":44,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,6,11]],"date-time":"2018-06-11T00:00:00Z","timestamp":1528675200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,6,11]]},"DOI":"10.1145\/3208040.3208054","type":"proceedings-article","created":{"date-parts":[[2018,6,11]],"date-time":"2018-06-11T12:36:20Z","timestamp":1528720580000},"page":"118-130","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["ADAPT"],"prefix":"10.1145","author":[{"given":"Xi","family":"Luo","sequence":"first","affiliation":[{"name":"University of Tennessee"}]},{"given":"Wei","family":"Wu","sequence":"additional","affiliation":[{"name":"Los Alamos National Laboratory"}]},{"given":"George","family":"Bosilca","sequence":"additional","affiliation":[{"name":"University of Tennessee"}]},{"given":"Thananon","family":"Patinyasakdikul","sequence":"additional","affiliation":[{"name":"University of Tennessee"}]},{"given":"Linnan","family":"Wang","sequence":"additional","affiliation":[{"name":"Brown University"}]},{"given":"Jack","family":"Dongarra","sequence":"additional","affiliation":[{"name":"University of Tennessee and Oak Ridge National Laboratory and University of Manchester, Manchester, UK"}]}],"member":"320","published-online":{"date-parts":[[2018,6,11]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2966884.2966912"},{"volume-title":"2006 IEEE International Conference on Cluster Computing. 1--12","author":"Beckman P.","key":"e_1_3_2_1_2_1","unstructured":"P. Beckman , K. Iskra , K. Yoshii , and S. Coghlan . 2006. The Influence of Operating Systems on the Performance of Collective Operations at Extreme Scale . In 2006 IEEE International Conference on Cluster Computing. 1--12 . P. Beckman, K. Iskra, K. Yoshii, and S. Coghlan. 2006. The Influence of Operating Systems on the Performance of Collective Operations at Extreme Scale. In 2006 IEEE International Conference on Cluster Computing. 1--12."},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the Linux Symposium. 371--386","author":"Bhattacharya Suparna","year":"2003","unstructured":"Suparna Bhattacharya , Steven Pratt , Badari Pulavarty , and Janet Morgan . 2003 . Asynchronous I\/O support in Linux 2.5 . In Proceedings of the Linux Symposium. 371--386 . Suparna Bhattacharya, Steven Pratt, Badari Pulavarty, and Janet Morgan. 2003. Asynchronous I\/O support in Linux 2.5. In Proceedings of the Linux Symposium. 371--386."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/PDP.2010.67"},{"volume-title":"CUDA Kernel Based Collective Reduction Operations on Large-scale GPU Clusters. In 2016 16th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. 726--735","author":"Chu C. H.","key":"e_1_3_2_1_5_1","unstructured":"C. H. Chu , K. Hamidouche , A. Venkatesh , A. A. Awan , and D. K. Panda . 2016 . CUDA Kernel Based Collective Reduction Operations on Large-scale GPU Clusters. In 2016 16th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. 726--735 . C. H. Chu, K. Hamidouche, A. Venkatesh, A. A. Awan, and D. K. Panda. 2016. CUDA Kernel Based Collective Reduction Operations on Large-scale GPU Clusters. In 2016 16th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. 726--735."},{"volume-title":"Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning. In 2017 46th International Conference on Parallel Processing (ICPP). 161--170","author":"Chu C. H.","key":"e_1_3_2_1_6_1","unstructured":"C. H. Chu , X. Lu , A. A. Awan , H. Subramoni , J. Hashmi , B. Elton , and D. K. Panda . 2017 . Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning. In 2017 46th International Conference on Parallel Processing (ICPP). 161--170 . C. H. Chu, X. Lu, A. A. Awan, H. Subramoni, J. Hashmi, B. Elton, and D. K. Panda. 2017. Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning. In 2017 46th International Conference on Parallel Processing (ICPP). 161--170."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/SCCC.2007.4"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1133373.1133410"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1182807.1182811"},{"volume-title":"2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis. 1--12","author":"Ferreira K. B.","key":"e_1_3_2_1_10_1","unstructured":"K. B. Ferreira , P. Bridges , and R. Brightwell . 2008. Characterizing application sensitivity to OS interference using kernel-level noise injection . In 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis. 1--12 . K. B. Ferreira, P. Bridges, and R. Brightwell. 2008. Characterizing application sensitivity to OS interference using kernel-level noise injection. In 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis. 1--12."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTER.2010.41"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2014.77"},{"key":"e_1_3_2_1_13_1","volume-title":"MPI: A Message-Passing Interface Standard","author":"Interface Forum Message Passing","year":"2012","unstructured":"Message Passing Interface Forum . 2012 . MPI: A Message-Passing Interface Standard , http:\/\/www.mpi-forum.org\/. (September 2012). Message Passing Interface Forum. 2012. MPI: A Message-Passing Interface Standard, http:\/\/www.mpi-forum.org\/. (September 2012)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2005.214"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CCGrid.2011.42"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-8191(06)80021-9"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/SC.2010.12"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2009.70"},{"volume-title":"2010 IEEE International Symposium on Parallel Distributed Processing, Workshops and Phd Forum (IPDPSW). 1--8.","author":"Kandalla K.","key":"e_1_3_2_1_19_1","unstructured":"K. Kandalla , H. Subramoni , A. Vishnu , and D. K. Panda . 2010. Designing topology-aware collective communication algorithms for large scale InfiniBand clusters: Case studies with Scatter and Gather . In 2010 IEEE International Symposium on Parallel Distributed Processing, Workshops and Phd Forum (IPDPSW). 1--8. K. Kandalla, H. Subramoni, A. Vishnu, and D. K. Panda. 2010. Designing topology-aware collective communication algorithms for large scale InfiniBand clusters: Case studies with Scatter and Gather. In 2010 IEEE International Symposium on Parallel Distributed Processing, Workshops and Phd Forum (IPDPSW). 1--8."},{"key":"e_1_3_2_1_20_1","volume-title":"IPDPS","author":"Karonis N. T.","year":"2000","unstructured":"N. T. Karonis , B. R. de Supinski , I. Foster , W. Gropp , E. Lusk , and J. Bresnahan . 2000. Exploiting hierarchy in parallel computer networks to optimize collective operation performance . In IPDPS 2000 . 377--384. N. T. Karonis, B. R. de Supinski, I. Foster, W. Gropp, E. Lusk, and J. Bresnahan. 2000. Exploiting hierarchy in parallel computer networks to optimize collective operation performance. In IPDPS 2000. 377--384."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/301104.301116"},{"volume-title":"Using Simulation to Evaluate the Performance of Resilience Strategies at Scale","author":"Levy Scott","key":"e_1_3_2_1_22_1","unstructured":"Scott Levy , Bryan Topp , Kurt B. Ferreira , Dorian Arnold , Torsten Hoefler , and Patrick Widener . 2014. Using Simulation to Evaluate the Performance of Resilience Strategies at Scale . Springer International Publishing , Cham , 91--114. Scott Levy, Bryan Topp, Kurt B. Ferreira, Dorian Arnold, Torsten Hoefler, and Patrick Widener. 2014. Using Simulation to Evaluate the Performance of Resilience Strategies at Scale. Springer International Publishing, Cham, 91--114."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2012.91"},{"volume-title":"Scheduling In-Situ Analytics in Next-Generation Applications. In 2016 16th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. 102--105","author":"Mondragon O. H.","key":"e_1_3_2_1_24_1","unstructured":"O. H. Mondragon , P. G. Bridges , S. Levy , K. B. Ferreira , and P. Widener . 2016 . Scheduling In-Situ Analytics in Next-Generation Applications. In 2016 16th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. 102--105 . O. H. Mondragon, P. G. Bridges, S. Levy, K. B. Ferreira, and P. Widener. 2016. Scheduling In-Situ Analytics in Next-Generation Applications. In 2016 16th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. 102--105."},{"key":"e_1_3_2_1_25_1","unstructured":"NVIDIA. 2016. NCCL. https:\/\/github.com\/NVIDIA\/nccl. (2016).  NVIDIA. 2016. NCCL. https:\/\/github.com\/NVIDIA\/nccl. (2016)."},{"volume-title":"Energy-Efficient Collective Reduce and Allreduce Operations on Distributed GPUs. In 2014 14th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. 483--492","author":"Oden L.","key":"e_1_3_2_1_26_1","unstructured":"L. Oden , B. Klenk , and H. Fr\u00f6ning . 2014 . Energy-Efficient Collective Reduce and Allreduce Operations on Distributed GPUs. In 2014 14th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. 483--492 . L. Oden, B. Klenk, and H. Fr\u00f6ning. 2014. Energy-Efficient Collective Reduce and Allreduce Operations on Distributed GPUs. In 2014 14th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing. 483--492."},{"key":"e_1_3_2_1_27_1","volume-title":"Flash: An efficient and portable Web server.","author":"Pai Vivek S.","year":"1999","unstructured":"Vivek S. Pai , Peter Druschel , and Willy Zwaenepoel . 1999 . Flash: An efficient and portable Web server. (1999). Vivek S. Pai, Peter Druschel, and Willy Zwaenepoel. 1999. Flash: An efficient and portable Web server. (1999)."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2014.32"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10586-007-0012-0"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.5555\/520549.822752"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2009.09.001"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTER.2011.67"},{"volume-title":"Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC '12)","author":"Subramoni H.","key":"e_1_3_2_1_33_1","unstructured":"H. Subramoni , S. Potluri , K. Kandalla , B. Barth , J. Vienne , J. Keasler , K. Tomko , K. Schulz , A. Moody , and D. K. Panda . 2012. Design of a Scalable InfiniBand Topology Service to Enable Network-topology-aware Placement of Processes . In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC '12) . Article 70, 12 pages. H. Subramoni, S. Potluri, K. Kandalla, B. Barth, J. Vienne, J. Keasler, K. Tomko, K. Schulz, A. Moody, and D. K. Panda. 2012. Design of a Scalable InfiniBand Topology Service to Enable Network-topology-aware Placement of Processes. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC '12). Article 70, 12 pages."},{"key":"e_1_3_2_1_34_1","volume-title":"Fast Collective Operations Using Shared and Remote Memory Access Protocols on Clusters (IPDPS '03)","author":"Tipparaju Vinod","year":"2003","unstructured":"Vinod Tipparaju , Jarek Nieplocha , and Dhabaleswar Panda . 2003 . Fast Collective Operations Using Shared and Remote Memory Access Protocols on Clusters (IPDPS '03) . 84.1-. Vinod Tipparaju, Jarek Nieplocha, and Dhabaleswar Panda. 2003. Fast Collective Operations Using Shared and Remote Memory Access Protocols on Clusters (IPDPS '03). 84.1-."},{"volume-title":"The Impact of Noise on the Scaling of Collectives: The Nearest Neighbor Model","author":"Vishnoi Nisheeth K.","key":"e_1_3_2_1_35_1","unstructured":"Nisheeth K. Vishnoi . 2007. The Impact of Noise on the Scaling of Collectives: The Nearest Neighbor Model . Springer , 476--487. Nisheeth K. Vishnoi. 2007. The Impact of Noise on the Scaling of Collectives: The Nearest Neighbor Model. Springer, 476--487."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00450-011-0171-3"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2925426.2926256"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3178487.3178491"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342015611952"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2907294.2907317"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2015.56"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPPW.2009.35"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2503210.2503279"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-03770-2_41"}],"event":{"name":"HPDC '18: The 27th International Symposium on High-Performance Parallel and Distributed Computing","sponsor":["SIGHPC ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing","SIGARCH ACM Special Interest Group on Computer Architecture"],"location":"Tempe Arizona","acronym":"HPDC '18"},"container-title":["Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3208040.3208054","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3208040.3208054","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:08:06Z","timestamp":1750212486000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3208040.3208054"}},"subtitle":["an event-based adaptive collective communication framework"],"short-title":[],"issued":{"date-parts":[[2018,6,11]]},"references-count":44,"alternative-id":["10.1145\/3208040.3208054","10.1145\/3208040"],"URL":"https:\/\/doi.org\/10.1145\/3208040.3208054","relation":{},"subject":[],"published":{"date-parts":[[2018,6,11]]},"assertion":[{"value":"2018-06-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}