{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:21:01Z","timestamp":1750220461124,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":26,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,9]],"date-time":"2021-08-09T00:00:00Z","timestamp":1628467200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,9]]},"DOI":"10.1145\/3472456.3472499","type":"proceedings-article","created":{"date-parts":[[2021,10,5]],"date-time":"2021-10-05T18:46:04Z","timestamp":1633459564000},"page":"1-11","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Sparker: Efficient Reduction for More Scalable Machine Learning with Spark"],"prefix":"10.1145","author":[{"given":"Bowen","family":"Yu","sequence":"first","affiliation":[{"name":"Tsinghua University"}]},{"given":"Huanqi","family":"Cao","sequence":"additional","affiliation":[{"name":"Tsinghua University"}]},{"given":"Tianyi","family":"Shan","sequence":"additional","affiliation":[{"name":"University of California San Diego"}]},{"given":"Haojie","family":"Wang","sequence":"additional","affiliation":[{"name":"Tsinghua University, China"}]},{"given":"Xiongchao","family":"Tang","sequence":"additional","affiliation":[{"name":"Sangfor Technologies Inc. and Tsinghua Shenzhen International Graduate School"}]},{"given":"Wenguang","family":"Chen","sequence":"additional","affiliation":[{"name":"Tsinghua University"}]}],"member":"320","published-online":{"date-parts":[[2021,10,5]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Mart\u00edn Abadi Paul Barham Jianmin Chen Zhifeng Chen Andy Davis Jeffrey Dean Matthieu Devin Sanjay Ghemawat Geoffrey Irving Michael Isard 2016. Tensorflow: a system for large-scale machine learning.. In OSDI Vol.\u00a016. 265\u2013283.  Mart\u00edn Abadi Paul Barham Jianmin Chen Zhifeng Chen Andy Davis Jeffrey Dean Matthieu Devin Sanjay Ghemawat Geoffrey Irving Michael Isard 2016. Tensorflow: a system for large-scale machine learning.. In OSDI Vol.\u00a016. 265\u2013283."},{"key":"e_1_3_2_1_2_1","volume-title":"COOL: A Cloud-Optimized Structure for MPI Collective Operations. In 2018 IEEE 11th International Conference on Cloud Computing (CLOUD). IEEE, 746\u2013753","author":"Alfatafta Mohammed","year":"2018","unstructured":"Mohammed Alfatafta , Zuhair AlSader , and Samer Al-Kiswany . 2018 . COOL: A Cloud-Optimized Structure for MPI Collective Operations. In 2018 IEEE 11th International Conference on Cloud Computing (CLOUD). IEEE, 746\u2013753 . Mohammed Alfatafta, Zuhair AlSader, and Samer Al-Kiswany. 2018. COOL: A Cloud-Optimized Structure for MPI Collective Operations. In 2018 IEEE 11th International Conference on Cloud Computing (CLOUD). IEEE, 746\u2013753."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2523616.2523625"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3126908.3126954"},{"key":"e_1_3_2_1_5_1","volume-title":"Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation(NSDI\u201912)","author":"Costa Paolo","year":"2012","unstructured":"Paolo Costa , Austin Donnelly , Antony Rowstron , and Greg O\u2019Shea . 2012 . Camdoop: Exploiting In-network Aggregation for Big Data Applications . In Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation(NSDI\u201912) . USENIX Association, Berkeley, CA, USA, 3\u20133. http:\/\/dl.acm.org\/citation.cfm?id=2228298.2228302 Paolo Costa, Austin Donnelly, Antony Rowstron, and Greg O\u2019Shea. 2012. Camdoop: Exploiting In-network Aggregation for Big Data Applications. In Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation(NSDI\u201912). USENIX Association, Berkeley, CA, USA, 3\u20133. http:\/\/dl.acm.org\/citation.cfm?id=2228298.2228302"},{"key":"e_1_3_2_1_6_1","volume-title":"Proceedings of the ACM Symposium on Cloud Computing(SoCC\u201919)","author":"Wang Yiheng","year":"2019","unstructured":"Jason\u00a0(Jinquan) Dai, Yiheng Wang , Xin Qiu , Ding Ding , Yao Zhang , Yanzhang Wang , Xianyan Jia , Li\u00a0(Cherry) Zhang, Yan Wan , Zhichao Li , Jiao Wang , Shengsheng Huang , Zhongyuan Wu , Yang Wang , Yuhao Yang , Bowen She , Dongjie Shi , Qi Lu , Kai Huang , and Guoqiong Song . 2019 . BigDL: A Distributed Deep Learning Framework for Big Data . In Proceedings of the ACM Symposium on Cloud Computing(SoCC\u201919) . Association for Computing Machinery, 50\u201360. https:\/\/doi.org\/10.1145\/3357223.3362707 Jason\u00a0(Jinquan) Dai, Yiheng Wang, Xin Qiu, Ding Ding, Yao Zhang, Yanzhang Wang, Xianyan Jia, Li\u00a0(Cherry) Zhang, Yan Wan, Zhichao Li, Jiao Wang, Shengsheng Huang, Zhongyuan Wu, Yang Wang, Yuhao Yang, Bowen She, Dongjie Shi, Qi Lu, Kai Huang, and Guoqiong Song. 2019. BigDL: A Distributed Deep Learning Framework for Big Data. In Proceedings of the ACM Symposium on Cloud Computing(SoCC\u201919). Association for Computing Machinery, 50\u201360. https:\/\/doi.org\/10.1145\/3357223.3362707"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2013.96"},{"volume-title":"O\u2019Reilly Media","author":"Hintjens Pieter","key":"e_1_3_2_1_8_1","unstructured":"Pieter Hintjens . 2013. ZeroMQ: messaging for many applications . \u201d O\u2019Reilly Media , Inc .\u201d. Pieter Hintjens. 2013. ZeroMQ: messaging for many applications. \u201d O\u2019Reilly Media, Inc.\u201d."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3196892"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2017.87"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2901318.2901351"},{"volume-title":"Euro-Par 2016: Parallel Processing Workshops","author":"Ma He","key":"e_1_3_2_1_12_1","unstructured":"He Ma , Fei Mao , and Graham\u00a0 W. Taylor . 2017. Theano-MPI: A Theano-Based Distributed Training Framework . In Euro-Par 2016: Parallel Processing Workshops . Springer International Publishing , Cham , 800\u2013813. He Ma, Fei Mao, and Graham\u00a0W. Taylor. 2017. Theano-MPI: A Theano-Based Distributed Training Framework. In Euro-Par 2016: Parallel Processing Workshops. Springer International Publishing, Cham, 800\u2013813."},{"key":"e_1_3_2_1_13_1","unstructured":"Amith\u00a0R. Mamidala Georgios Kollias Chris Ward and Fausto Artico. 2018. MXNET-MPI: Embedding MPI parallelism in Parameter Server Task Model for scaling Deep Learning. CoRR abs\/1801.03855(2018). arxiv:1801.03855http:\/\/arxiv.org\/abs\/1801.03855  Amith\u00a0R. Mamidala Georgios Kollias Chris Ward and Fausto Artico. 2018. MXNET-MPI: Embedding MPI parallelism in Parameter Server Task Model for scaling Deep Learning. CoRR abs\/1801.03855(2018). arxiv:1801.03855http:\/\/arxiv.org\/abs\/1801.03855"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.5555\/2946645.2946679"},{"key":"e_1_3_2_1_15_1","unstructured":"OSU Micro-Benchmarks. [n. d.]. http:\/\/mvapich.cse.ohio-state.edu\/benchmarks\/.  OSU Micro-Benchmarks. [n. d.]. http:\/\/mvapich.cse.ohio-state.edu\/benchmarks\/."},{"key":"e_1_3_2_1_16_1","volume-title":"Making Sense of Performance in Data Analytics Frameworks. In 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI 15)","author":"Ousterhout Kay","year":"2015","unstructured":"Kay Ousterhout , Ryan Rasti , Sylvia Ratnasamy , Scott Shenker , and Byung-Gon Chun . 2015 . Making Sense of Performance in Data Analytics Frameworks. In 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI 15) . USENIX Association, Oakland, CA, 293\u2013307. https:\/\/www.usenix.org\/conference\/nsdi15\/technical-sessions\/presentation\/ousterhout Kay Ousterhout, Ryan Rasti, Sylvia Ratnasamy, Scott Shenker, and Byung-Gon Chun. 2015. Making Sense of Performance in Data Analytics Frameworks. In 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI 15). USENIX Association, Oakland, CA, 293\u2013307. https:\/\/www.usenix.org\/conference\/nsdi15\/technical-sessions\/presentation\/ousterhout"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2008.09.002"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2815400.2815418"},{"key":"e_1_3_2_1_19_1","unstructured":"Amedeo Sapio Marco Canini Chen-Yu Ho Jacob Nelson Panos Kalnis Changhoon Kim Arvind Krishnamurthy Masoud Moshref Dan\u00a0RK Ports and Peter Richt\u00e1rik. 2019. Scaling Distributed Machine Learning with In-Network Aggregation. arXiv preprint arXiv:1903.06701(2019).  Amedeo Sapio Marco Canini Chen-Yu Ho Jacob Nelson Panos Kalnis Changhoon Kim Arvind Krishnamurthy Masoud Moshref Dan\u00a0RK Ports and Peter Richt\u00e1rik. 2019. Scaling Distributed Machine Learning with In-Network Aggregation. arXiv preprint arXiv:1903.06701(2019)."},{"volume-title":"MPI\u2013the Complete Reference: The MPI core. Vol.\u00a01","author":"Snir Marc","key":"e_1_3_2_1_20_1","unstructured":"Marc Snir , William Gropp , Steve Otto , Steven Huss-Lederman , Jack Dongarra , and David Walker . 1998. MPI\u2013the Complete Reference: The MPI core. Vol.\u00a01 . MIT press . Marc Snir, William Gropp, Steve Otto, Steven Huss-Lederman, Jack Dongarra, and David Walker. 1998. MPI\u2013the Complete Reference: The MPI core. Vol.\u00a01. MIT press."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342005051521"},{"key":"e_1_3_2_1_22_1","first-page":"56","article-title":"MPI: a standard message passing interface","volume":"12","author":"Walker W","year":"1996","unstructured":"David\u00a0 W Walker and Jack\u00a0 J Dongarra . 1996 . MPI: a standard message passing interface . Supercomputer 12 (1996), 56 \u2013 68 . David\u00a0W Walker and Jack\u00a0J Dongarra. 1996. MPI: a standard message passing interface. Supercomputer 12(1996), 56\u201368.","journal-title":"Supercomputer"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1629575.1629600"},{"key":"e_1_3_2_1_24_1","volume-title":"Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation(NSDI\u201912)","author":"Zaharia Matei","year":"2012","unstructured":"Matei Zaharia , Mosharaf Chowdhury , Tathagata Das , Ankur Dave , Justin Ma , Murphy McCauley , Michael\u00a0 J. Franklin , Scott Shenker , and Ion Stoica . 2012 . Resilient Distributed Datasets: A Fault-tolerant Abstraction for In-memory Cluster Computing . In Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation(NSDI\u201912) . USENIX Association, Berkeley, CA, USA, 2\u20132. http:\/\/dl.acm.org\/citation.cfm?id=2228298.2228301 Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael\u00a0J. Franklin, Scott Shenker, and Ion Stoica. 2012. Resilient Distributed Datasets: A Fault-tolerant Abstraction for In-memory Cluster Computing. In Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation(NSDI\u201912). USENIX Association, Berkeley, CA, USA, 2\u20132. http:\/\/dl.acm.org\/citation.cfm?id=2228298.2228301"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2934664"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-020-00636-3"}],"event":{"name":"ICPP 2021: 50th International Conference on Parallel Processing","acronym":"ICPP 2021","location":"Lemont IL USA"},"container-title":["50th International Conference on Parallel Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472456.3472499","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3472456.3472499","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:48:12Z","timestamp":1750193292000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472456.3472499"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,9]]},"references-count":26,"alternative-id":["10.1145\/3472456.3472499","10.1145\/3472456"],"URL":"https:\/\/doi.org\/10.1145\/3472456.3472499","relation":{},"subject":[],"published":{"date-parts":[[2021,8,9]]},"assertion":[{"value":"2021-10-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}