{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T15:47:03Z","timestamp":1770220023702,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2014,11,3]],"date-time":"2014-11-03T00:00:00Z","timestamp":1414972800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100002418","name":"Intel Corporation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100002418","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000144","name":"Division of Computer and Network Systems","doi-asserted-by":"publisher","award":["CNS-1042537 and CNS-1042543"],"award-info":[{"award-number":["CNS-1042537 and CNS-1042543"]}],"id":[{"id":"10.13039\/100000144","id-type":"DOI","asserted-by":"publisher"}]},{"name":"PDL Consortium"},{"DOI":"10.13039\/100000185","name":"Defense Advanced Research Projects Agency","doi-asserted-by":"publisher","award":["FA87501220324"],"award-info":[{"award-number":["FA87501220324"]}],"id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000038","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100000038","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2014,11,3]]},"DOI":"10.1145\/2670979.2670984","type":"proceedings-article","created":{"date-parts":[[2014,11,7]],"date-time":"2014-11-07T17:10:54Z","timestamp":1415380254000},"page":"1-14","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["Exploiting iterative-ness for parallel ML computations"],"prefix":"10.1145","author":[{"given":"Henggang","family":"Cui","sequence":"first","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexey","family":"Tumanov","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jinliang","family":"Wei","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lianghong","family":"Xu","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Dai","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jesse","family":"Haber-Kucharsky","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qirong","family":"Ho","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gregory R.","family":"Ganger","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Phillip B.","family":"Gibbons","sequence":"additional","affiliation":[{"name":"Intel Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Garth A.","family":"Gibson","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eric P.","family":"Xing","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2014,11,3]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2124295.2124312"},{"key":"e_1_3_2_1_2_1","volume-title":"ICML","author":"Ahn S.","year":"2014","unstructured":"S. Ahn , B. Shahbaba , and M. Welling . Distributed stochastic gradient MCMC . In ICML , 2014 . S. Ahn, B. Shahbaba, and M. Welling. Distributed stochastic gradient MCMC. In ICML, 2014."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1854273.1854314"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/74850.74854"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0169-7552(98)00110-X"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/377769.377774"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920881"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/1413370.1413415"},{"key":"e_1_3_2_1_9_1","volume-title":"OSDI","author":"Cao P.","year":"1994","unstructured":"P. Cao , E. Felton , and K. Li . Implementation and performance of application-controlled file caching . In OSDI , 1994 . P. Cao, E. Felton, and K. Li. Implementation and performance of application-controlled file caching. In OSDI, 1994."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/195473.195485"},{"key":"e_1_3_2_1_11_1","volume-title":"OSDI","author":"Chang F.","year":"1999","unstructured":"F. Chang and G. A. Gibson . Automatic I\/O hint generation through speculative execution . In OSDI , 1999 . F. Chang and G. A. Gibson. Automatic I\/O hint generation through speculative execution. In OSDI, 1999."},{"key":"e_1_3_2_1_12_1","volume-title":"NIPS","author":"Chu C.-T.","year":"2006","unstructured":"C.-T. Chu , S. K. Kim , Y. A. Lin , Y. Yu , G. Bradski , A. Ng , and K. Olukotun . Map-reduce for machine learning on multicore . In NIPS , 2006 . C.-T. Chu, S. K. Kim, Y. A. Lin, Y. Yu, G. Bradski, A. Ng, and K. Olukotun. Map-reduce for machine learning on multicore. In NIPS, 2006."},{"key":"e_1_3_2_1_13_1","volume-title":"HotOS","author":"Cipar J.","year":"2013","unstructured":"J. Cipar , Q. Ho , J. K. Kim , S. Lee , G. R. Ganger , G. Gibson , K. Keeton , and E. Xing . Solving the straggler problem with bounded staleness . In HotOS , 2013 . J. Cipar, Q. Ho, J. K. Kim, S. Lee, G. R. Ganger, G. Gibson, K. Keeton, and E. Xing. Solving the straggler problem with bounded staleness. In HotOS, 2013."},{"key":"e_1_3_2_1_14_1","volume-title":"ICML","author":"Coates A.","year":"2013","unstructured":"A. Coates , B. Huval , T. Wang , D. Wu , B. Catanzaro , and N. Andrew . Deep learning with COTS HPC systems . In ICML , 2013 . A. Coates, B. Huval, T. Wang, D. Wu, B. Catanzaro, and N. Andrew. Deep learning with COTS HPC systems. In ICML, 2013."},{"key":"e_1_3_2_1_15_1","volume-title":"USENIX ATC","author":"Cui H.","year":"2014","unstructured":"H. Cui , J. Cipar , Q. Ho , J. K. Kim , S. Lee , A. Kumar , J. Wei , W. Dai , G. R. Ganger , P. B. Gibbons , G. A. Gibson , and E. P. Xing . Exploiting bounded staleness to speed up big data analytics . In USENIX ATC , 2014 . H. Cui, J. Cipar, Q. Ho, J. K. Kim, S. Lee, A. Kumar, J. Wei, W. Dai, G. R. Ganger, P. B. Gibbons, G. A. Gibson, and E. P. Xing. Exploiting bounded staleness to speed up big data analytics. In USENIX ATC, 2014."},{"key":"e_1_3_2_1_16_1","volume-title":"NIPS","author":"Dean J.","year":"2012","unstructured":"J. Dean , G. Corrado , R. Monga , K. Chen , M. Devin , Q. Le , M. Mao , M. Ranzato , A. Senior , P. Tucker , K. Yang , and A. Ng . Large scale distributed deep networks . In NIPS , 2012 . J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. Mao, M. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Ng. Large scale distributed deep networks. In NIPS, 2012."},{"key":"e_1_3_2_1_17_1","volume-title":"USENIX Annual Technical Conference","author":"Fraser K.","year":"2003","unstructured":"K. Fraser and F. Chang . Operating system I\/O speculation: How two invocations are faster than one . In USENIX Annual Technical Conference , 2003 . K. Fraser and F. Chang. Operating system I\/O speculation: How two invocations are faster than one. In USENIX Annual Technical Conference, 2003."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2020408.2020426"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/645896.671952"},{"key":"e_1_3_2_1_20_1","volume-title":"PRObE: A thousand-node experimental cluster for computer systems research. USENIX","author":"Gibson G.","year":"2013","unstructured":"G. Gibson , G. Grider , A. Jacobson , and W Lloyd . PRObE: A thousand-node experimental cluster for computer systems research. USENIX ; login:, 2013 . G. Gibson, G. Grider, A. Jacobson, and W Lloyd. PRObE: A thousand-node experimental cluster for computer systems research. USENIX; login:, 2013."},{"key":"e_1_3_2_1_21_1","volume-title":"OSDI","author":"Gonzalez J.","year":"2012","unstructured":"J. Gonzalez , Y Low , H. Gu , D. Bickson , and C. Guestrin . PowerGraph: Distributed graph-parallel computation on natural graphs . In OSDI , 2012 . J. Gonzalez, Y Low, H. Gu, D. Bickson, and C. Guestrin. PowerGraph: Distributed graph-parallel computation on natural graphs. In OSDI, 2012."},{"key":"e_1_3_2_1_22_1","volume-title":"Summer USENIX","author":"Griffioen J.","year":"1994","unstructured":"J. Griffioen and R. Appleton . Reducing file system latency using a predictive approach . In Summer USENIX , 1994 . J. Griffioen and R. Appleton. Reducing file system latency using a predictive approach. In Summer USENIX, 1994."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0307752101"},{"key":"e_1_3_2_1_24_1","volume-title":"NIPS","author":"Ho Q.","year":"2013","unstructured":"Q. Ho , J. Cipar , H. Cui , S. Lee , J. K. Kim , P. B. Gibbons , G. A. Gibson , G. R. Ganger , and E. P. Xing . More effective distributed ML via a stale synchronous parallel parameter server . In NIPS , 2013 . Q. Ho, J. Cipar, H. Cui, S. Lee, J. K. Kim, P. B. Gibbons, G. A. Gibson, G. R. Ganger, and E. P. Xing. More effective distributed ML via a stale synchronous parallel parameter server. In NIPS, 2013."},{"key":"e_1_3_2_1_25_1","unstructured":"Intel. Intel\u00ae Threading Building Blocks. https:\/\/www.threadingbuildingblocks.org.  Intel. Intel\u00ae Threading Building Blocks. https:\/\/www.threadingbuildingblocks.org."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1772690.1772751"},{"key":"e_1_3_2_1_27_1","volume-title":"OSDI","author":"Kyrola A.","year":"2012","unstructured":"A. Kyrola , G. Blelloch , and C. Guestrin . GraphChi: Large-scale graph computation on just a PC . In OSDI , 2012 . A. Kyrola, G. Blelloch, and C. Guestrin. GraphChi: Large-scale graph computation on just a PC. In OSDI, 2012."},{"key":"e_1_3_2_1_28_1","volume-title":"NIPS","author":"Langford J.","year":"2009","unstructured":"J. Langford , A. J. Smola , and M. Zinkevich . Slow learners are fast . In NIPS , 2009 . J. Langford, A. J. Smola, and M. Zinkevich. Slow learners are fast. In NIPS, 2009."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/1268680.1268701"},{"key":"e_1_3_2_1_30_1","volume-title":"UAI","author":"Low Y.","year":"2010","unstructured":"Y. Low , J. Gonzalez , A. Kyrola , D. Bickson , C. Guestrin , and J. M. Hellerstein . GraphLab: A new parallel framework for machine learning . In UAI , 2010 . Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, C. Guestrin, and J. M. Hellerstein. GraphLab: A new parallel framework for machine learning. In UAI, 2010."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807184"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522738"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/224056.224064"},{"key":"e_1_3_2_1_34_1","volume-title":"OSDI","author":"Peng D.","year":"2010","unstructured":"D. Peng and F. Dabek . Large-scale incremental processing using distributed transactions and notifications . In OSDI , 2010 . D. Peng and F. Dabek. Large-scale incremental processing using distributed transactions and notifications. In OSDI, 2010."},{"key":"e_1_3_2_1_35_1","volume-title":"OSDI","author":"Power R.","year":"2010","unstructured":"R. Power and J. Li . Piccolo: Building fast, distributed programs with partitioned tables . In OSDI , 2010 . R. Power and J. Li. Piccolo: Building fast, distributed programs with partitioned tables. In OSDI, 2010."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522740"},{"key":"e_1_3_2_1_37_1","unstructured":"TILEPro. TILEPro processor family: TILEPro64 overview. http:\/\/www.tilera.com\/products\/processors\/TILEPro_Family 2013.  TILEPro. TILEPro processor family: TILEPro64 overview. http:\/\/www.tilera.com\/products\/processors\/TILEPro_Family 2013."},{"key":"e_1_3_2_1_38_1","volume-title":"Workshop on Systems for Future Multicore Architectures (SFMA)","author":"Tumanov A.","year":"2013","unstructured":"A. Tumanov , J. Wise , O. Mutlu , and G. R. Ganger . Asymmetry-aware execution placement on manycore chips . In Workshop on Systems for Future Multicore Architectures (SFMA) , 2013 . A. Tumanov, J. Wise, O. Mutlu, and G. R. Ganger. Asymmetry-aware execution placement on manycore chips. In Workshop on Systems for Future Multicore Architectures (SFMA), 2013."},{"key":"e_1_3_2_1_39_1","unstructured":"UCI. UCI Machine Learning Repository. http:\/\/archive.ics.uci.edu\/ml\/datasets\/Bag+of+Words.  UCI. UCI Machine Learning Repository. http:\/\/archive.ics.uci.edu\/ml\/datasets\/Bag+of+Words."},{"key":"e_1_3_2_1_40_1","volume-title":"Towards topic modeling for big data. arXiv preprint arXiv:1405.4402","author":"Wang Y.","year":"2014","unstructured":"Y. Wang , X. Zhao , Z. Sun , H. Yan , L. Wang , Z. Jin , L. Wang , Y. Gao , J. Zeng , Q. Yang , Towards topic modeling for big data. arXiv preprint arXiv:1405.4402 , 2014 . Y. Wang, X. Zhao, Z. Sun, H. Yan, L. Wang, Z. Jin, L. Wang, Y. Gao, J. Zeng, Q. Yang, et al. Towards topic modeling for big data. arXiv preprint arXiv:1405.4402, 2014."},{"key":"e_1_3_2_1_41_1","volume-title":"SOSP","author":"Zaharia M.","year":"2013","unstructured":"M. Zaharia , T. Das , H. Li , S. Shenker , and I. Stoica . Discretized streams: An efficient and fault-tolerant model for stream processing on large clusters . In SOSP , 2013 . M. Zaharia, T. Das, H. Li, S. Shenker, and I. Stoica. Discretized streams: An efficient and fault-tolerant model for stream processing on large clusters. In SOSP, 2013."},{"key":"e_1_3_2_1_42_1","volume-title":"ICML","author":"Zhang R.","year":"2014","unstructured":"R. Zhang and J. Kwok . Asynchronous distributed ADMM algorithm for global variable consensus optimization . In ICML , 2014 . R. Zhang and J. Kwok. Asynchronous distributed ADMM algorithm for global variable consensus optimization. In ICML, 2014."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2038916.2038929"}],"event":{"name":"SOCC '14: ACM Symposium on Cloud Computing","location":"Seattle WA USA","acronym":"SOCC '14","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGOPS ACM Special Interest Group on Operating Systems"]},"container-title":["Proceedings of the ACM Symposium on Cloud Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2670979.2670984","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2670979.2670984","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T06:12:18Z","timestamp":1750227138000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2670979.2670984"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,11,3]]},"references-count":43,"alternative-id":["10.1145\/2670979.2670984","10.1145\/2670979"],"URL":"https:\/\/doi.org\/10.1145\/2670979.2670984","relation":{},"subject":[],"published":{"date-parts":[[2014,11,3]]},"assertion":[{"value":"2014-11-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}