{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T03:47:09Z","timestamp":1772164029211,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":39,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,2,10]],"date-time":"2018-02-10T00:00:00Z","timestamp":1518220800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"NSF","doi-asserted-by":"publisher","award":["1649880"],"award-info":[{"award-number":["1649880"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,2,10]]},"DOI":"10.1145\/3178487.3178501","type":"proceedings-article","created":{"date-parts":[[2018,2,6]],"date-time":"2018-02-06T13:12:23Z","timestamp":1517922743000},"page":"183-194","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["FlashR"],"prefix":"10.1145","author":[{"given":"Da","family":"Zheng","sequence":"first","affiliation":[{"name":"Johns Hopkins University"}]},{"given":"Disa","family":"Mhembere","sequence":"additional","affiliation":[{"name":"Johns Hopkins University"}]},{"given":"Joshua T.","family":"Vogelstein","sequence":"additional","affiliation":[{"name":"Johns Hopkins University"}]},{"given":"Carey E.","family":"Priebe","sequence":"additional","affiliation":[{"name":"Johns Hopkins University"}]},{"given":"Randal","family":"Burns","sequence":"additional","affiliation":[{"name":"Johns Hopkins University"}]}],"member":"320","published-online":{"date-parts":[[2018,2,10]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"TensorFlow: A System for Large-Scale Machine Learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16)","author":"Abadi Mart\u00edn","year":"2016","unstructured":"Mart\u00edn Abadi , Paul Barham , Jianmin Chen , Zhifeng Chen , Andy Davis , Jeffrey Dean , Matthieu Devin , Sanjay Ghemawat , Geoffrey Irving , Michael Isard , Manjunath Kudlur , Josh Levenberg , Rajat Monga , Sherry Moore , Derek G. Murray , Benoit Steiner , Paul Tucker , Vijay Vasudevan , Pete Warden , Martin Wicke , Yuan Yu , and Xiaoqiang Zheng . 2016 . TensorFlow: A System for Large-Scale Machine Learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16) . Mart\u00edn Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A System for Large-Scale Machine Learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.14778\/3007263.3007279"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732286.2732292"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1941553.1941561"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-012-0197-6"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/2976456.2976492"},{"key":"e_1_3_2_1_8_1","unstructured":"criteo Accessed 2\/11\/2017. Criteo's 1TB Click Prediction Dataset. https:\/\/blogs.technet.microsoft.com\/machinelearning\/2015\/04\/01\/now-available-on-azure-ml-criteos-1tb-click-prediction-dataset\/. (Accessed 2\/11\/2017).  criteo Accessed 2\/11\/2017. Criteo's 1TB Click Prediction Dataset. https:\/\/blogs.technet.microsoft.com\/machinelearning\/2015\/04\/01\/now-available-on-azure-ml-criteos-1tb-click-prediction-dataset\/. (Accessed 2\/11\/2017)."},{"key":"e_1_3_2_1_9_1","volume-title":"Proceedings of the 6th Conference on Symposium on Opearting Systems Design & Implementation -","volume":"6","author":"Dean Jeffrey","year":"2004","unstructured":"Jeffrey Dean and Sanjay Ghemawat . 2004 . MapReduce: Simplified Data Processing on Large Clusters . In Proceedings of the 6th Conference on Symposium on Opearting Systems Design & Implementation - Volume 6 (OSDI'04). USENIX Association, Berkeley, CA, USA. Jeffrey Dean and Sanjay Ghemawat. 2004. MapReduce: Simplified Data Processing on Large Clusters. In Proceedings of the 6th Conference on Symposium on Opearting Systems Design & Implementation - Volume 6 (OSDI'04). USENIX Association, Berkeley, CA, USA."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/2994509.2994515"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1188455.1188543"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2011.5767930"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/512760.512761"},{"key":"e_1_3_2_1_14_1","unstructured":"H2O Accessed 2\/7\/2017. H2O machine learning library. http:\/\/www.h2o.ai\/. (Accessed 2\/7\/2017).  H2O Accessed 2\/7\/2017. H2O machine learning library. http:\/\/www.h2o.ai\/. (Accessed 2\/7\/2017)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/322217.322232"},{"key":"e_1_3_2_1_16_1","volume-title":"Mathematical Programming: Series A and B","author":"Liu D. C.","year":"1989","unstructured":"D. C. Liu and J. Nocedal . 1989 . On the limited memory BFGS method for large scale optimization. Mathematical Programming: Series A and B (1989). D. C. Liu and J. Nocedal. 1989. On the limited memory BFGS method for large scale optimization. Mathematical Programming: Series A and B (1989)."},{"key":"e_1_3_2_1_17_1","volume-title":"Graphene: Fine-Grained IO Management for Graph Computing. In 15th USENIX Conference on File and Storage Technologies (FAST 17)","author":"Liu Hang","unstructured":"Hang Liu and H. Howie Huang . 2017 . Graphene: Fine-Grained IO Management for Graph Computing. In 15th USENIX Conference on File and Storage Technologies (FAST 17) . Santa Clara, CA. Hang Liu and H. Howie Huang. 2017. Graphene: Fine-Grained IO Management for Graph Computing. In 15th USENIX Conference on File and Storage Technologies (FAST 17). Santa Clara, CA."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1982.1056489"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.14778\/2212351.2212354"},{"key":"e_1_3_2_1_20_1","unstructured":"mass Accessed 2\/12\/2017. Package MASS. https:\/\/cran.r-project.org\/web\/packages\/MASS\/index.html. (Accessed 2\/12\/2017).  mass Accessed 2\/12\/2017. Package MASS. https:\/\/cran.r-project.org\/web\/packages\/MASS\/index.html. (Accessed 2\/12\/2017)."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3018743.3018766"},{"key":"e_1_3_2_1_22_1","volume-title":"15th Workshop on Hot Topics in Operating Systems (HotOS XV).","author":"McSherry Frank","unstructured":"Frank McSherry , Michael Isard , and Derek G. Murray . 2015. Scalability! But at what COST? . In 15th Workshop on Hot Topics in Operating Systems (HotOS XV). Frank McSherry, Michael Isard, and Derek G. Murray. 2015. Scalability! But at what COST?. In 15th Workshop on Hot Topics in Operating Systems (HotOS XV)."},{"key":"e_1_3_2_1_23_1","article-title":"MLlib: Machine Learning in Apache Spark","volume":"17","author":"Meng Xiangrui","year":"2015","unstructured":"Xiangrui Meng , Joseph Bradley , Burak Yavuz , Evan Sparks , Shivaram Venkataraman , Davies Liu , Jeremy Freeman , DB Tsai , Manish Amde , Sean Owen , Doris Xin , Reynold Xin , Michael J. Franklin , Reza Zadeh , Matei Zaharia , and Ameet Talwalkar . 2015 . MLlib: Machine Learning in Apache Spark . The Journal of Machine Learning Research 17 , 1 (2015). Xiangrui Meng, Joseph Bradley, Burak Yavuz, Evan Sparks, Shivaram Venkataraman, Davies Liu, Jeremy Freeman, DB Tsai, Manish Amde, Sean Owen, Doris Xin, Reynold Xin, Michael J. Franklin, Reza Zadeh, Matei Zaharia, and Ameet Talwalkar. 2015. MLlib: Machine Learning in Apache Spark. The Journal of Machine Learning Research 17, 1 (2015).","journal-title":"The Journal of Machine Learning Research"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.14778\/3025111.3025117"},{"key":"e_1_3_2_1_25_1","volume-title":"Proceedings of the Royal Society of London. 240--242","author":"Pearson Karl","year":"1895","unstructured":"Karl Pearson . 1895 . Notes on regression and inheritance in the case of two parents . In Proceedings of the Royal Society of London. 240--242 . Karl Pearson. 1895. Notes on regression and inheritance in the case of two parents. In Proceedings of the Royal Society of London. 240--242."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2331130.2331133"},{"key":"e_1_3_2_1_27_1","unstructured":"rro Accessed 2\/12\/2017. Microsoft R Open. https:\/\/mran.microsoft.com\/open\/. (Accessed 2\/12\/2017).  rro Accessed 2\/12\/2017. Microsoft R Open. https:\/\/mran.microsoft.com\/open\/. (Accessed 2\/12\/2017)."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.scico.2008.06.002"},{"key":"e_1_3_2_1_29_1","volume-title":"in Proceedings of the 28th International Conference on Machine Learning.","author":"Sujeeth Arvind K.","unstructured":"Arvind K. Sujeeth , Hyoukjoong Lee , Kevin J. Brown , Hassan Chafi , Michael Wu , Anand R. Atreya , Kunle Olukotun, Tiark Rompf, and Martin Odersky. 2011. OptiML: an implicitly parallel domainspecific language for machine learning . In in Proceedings of the 28th International Conference on Machine Learning. Arvind K. Sujeeth, Hyoukjoong Lee, Kevin J. Brown, Hassan Chafi, Michael Wu, Anand R. Atreya, Kunle Olukotun, Tiark Rompf, and Martin Odersky. 2011. OptiML: an implicitly parallel domainspecific language for machine learning. In in Proceedings of the 28th International Conference on Machine Learning."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2370816.2370825"},{"key":"e_1_3_2_1_31_1","volume-title":"RIPQ: Advanced Photo Caching on Flash for Facebook. In 13th USENIX Conference on File and Storage Technologies (FAST 15)","author":"Tang Linpeng","year":"2015","unstructured":"Linpeng Tang , Qi Huang , Wyatt Lloyd , Sanjeev Kumar , and Kai Li . 2015 . RIPQ: Advanced Photo Caching on Flash for Facebook. In 13th USENIX Conference on File and Storage Technologies (FAST 15) . Santa Clara, CA. Linpeng Tang, Qi Huang, Wyatt Lloyd, Sanjeev Kumar, and Kai Li. 2015. RIPQ: Advanced Photo Caching on Flash for Facebook. In 13th USENIX Conference on File and Storage Technologies (FAST 15). Santa Clara, CA."},{"key":"e_1_3_2_1_32_1","volume-title":"External Memory Algorithms","author":"Toledo Sivan","unstructured":"Sivan Toledo . 1999. External Memory Algorithms . Boston, MA, USA , Chapter A Survey of Out-of-core Algorithms in Numerical Linear Algebra , 161--179. Sivan Toledo. 1999. External Memory Algorithms. Boston, MA, USA, Chapter A Survey of Out-of-core Algorithms in Numerical Linear Algebra, 161--179."},{"key":"e_1_3_2_1_33_1","unstructured":"webgraph Accessed 4\/18\/2014. Web graph. http:\/\/webdatacommons.org\/hyperlinkgraph\/. (Accessed 4\/18\/2014).  webgraph Accessed 4\/18\/2014. Web graph. http:\/\/webdatacommons.org\/hyperlinkgraph\/. (Accessed 4\/18\/2014)."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/373574.373576"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2783258.2783323"},{"key":"e_1_3_2_1_36_1","volume-title":"Presented as part of the 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12)","author":"Zaharia Matei","unstructured":"Matei Zaharia , Mosharaf Chowdhury , Tathagata Das , Ankur Dave , Justin Ma , Murphy McCauly , Michael J. Franklin , Scott Shenker , and Ion Stoica . 2012. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing . In Presented as part of the 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12) . USENIX , San Jose, CA , 15--28. Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauly, Michael J. Franklin, Scott Shenker, and Ion Stoica. 2012. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. In Presented as part of the 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12). USENIX, San Jose, CA, 15--28."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2503210.2503225"},{"key":"e_1_3_2_1_38_1","volume-title":"13th USENIX Conference on File and Storage Technologies (FAST 15)","author":"Zheng Da","unstructured":"Da Zheng , Disa Mhembere , Randal Burns , Joshua Vogelstein , Carey E. Priebe , and Alexander S. Szalay . 2015. FlashGraph: Processing Billion-Node Graphs on an Array of Commodity SSDs . In 13th USENIX Conference on File and Storage Technologies (FAST 15) . Da Zheng, Disa Mhembere, Randal Burns, Joshua Vogelstein, Carey E. Priebe, and Alexander S. Szalay. 2015. FlashGraph: Processing Billion-Node Graphs on an Array of Commodity SSDs. In 13th USENIX Conference on File and Storage Technologies (FAST 15)."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2016.2618791"},{"key":"e_1_3_2_1_40_1","volume-title":"2015 USENIX Annual Technical Conference (USENIX ATC 15)","author":"Zhu Xiaowei","year":"2015","unstructured":"Xiaowei Zhu , Wentao Han , and Wenguang Chen . 2015 . GridGraph: Large-Scale Graph Processing on a Single Machine Using 2-Level Hierarchical Partitioning . In 2015 USENIX Annual Technical Conference (USENIX ATC 15) . Xiaowei Zhu, Wentao Han, and Wenguang Chen. 2015. GridGraph: Large-Scale Graph Processing on a Single Machine Using 2-Level Hierarchical Partitioning. In 2015 USENIX Annual Technical Conference (USENIX ATC 15)."}],"event":{"name":"PPoPP '18: 23nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming","location":"Vienna Austria","acronym":"PPoPP '18","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages","SIGHPC ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing"]},"container-title":["Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3178487.3178501","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3178487.3178501","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3178487.3178501","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:39:08Z","timestamp":1750196348000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3178487.3178501"}},"subtitle":["parallelize and scale R for machine learning using SSDs"],"short-title":[],"issued":{"date-parts":[[2018,2,10]]},"references-count":39,"alternative-id":["10.1145\/3178487.3178501","10.1145\/3178487"],"URL":"https:\/\/doi.org\/10.1145\/3178487.3178501","relation":{"is-identical-to":[{"id-type":"doi","id":"10.1145\/3200691.3178501","asserted-by":"object"}]},"subject":[],"published":{"date-parts":[[2018,2,10]]},"assertion":[{"value":"2018-02-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}