{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,14]],"date-time":"2026-03-14T17:54:20Z","timestamp":1773510860342,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":40,"publisher":"ACM","license":[{"start":{"date-parts":[[2012,10,14]],"date-time":"2012-10-14T00:00:00Z","timestamp":1350172800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000144","name":"Division of Computer and Network Systems","doi-asserted-by":"publisher","award":["CSR-1116079MRI CNS-0923523"],"award-info":[{"award-number":["CSR-1116079MRI CNS-0923523"]}],"id":[{"id":"10.13039\/100000144","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2012,10,14]]},"DOI":"10.1145\/2391229.2391242","type":"proceedings-article","created":{"date-parts":[[2012,11,13]],"date-time":"2012-11-13T15:04:07Z","timestamp":1352819047000},"page":"1-14","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":51,"title":["Themis"],"prefix":"10.1145","author":[{"given":"Alexander","family":"Rasmussen","sequence":"first","affiliation":[{"name":"UC San Diego"}]},{"given":"Vinh The","family":"Lam","sequence":"additional","affiliation":[{"name":"UC San Diego"}]},{"given":"Michael","family":"Conley","sequence":"additional","affiliation":[{"name":"UC San Diego"}]},{"given":"George","family":"Porter","sequence":"additional","affiliation":[{"name":"UC San Diego"}]},{"given":"Rishi","family":"Kapoor","sequence":"additional","affiliation":[{"name":"UC San Diego"}]},{"given":"Amin","family":"Vahdat","sequence":"additional","affiliation":[{"name":"UC San Diego &amp; Google, Inc."}]}],"member":"320","published-online":{"date-parts":[[2012,10,14]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/48529.48535"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1740390.1740400"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"crossref","DOI":"10.1002\/9780470455401","volume-title":"Practical System Reliability (pg. 226)","author":"Bauer E.","year":"2009","unstructured":"E. Bauer , X. Zhang , and D. Kimber . Practical System Reliability (pg. 226) . Wiley-IEEE Press , 2009 . E. Bauer, X. Zhang, and D. Kimber. Practical System Reliability (pg. 226). Wiley-IEEE Press, 2009."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920881"},{"key":"e_1_3_2_1_5_1","volume-title":"OSDI","author":"Candea G.","year":"2004","unstructured":"G. Candea , S. Kawamoto , Y. Fujiki , G. Friedman , and A. Fox . Microreboot -- A Technique for Cheap Recovery . In OSDI , 2004 . G. Candea, S. Kawamoto, Y. Fujiki, G. Friedman, and A. Fox. Microreboot -- A Technique for Cheap Recovery. In OSDI, 2004."},{"key":"e_1_3_2_1_6_1","volume-title":"Proc. VLDB Endowment","author":"Chattopadhyay B.","year":"2011","unstructured":"B. Chattopadhyay , L. Lin , W. Liu , S. Mittal , P. Aragonda , V. Lychagina , Y. Kwon , and M. Wong . Tenzing: A SQL Implementation On The MapReduce Framework . In Proc. VLDB Endowment , 2011 . B. Chattopadhyay, L. Lin, W. Liu, S. Mittal, P. Aragonda, V. Lychagina, Y. Kwon, and M. Wong. Tenzing: A SQL Implementation On The MapReduce Framework. In Proc. VLDB Endowment, 2011."},{"key":"e_1_3_2_1_7_1","unstructured":"Dell and Cloudera Hadoop Platform. http:\/\/www.cloudera.com\/company\/press-center\/releases\/dell-and-cloudera-collaborate-to-enable-large-scale-data-analysis-and-modeling-through-open-source\/.  Dell and Cloudera Hadoop Platform. http:\/\/www.cloudera.com\/company\/press-center\/releases\/dell-and-cloudera-collaborate-to-enable-large-scale-data-analysis-and-modeling-through-open-source\/."},{"key":"e_1_3_2_1_8_1","volume-title":"OSDI","author":"Dean J.","year":"2004","unstructured":"J. Dean and S. Ghemawat . MapReduce: Simplified Data Processing on Large Clusters . In OSDI , 2004 . J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI, 2004."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/129888.129894"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/382009.383693"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/568522.568525"},{"key":"e_1_3_2_1_12_1","volume-title":"OSDI","author":"Ford D.","year":"2010","unstructured":"D. Ford , F. Labelle , F. I. Popovici , M. Stokely , V.-A. Truong , L. Barroso , C. Grimes , and S. Quinlan . Availability in Globally Distributed Storage Systems . In OSDI , 2010 . D. Ford, F. Labelle, F. I. Popovici, M. Stokely, V.-A. Truong, L. Barroso, C. Grimes, and S. Quinlan. Availability in Globally Distributed Storage Systems. In OSDI, 2010."},{"key":"e_1_3_2_1_14_1","unstructured":"Hadoop PoweredBy Index. http:\/\/wiki.apache.org\/hadoop\/PoweredBy.  Hadoop PoweredBy Index. http:\/\/wiki.apache.org\/hadoop\/PoweredBy."},{"key":"e_1_3_2_1_15_1","unstructured":"B. Howe. lakewash_combined_v2.genes.nucleotide. https:\/\/dada.cs.washington.edu\/research\/projects\/db-data-L1_bu\/escience_datasets\/seq_alignment\/.  B. Howe. lakewash_combined_v2.genes.nucleotide. https:\/\/dada.cs.washington.edu\/research\/projects\/db-data-L1_bu\/escience_datasets\/seq_alignment\/."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807128.1807140"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2213836.2213840"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807128.1807138"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/304181.304204"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11222-007-9021-3"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btp236"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/263326.263335"},{"key":"e_1_3_2_1_23_1","unstructured":"C. Monash. Petabyte-Scale Hadoop Clusters (Dozens of Them). http:\/\/www.dbms2.com\/2011\/07\/06\/petabyte-hadoop-clusters\/.  C. Monash. Petabyte-Scale Hadoop Clusters (Dozens of Them). http:\/\/www.dbms2.com\/2011\/07\/06\/petabyte-hadoop-clusters\/."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-8191(99)00070-8"},{"key":"e_1_3_2_1_25_1","volume-title":"NSDI","author":"Nath S.","year":"2006","unstructured":"S. Nath , H. Yu , P. B. Gibbons , and S. Seshan . Subtleties in Tolerating Correlated Failures in Wide-Area Storage Systems . In NSDI , 2006 . S. Nath, H. Yu, P. B. Gibbons, and S. Seshan. Subtleties in Tolerating Correlated Failures in Wide-Area Storage Systems. In NSDI, 2006."},{"key":"e_1_3_2_1_26_1","author":"Peng D.","year":"2010","unstructured":"D. Peng and F. Dabek . Large-Scale Incremental Processing Using Distributed Transactions and Notifications. In OSDI , 2010 . D. Peng and F. Dabek. Large-Scale Incremental Processing Using Distributed Transactions and Notifications. In OSDI, 2010.","journal-title":"Large-Scale Incremental Processing Using Distributed Transactions and Notifications. In OSDI"},{"key":"e_1_3_2_1_27_1","volume-title":"FAST","author":"Pinheiro E.","year":"2007","unstructured":"E. Pinheiro , W. Weber , and L. A. Barroso . Failure Trends in a Large Disk Drive Population . In FAST , 2007 . E. Pinheiro, W. Weber, and L. A. Barroso. Failure Trends in a Large Disk Drive Population. In FAST, 2007."},{"key":"e_1_3_2_1_29_1","volume-title":"NSDI","author":"Rasmussen A.","year":"2011","unstructured":"A. Rasmussen , G. Porter , M. Conley , H. V. Madhyastha , R. N. Mysore , A. Pucher , and A. Vahdat . TritonSort: A Balanced Large-Scale Sorting System . In NSDI , 2011 . A. Rasmussen, G. Porter, M. Conley, H. V. Madhyastha, R. N. Mysore, A. Pucher, and A. Vahdat. TritonSort: A Balanced Large-Scale Sorting System. In NSDI, 2011."},{"key":"e_1_3_2_1_30_1","unstructured":"Recovery-Oriented Computing. http:\/\/roc.cs.berkeley.edu\/.  Recovery-Oriented Computing. http:\/\/roc.cs.berkeley.edu\/."},{"key":"e_1_3_2_1_31_1","volume-title":"Arpaci-Dusseau. Fail-Stutter Fault Tolerance. In HotOS","author":"H.","year":"2001","unstructured":"Remzi H. Arpaci-Dusseau and Andrea C . Arpaci-Dusseau. Fail-Stutter Fault Tolerance. In HotOS , 2001 . Remzi H. Arpaci-Dusseau and Andrea C. Arpaci-Dusseau. Fail-Stutter Fault Tolerance. In HotOS, 2001."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/DSN.2006.5"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1288783.1288785"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.5555\/894174"},{"key":"e_1_3_2_1_35_1","unstructured":"A. D. Smith and W. Chung. The RMAP Software for Short-Read Mapping. http:\/\/rulai.cshl.edu\/rmap\/.  A. D. Smith and W. Chung. The RMAP Software for Short-Read Mapping. http:\/\/rulai.cshl.edu\/rmap\/."},{"key":"e_1_3_2_1_36_1","unstructured":"Sort Benchmark. http:\/\/sortbenchmark.org\/.  Sort Benchmark. http:\/\/sortbenchmark.org\/."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3147.3165"},{"key":"e_1_3_2_1_38_1","unstructured":"Freebase Wikipedia Extraction (WEX). http:\/\/wiki.freebase.com\/wiki\/WEX.  Freebase Wikipedia Extraction (WEX). http:\/\/wiki.freebase.com\/wiki\/WEX."},{"key":"e_1_3_2_1_39_1","unstructured":"Apache Hadoop. http:\/\/hadoop.apache.org\/.  Apache Hadoop. http:\/\/hadoop.apache.org\/."},{"key":"e_1_3_2_1_40_1","unstructured":"Scaling Hadoop to 4000 Nodes at Yahoo! http:\/\/developer.yahoo.net\/blogs\/hadoop\/2008\/09\/scaling_hadoop_to_4000_nodes_a.html.  Scaling Hadoop to 4000 Nodes at Yahoo! http:\/\/developer.yahoo.net\/blogs\/hadoop\/2008\/09\/scaling_hadoop_to_4000_nodes_a.html."},{"key":"e_1_3_2_1_41_1","volume-title":"NSDI","author":"Zaharia M.","year":"2012","unstructured":"M. Zaharia , M. Chowdhury , T. Das , A. Dave , J. Ma , M. McCauley , M. J. Franklin , S. Shenker , and I. Stoica . Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing . In NSDI , 2012 . M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M. J. Franklin, S. Shenker, and I. Stoica. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. In NSDI, 2012."},{"key":"e_1_3_2_1_42_1","volume-title":"OSDI","author":"Zaharia M.","year":"2008","unstructured":"M. Zaharia , A. Konwinski , A. D. Joseph , R. Katz , and I. Stoica . Improving MapReduce Performance in Heterogeneous Environments . In OSDI , 2008 . M. Zaharia, A. Konwinski, A. D. Joseph, R. Katz, and I. Stoica. Improving MapReduce Performance in Heterogeneous Environments. In OSDI, 2008."}],"event":{"name":"SOCC '12: ACM Symposium on Cloud Computing","location":"San Jose California","acronym":"SOCC '12","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGOPS ACM Special Interest Group on Operating Systems"]},"container-title":["Proceedings of the Third ACM Symposium on Cloud Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2391229.2391242","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2391229.2391242","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T09:34:32Z","timestamp":1750239272000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2391229.2391242"}},"subtitle":["an I\/O-efficient MapReduce"],"short-title":[],"issued":{"date-parts":[[2012,10,14]]},"references-count":40,"alternative-id":["10.1145\/2391229.2391242","10.1145\/2391229"],"URL":"https:\/\/doi.org\/10.1145\/2391229.2391242","relation":{},"subject":[],"published":{"date-parts":[[2012,10,14]]},"assertion":[{"value":"2012-10-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}