{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T03:47:19Z","timestamp":1772164039677,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2011,10,22]],"date-time":"2011-10-22T00:00:00Z","timestamp":1319241600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2011,10,22]]},"DOI":"10.1145\/2048066.2048082","type":"proceedings-article","created":{"date-parts":[[2011,10,25]],"date-time":"2011-10-25T08:23:06Z","timestamp":1319530986000},"page":"171-188","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":55,"title":["PREFAIL"],"prefix":"10.1145","author":[{"given":"Pallavi","family":"Joshi","sequence":"first","affiliation":[{"name":"UC Berkeley, Berkeley, CA, USA"}]},{"given":"Haryadi S.","family":"Gunawi","sequence":"additional","affiliation":[{"name":"UC Berkeley, Berkeley, CA, USA"}]},{"given":"Koushik","family":"Sen","sequence":"additional","affiliation":[{"name":"UC Berkeley, Berkeley, CA, USA"}]}],"member":"320","published-online":{"date-parts":[[2011,10,22]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Hadoop MapReduce. http:\/\/hadoop.apache.org\/mapreduce.  Hadoop MapReduce. http:\/\/hadoop.apache.org\/mapreduce."},{"key":"e_1_3_2_1_2_1","volume-title":"Proceedings of the 18th European Conference on Object-Oriented Programming (ECOOP '04)","author":"Aldrich Jonathan","year":"2004","unstructured":"Jonathan Aldrich and Craig Chambers . Ownership Domains : Separating Aliasing Policy from Mechanism . In Proceedings of the 18th European Conference on Object-Oriented Programming (ECOOP '04) , Oslo, Norway , June 2004 . Jonathan Aldrich and Craig Chambers. Ownership Domains: Separating Aliasing Policy from Mechanism. In Proceedings of the 18th European Conference on Object-Oriented Programming (ECOOP '04), Oslo, Norway, June 2004."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/566172.566191"},{"key":"e_1_3_2_1_4_1","volume-title":"Jonathan Traupman. FIG: A Prototype Tool for Online Verification of Recovery Mechanisms. In Workshop on Self-Healing, Adaptive and Self-Managed Systems.","author":"Broadwell Pete","unstructured":"Pete Broadwell , Naveen Sastry , and Jonathan Traupman. FIG: A Prototype Tool for Online Verification of Recovery Mechanisms. In Workshop on Self-Healing, Adaptive and Self-Managed Systems. Pete Broadwell, Naveen Sastry, and Jonathan Traupman. FIG: A Prototype Tool for Online Verification of Recovery Mechanisms. In Workshop on Self-Healing, Adaptive and Self-Managed Systems."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.5555\/1298455.1298487"},{"key":"e_1_3_2_1_6_1","volume-title":"Candea and Armando Fox. Crash-Only Software. In The Ninth Workshop on Hot Topics in Operating Systems (HotOS IX)","author":"George","year":"2003","unstructured":"George Candea and Armando Fox. Crash-Only Software. In The Ninth Workshop on Hot Topics in Operating Systems (HotOS IX) , Lihue, Hawaii , May 2003 . George Candea and Armando Fox. Crash-Only Software. In The Ninth Workshop on Hot Topics in Operating Systems (HotOS IX), Lihue, Hawaii, May 2003."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1281100.1281103"},{"key":"e_1_3_2_1_8_1","volume-title":"Robert Gruber. Bigtable: A Distributed Storage System for Structured Data. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (OSDI '06)","author":"Chang Fay","year":"2006","unstructured":"Fay Chang , Jeffrey Dean , Sanjay Ghemawat , Wilson C. Hsieh , Deborah A. Wallach , Michael Burrows , Tushar Chandra , Andrew Fikes , and Robert Gruber. Bigtable: A Distributed Storage System for Structured Data. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (OSDI '06) , Seattle, Washington , November 2006 . Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Michael Burrows, Tushar Chandra, Andrew Fikes, and Robert Gruber. Bigtable: A Distributed Storage System for Structured Data. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (OSDI '06), Seattle, Washington, November 2006."},{"key":"e_1_3_2_1_9_1","volume-title":"Contact Persons at Cloudera Inc","author":"Collins Eli","year":"2011","unstructured":"Eli Collins and Todd Lipcon . Contact Persons at Cloudera Inc ., 2011 . Eli Collins and Todd Lipcon. Contact Persons at Cloudera Inc., 2011."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807128.1807152"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1287624.1287651"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-024X(199712)27:12%3C1385::AID-SPE133%3E3.0.CO;2-7"},{"key":"e_1_3_2_1_13_1","volume-title":"Google I\/O","author":"Dean Jeffrey","year":"2008","unstructured":"Jeffrey Dean . Underneath the covers at google: Current systems and future directions . In Google I\/O , 2008 . Jeffrey Dean. Underneath the covers at google: Current systems and future directions. In Google I\/O, 2008."},{"key":"e_1_3_2_1_14_1","volume-title":"Sean Quinlna. Availability in Globally Distributed Storage Systems. In Proceedings of the 9th Symposium on Operating Systems Design and Implementation (OSDI '10)","author":"Ford Daniel","year":"2010","unstructured":"Daniel Ford , Franis Labelle , Florentina I. Popovici , Murray Stokely , Van-Anh Truong , Luiz Barroso , Carrie Grimes , and Sean Quinlna. Availability in Globally Distributed Storage Systems. In Proceedings of the 9th Symposium on Operating Systems Design and Implementation (OSDI '10) , Vancouver, Canada , October 2010 . Daniel Ford, Franis Labelle, Florentina I. Popovici, Murray Stokely, Van-Anh Truong, Luiz Barroso, Carrie Grimes, and Sean Quinlna. Availability in Globally Distributed Storage Systems. In Proceedings of the 9th Symposium on Operating Systems Design and Implementation (OSDI '10), Vancouver, Canada, October 2010."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1007512.1007516"},{"key":"e_1_3_2_1_16_1","volume-title":"High-End Computing File Systems and I\/O Workshop (HEC FSIO '10)","author":"Garth Gibson. Reliability\/Resilience Panel","year":"2010","unstructured":"Garth Gibson. Reliability\/Resilience Panel . In High-End Computing File Systems and I\/O Workshop (HEC FSIO '10) , Arlington , VA , August 2010 . Garth Gibson. Reliability\/Resilience Panel. In High-End Computing File Systems and I\/O Workshop (HEC FSIO '10), Arlington, VA, August 2010."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1806799.1806835"},{"key":"e_1_3_2_1_18_1","volume-title":"Koushik Sen. Fate and Destini: A Framework for Cloud Recovery Testing. In Proceedings of the 8th Symposium on Networked Systems Design and Implementation (NSDI '11)","author":"Gunawi Haryadi S.","year":"2011","unstructured":"Haryadi S. Gunawi , Thanh Do , Pallavi Joshi , Peter Alvaro , Joseph M. Hellerstein , Andrea C. Arpaci-Dusseau , Remzi H. Arpaci-Dusseau , and Koushik Sen. Fate and Destini: A Framework for Cloud Recovery Testing. In Proceedings of the 8th Symposium on Networked Systems Design and Implementation (NSDI '11) , Boston, Massachusetts , March 2011 . Haryadi S. Gunawi, Thanh Do, Pallavi Joshi, Peter Alvaro, Joseph M. Hellerstein, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, and Koushik Sen. Fate and Destini: A Framework for Cloud Recovery Testing. In Proceedings of the 8th Symposium on Networked Systems Design and Implementation (NSDI '11), Boston, Massachusetts, March 2011."},{"key":"e_1_3_2_1_19_1","volume-title":"Koushik Sen. Towards Automatically Checking Thousands of Failures with Micro-specifications. In The 6th Workshop on Hot Topics in System Dependability (HotDep '10)","author":"Gunawi Haryadi S.","year":"2010","unstructured":"Haryadi S. Gunawi , Thanh Do , Pallavi Joshi , Joseph M. Hellerstein , Andrea C. Arpaci-Dusseau , Remzi H. Arpaci-Dusseau , and Koushik Sen. Towards Automatically Checking Thousands of Failures with Micro-specifications. In The 6th Workshop on Hot Topics in System Dependability (HotDep '10) , Vancouver, Canada , October 2010 . Haryadi S. Gunawi, Thanh Do, Pallavi Joshi, Joseph M. Hellerstein, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, and Koushik Sen. Towards Automatically Checking Thousands of Failures with Micro-specifications. In The 6th Workshop on Hot Topics in System Dependability (HotDep '10), Vancouver, Canada, October 2010."},{"key":"e_1_3_2_1_20_1","volume-title":"Henry. Cloud Storage FUD: Failure and Uncertainty and Durability. In Proceedings of the 7th USENIX Symposium on File and Storage Technologies (FAST '09)","author":"Alyssa","year":"2009","unstructured":"Alyssa Henry. Cloud Storage FUD: Failure and Uncertainty and Durability. In Proceedings of the 7th USENIX Symposium on File and Storage Technologies (FAST '09) , San Francisco, California , February 2009 . Alyssa Henry. Cloud Storage FUD: Failure and Uncertainty and Durability. In Proceedings of the 7th USENIX Symposium on File and Storage Technologies (FAST '09), San Francisco, California, February 2009."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2007.01.005"},{"key":"e_1_3_2_1_22_1","unstructured":"Todd Hoff. Netflix: Continually Test by Failing Servers with Chaos Monkey. http:\/\/highscalability.com December 2010.  Todd Hoff. Netflix: Continually Test by Failing Servers with Chaos Monkey. http:\/\/highscalability.com December 2010."},{"key":"e_1_3_2_1_23_1","volume-title":"Proceedings of the 2010 USENIX Annual Technical Conference (ATC '10)","author":"Hunt Patrick","year":"2010","unstructured":"Patrick Hunt , Mahadev Konar , Flavio P. Junqueira , and Benjamin Reed . ZooKeeper : Wait-free coordination for Internet-scale systems . In Proceedings of the 2010 USENIX Annual Technical Conference (ATC '10) , Boston, Massachusetts , June 2010 . Patrick Hunt, Mahadev Konar, Flavio P. Junqueira, and Benjamin Reed. ZooKeeper: Wait-free coordination for Internet-scale systems. In Proceedings of the 2010 USENIX Annual Technical Conference (ATC '10), Boston, Massachusetts, June 2010."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/DSN.2005.45"},{"key":"e_1_3_2_1_26_1","volume-title":"Juszczyk and Schahram Dustdar. Programmable Fault Injection Testbeds for Complex SOA. In Proceedings of the 8th International Conference on Service Oriented Computing (ICSOC '10)","author":"Lukasz","year":"2010","unstructured":"Lukasz Juszczyk and Schahram Dustdar. Programmable Fault Injection Testbeds for Complex SOA. In Proceedings of the 8th International Conference on Service Oriented Computing (ICSOC '10) , San Francisco, California , December 2010 . Lukasz Juszczyk and Schahram Dustdar. Programmable Fault Injection Testbeds for Complex SOA. In Proceedings of the 8th International Conference on Service Oriented Computing (ICSOC '10), San Francisco, California, December 2010."},{"key":"e_1_3_2_1_27_1","volume-title":"AFEX: An Automated Fault Explorer for Faster System Testing","author":"Keller Lorenzo","year":"2008","unstructured":"Lorenzo Keller , Paul Marinescu , and George Candea . AFEX: An Automated Fault Explorer for Faster System Testing , 2008 . Lorenzo Keller, Paul Marinescu, and George Candea. AFEX: An Automated Fault Explorer for Faster System Testing, 2008."},{"key":"e_1_3_2_1_28_1","volume-title":"Koopman and John DeVale. Comparing the Robustness of POSIX Operating Systems. In Proceedings of the 29th International Symposium on Fault-Tolerant Computing (FTCS-29)","author":"Philip","year":"1999","unstructured":"Philip Koopman and John DeVale. Comparing the Robustness of POSIX Operating Systems. In Proceedings of the 29th International Symposium on Fault-Tolerant Computing (FTCS-29) , Madison, Wisconsin , June 1999 . Philip Koopman and John DeVale. Comparing the Robustness of POSIX Operating Systems. In Proceedings of the 29th International Symposium on Fault-Tolerant Computing (FTCS-29), Madison, Wisconsin, June 1999."},{"key":"e_1_3_2_1_29_1","volume-title":"The 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware (LADIS '09)","author":"Lakshman Avinash","year":"2009","unstructured":"Avinash Lakshman and Prashant Malik . Cassandra - a decentralized structured storage system . In The 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware (LADIS '09) , Florianopolis, Brazil , October 2009 . Avinash Lakshman and Prashant Malik. Cassandra - a decentralized structured storage system. In The 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware (LADIS '09), Florianopolis, Brazil, October 2009."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/800213.806531"},{"key":"e_1_3_2_1_31_1","volume-title":"Marinescu and George Candea. LFI: A Practical and General Library-Level Fault Injector. In Proceedings of the International Conference on Dependable Systems and Networks (DSN '09)","author":"Paul","year":"2009","unstructured":"Paul Marinescu and George Candea. LFI: A Practical and General Library-Level Fault Injector. In Proceedings of the International Conference on Dependable Systems and Networks (DSN '09) , Lisbon, Portugal , June 2009 . Paul Marinescu and George Candea. LFI: A Practical and General Library-Level Fault Injector. In Proceedings of the International Conference on Dependable Systems and Networks (DSN '09), Lisbon, Portugal, June 2009."},{"key":"e_1_3_2_1_32_1","volume-title":"George Candea. An Extensible Technique for High-Precision Testing of Recovery Code. In Proceedings of the 2010 USENIX Annual Technical Conference (ATC '10)","author":"Marinescu Paul D.","year":"2010","unstructured":"Paul D. Marinescu , Radu Banabic , and George Candea. An Extensible Technique for High-Precision Testing of Recovery Code. In Proceedings of the 2010 USENIX Annual Technical Conference (ATC '10) , Boston, Massachusetts , June 2010 . Paul D. Marinescu, Radu Banabic, and George Candea. An Extensible Technique for High-Precision Testing of Recovery Code. In Proceedings of the 2010 USENIX Annual Technical Conference (ATC '10), Boston, Massachusetts, June 2010."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/50202.50214"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.5555\/1267903.1267905"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1095810.1095830"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0951-8320(01)00136-3"},{"key":"e_1_3_2_1_37_1","volume-title":"Proceedings of the 5th USENIX Symposium on File and Storage Technologies (FAST '07)","author":"Schroeder Bianca","year":"2007","unstructured":"Bianca Schroeder and Garth Gibson . Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Symposium on File and Storage Technologies (FAST '07) , San Jose, California , February 2007 . Bianca Schroeder and Garth Gibson. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Symposium on File and Storage Technologies (FAST '07), San Jose, California, February 2007."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSST.2010.5496972"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/972374.972389"},{"key":"e_1_3_2_1_40_1","unstructured":"Hadoop Team. Hadoop Fault Injection Framework and Development Guide. http:\/\/hadoop.apache.org\/hdfs\/docs\/r0.21.0\/faultinject_framework.html.  Hadoop Team. Hadoop Fault Injection Framework and Development Guide. http:\/\/hadoop.apache.org\/hdfs\/docs\/r0.21.0\/faultinject_framework.html."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807128.1807161"},{"key":"e_1_3_2_1_42_1","volume-title":"Hadoop The Definitive Guide. O'Reilly","author":"White Tom","year":"2009","unstructured":"Tom White . Hadoop The Definitive Guide. O'Reilly , 2009 . Tom White. Hadoop The Definitive Guide. O'Reilly, 2009."},{"key":"e_1_3_2_1_43_1","volume-title":"Lidong Zhou. MODIST: Transparent Model Checking of Unmodified Distributed Systems. In Proceedings of the 6th Symposium on Networked Systems Design and Implementation (NSDI '09)","author":"Yang Junfeng","year":"2009","unstructured":"Junfeng Yang , Tisheng Chen , Ming Wu , Zhilei Xu , Xuezheng Liu , Haoxiang Lin , Mao Yang , Fan Long , Lintao Zhang , and Lidong Zhou. MODIST: Transparent Model Checking of Unmodified Distributed Systems. In Proceedings of the 6th Symposium on Networked Systems Design and Implementation (NSDI '09) , Boston, Massachusetts , April 2009 . Junfeng Yang, Tisheng Chen, Ming Wu, Zhilei Xu, Xuezheng Liu, Haoxiang Lin, Mao Yang, Fan Long, Lintao Zhang, and Lidong Zhou. MODIST: Transparent Model Checking of Unmodified Distributed Systems. In Proceedings of the 6th Symposium on Networked Systems Design and Implementation (NSDI '09), Boston, Massachusetts, April 2009."},{"key":"e_1_3_2_1_44_1","volume-title":"Proceedings of the 6th Symposium on Operating Systems Design and Implementation (OSDI '04)","author":"Yang Junfeng","year":"2004","unstructured":"Junfeng Yang , Paul Twohey , Dawson Engler , and Madanlal Musuvathi . Using Model Checking to Find Serious File System Errors . In Proceedings of the 6th Symposium on Operating Systems Design and Implementation (OSDI '04) , San Francisco, California , December 2004 . Junfeng Yang, Paul Twohey, Dawson Engler, and Madanlal Musuvathi. Using Model Checking to Find Serious File System Errors. In Proceedings of the 6th Symposium on Operating Systems Design and Implementation (OSDI '04), San Francisco, California, December 2004."}],"event":{"name":"SPLASH '11: Conference on Systems, Programming, and Applications: Software for Humanity","location":"Portland Oregon USA","acronym":"SPLASH '11","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages"]},"container-title":["Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2048066.2048082","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2048066.2048082","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:48:31Z","timestamp":1750225711000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2048066.2048082"}},"subtitle":["a programmable tool for multiple-failure injection"],"short-title":[],"issued":{"date-parts":[[2011,10,22]]},"references-count":43,"alternative-id":["10.1145\/2048066.2048082","10.1145\/2048066"],"URL":"https:\/\/doi.org\/10.1145\/2048066.2048082","relation":{"is-identical-to":[{"id-type":"doi","id":"10.1145\/2076021.2048082","asserted-by":"object"}]},"subject":[],"published":{"date-parts":[[2011,10,22]]},"assertion":[{"value":"2011-10-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}