{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T04:14:14Z","timestamp":1769746454376,"version":"3.49.0"},"reference-count":60,"publisher":"Association for Computing Machinery (ACM)","issue":"OOPSLA","license":[{"start":{"date-parts":[[2017,10,12]],"date-time":"2017-10-12T00:00:00Z","timestamp":1507766400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["CCF-1302327 and CCF-1715387"],"award-info":[{"award-number":["CCF-1302327 and CCF-1715387"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Program. Lang."],"published-print":{"date-parts":[[2017,10,12]]},"abstract":"<jats:p>Today's cloud services extensively rely on replication techniques to ensure availability and reliability. In complex datacenter network architectures, however, seemingly independent replica servers may inadvertently share deep dependencies (e.g., aggregation switches). Such unexpected common dependencies may potentially result in correlated failures across the entire replication deployments, invalidating the efforts. Although existing cloud management and diagnosis tools have been able to offer post-failure forensics, they, nevertheless, typically lead to quite prolonged failure recovery time in the cloud-scale systems. In this paper, we propose a novel language framework, named RepAudit, that manages to prevent correlated failure risks before service outages occur, by allowing cloud administrators to proactively audit the replication deployments of interest. In particular, RepAudit consists of three new components: 1) a declarative domain-specific language, RAL, for cloud administrators to write auditing programs expressing diverse auditing tasks; 2) a high-performance RAL auditing engine that generates the auditing results by accurately and efficiently analyzing the underlying structures of the target replication deployments; and 3) an RAL-code generator that can automatically produce complex RAL programs based on easily written specifications. Our evaluation result shows that RepAudit uses 80x less lines of code than state-of-the-art efforts in expressing the auditing task of determining the top-20 critical correlated-failure root causes. To the best of our knowledge, RepAudit is the first effort capable of simultaneously offering expressive, accurate and efficient correlated failure auditing to the cloud-scale replication systems.<\/jats:p>","DOI":"10.1145\/3133921","type":"journal-article","created":{"date-parts":[[2017,10,13]],"date-time":"2017-10-13T15:15:45Z","timestamp":1507907745000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["An auditing language for preventing correlated failures in the cloud"],"prefix":"10.1145","volume":"1","author":[{"given":"Ennan","family":"Zhai","sequence":"first","affiliation":[{"name":"Yale University, USA"}]},{"given":"Ruzica","family":"Piskac","sequence":"additional","affiliation":[{"name":"Yale University, USA"}]},{"given":"Ronghui","family":"Gu","sequence":"additional","affiliation":[{"name":"Columbia University, USA"}]},{"given":"Xun","family":"Lao","sequence":"additional","affiliation":[{"name":"Yale University, USA"}]},{"given":"Xi","family":"Wang","sequence":"additional","affiliation":[{"name":"Yale University, USA"}]}],"member":"320","published-online":{"date-parts":[[2017,10,12]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/945445.945454"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02579196"},{"key":"e_1_2_2_3_1","volume-title":"Maxino: A fast MaxSAT solver","author":"Alviano Mario","year":"2015","unstructured":"Mario Alviano . 2015 . Maxino: A fast MaxSAT solver . http:\/\/alviano.net\/software\/maxino\/ . (2015). Online; accessed Feb 24 2017. Mario Alviano. 2015. Maxino: A fast MaxSAT solver. http:\/\/alviano.net\/software\/maxino\/ . (2015). Online; accessed Feb 24 2017."},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/2832581.2832623"},{"key":"e_1_2_2_5_1","volume-title":"Maria Luisa Bonet, and Jordi Levy","author":"Ans\u00f3tegui Carlos","year":"2009","unstructured":"Carlos Ans\u00f3tegui , Maria Luisa Bonet, and Jordi Levy . 2009 . Solving Weighted partial MaxSAT through satisfiability testing. In 12th Theory and Applications of Satisfiability Testing (SAT) . Carlos Ans\u00f3tegui, Maria Luisa Bonet, and Jordi Levy. 2009. Solving Weighted partial MaxSAT through satisfiability testing. In 12th Theory and Applications of Satisfiability Testing (SAT)."},{"key":"e_1_2_2_6_1","volume-title":"24th Conference on Artificial Intelligence (AAAI).","author":"Ans\u00f3tegui Carlos","year":"2010","unstructured":"Carlos Ans\u00f3tegui , Maria Luisa Bonet , and Jordi Levy . 2010 . A new algorithm for weighted partial MaxSAT . In 24th Conference on Artificial Intelligence (AAAI). Carlos Ans\u00f3tegui, Maria Luisa Bonet, and Jordi Levy. 2010. A new algorithm for weighted partial MaxSAT. In 24th Conference on Artificial Intelligence (AAAI)."},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1282380.1282383"},{"key":"e_1_2_2_8_1","unstructured":"Tomas Balyo Marijn J. H. Heule and Matti Jarvisalo. 2016. SAT Competition 2016 : Solver and Benchmark Descriptions. In SAT.  Tomas Balyo Marijn J. H. Heule and Matti Jarvisalo. 2016. SAT Competition 2016 : Solver and Benchmark Descriptions. In SAT."},{"key":"e_1_2_2_9_1","volume-title":"6th USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Barham Paul","year":"2004","unstructured":"Paul Barham , Austin Donnelly , Rebecca Isaacs , and Richard Mortier . 2004 . Using Magpie for request extraction and workload modelling . In 6th USENIX Symposium on Operating Systems Design and Implementation (OSDI). Paul Barham, Austin Donnelly, Rebecca Isaacs, and Richard Mortier. 2004. Using Magpie for request extraction and workload modelling. In 6th USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1966445.1966449"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2342356.2342439"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807128.1807162"},{"key":"e_1_2_2_13_1","unstructured":"Danny Bradbury. 2016. The bigger they get the harder we fall: Thinking our way out of cloud crash. http:\/\/www.theregister. co.uk\/2016\/07\/29\/bryan_ford_bigger_icebergs\/ . (2016).  Danny Bradbury. 2016. The bigger they get the harder we fall: Thinking our way out of cloud crash. http:\/\/www.theregister. co.uk\/2016\/07\/29\/bryan_ford_bigger_icebergs\/ . (2016)."},{"key":"e_1_2_2_14_1","volume-title":"8th Biennial Conference on Innovative Data Systems Research (CIDR).","author":"Chen Ang","year":"2017","unstructured":"Ang Chen , Yang Wu , Andreas Haeberlen , Boon Thau Loo , and Wenchao Zhou . 2017 . Data provenance at Internet scale: Architecture, experiences, and the road ahead . In 8th Biennial Conference on Innovative Data Systems Research (CIDR). Ang Chen, Yang Wu, Andreas Haeberlen, Boon Thau Loo, and Wenchao Zhou. 2017. Data provenance at Internet scale: Architecture, experiences, and the road ahead. In 8th Biennial Conference on Innovative Data Systems Research (CIDR)."},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2934872.2934910"},{"key":"e_1_2_2_16_1","volume-title":"1st USENIX Symposium on Networked System Design and Implementation (NSDI).","author":"Chen Mike Y.","unstructured":"Mike Y. Chen , Anthony Accardi , Emre Kiciman , David A. Patterson , Armando Fox , and Eric A. Brewer . 2004. Path-based failure and evolution management . In 1st USENIX Symposium on Networked System Design and Implementation (NSDI). Mike Y. Chen, Anthony Accardi, Emre Kiciman, David A. Patterson, Armando Fox, and Eric A. Brewer. 2004. Path-based failure and evolution management. In 1st USENIX Symposium on Networked System Design and Implementation (NSDI)."},{"key":"e_1_2_2_17_1","volume-title":"8th USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Chen Xu","year":"2008","unstructured":"Xu Chen , Ming Zhang , Zhuoqing Morley Mao , and Paramvir Bahl . 2008 . Automating network application dependency discovery: Experiences, limitations, and new Solutions . In 8th USENIX Symposium on Operating Systems Design and Implementation (OSDI). Xu Chen, Ming Zhang, Zhuoqing Morley Mao, and Paramvir Bahl. 2008. Automating network application dependency discovery: Experiences, limitations, and new Solutions. In 8th USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_2_2_18_1","volume-title":"6th USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Cohen Ira","year":"2004","unstructured":"Ira Cohen , Jeffrey S. Chase , Mois\u00e9s Goldszmidt , Terence Kelly , and Julie Symons . 2004 . Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control . In 6th USENIX Symposium on Operating Systems Design and Implementation (OSDI). Ira Cohen, Jeffrey S. Chase, Mois\u00e9s Goldszmidt, Terence Kelly, and Julie Symons. 2004. Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control. In 6th USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_2_2_19_1","volume-title":"6th USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Dunagan John","year":"2004","unstructured":"John Dunagan , Nicholas J. A. Harvey , Michael B. Jones , Dejan Kostic , Marvin Theimer , and Alec Wolman . 2004 . F USE: Lightweight guaranteed distributed failure notification . In 6th USENIX Symposium on Operating Systems Design and Implementation (OSDI). John Dunagan, Nicholas J. A. Harvey, Michael B. Jones, Dejan Kostic, Marvin Theimer, and Alec Wolman. 2004. F USE: Lightweight guaranteed distributed failure notification. In 6th USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_2_2_20_1","volume-title":"9th USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Ford Daniel","year":"2010","unstructured":"Daniel Ford , Fran\u00e7ois Labelle , Florentina I. Popovici , Murray Stokely , Van-Anh Truong , Luiz Barroso , Carrie Grimes , and Sean Quinlan . 2010 . Availability in globally distributed storage systems . In 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI). Daniel Ford, Fran\u00e7ois Labelle, Florentina I. Popovici, Murray Stokely, Van-Anh Truong, Luiz Barroso, Carrie Grimes, and Sean Quinlan. 2010. Availability in globally distributed storage systems. In 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2018436.2018477"},{"key":"e_1_2_2_22_1","volume-title":"5th ACM Symposium on Cloud Computing (SoCC).","author":"Gunawi Haryadi S.","unstructured":"Haryadi S. Gunawi , Mingzhe Hao , Tanakorn Leesatapornwongsa , Tiratat Patana-anake, Thanh Do , Jeffry Adityama , Kurnia J. Eliazar , Agung Laksono , Jeffrey F. Lukman , Vincentius Martin , and Anang D. Satria . 2014. What bugs live in the cloud? A study of 3000+ issues in cloud systems . In 5th ACM Symposium on Cloud Computing (SoCC). Haryadi S. Gunawi, Mingzhe Hao, Tanakorn Leesatapornwongsa, Tiratat Patana-anake, Thanh Do, Jeffry Adityama, Kurnia J. Eliazar, Agung Laksono, Jeffrey F. Lukman, Vincentius Martin, and Anang D. Satria. 2014. What bugs live in the cloud? A study of 3000+ issues in cloud systems. In 5th ACM Symposium on Cloud Computing (SoCC)."},{"key":"e_1_2_2_23_1","volume-title":"7th ACM Symposium on Cloud Computing (SoCC).","author":"Gunawi Haryadi S.","unstructured":"Haryadi S. Gunawi , Mingzhe Hao , Riza O. Suminto , Agung Laksono , Anang D. Satria , Jeffry Adityatama , and Kurnia J. Eliazar . 2016. Why does the cloud stop computing? Lessons from hundreds of service outages . In 7th ACM Symposium on Cloud Computing (SoCC). Haryadi S. Gunawi, Mingzhe Hao, Riza O. Suminto, Agung Laksono, Anang D. Satria, Jeffry Adityatama, and Kurnia J. Eliazar. 2016. Why does the cloud stop computing? Lessons from hundreds of service outages. In 7th ACM Symposium on Cloud Computing (SoCC)."},{"key":"e_1_2_2_24_1","volume-title":"3rd ACM SIGOPS International Workshop on Large-Scale Distributed Systems and Middleware (LADIS).","author":"Haeberlen Andreas","year":"2009","unstructured":"Andreas Haeberlen . 2009 . A case for the accountable cloud . In 3rd ACM SIGOPS International Workshop on Large-Scale Distributed Systems and Middleware (LADIS). Andreas Haeberlen. 2009. A case for the accountable cloud. In 3rd ACM SIGOPS International Workshop on Large-Scale Distributed Systems and Middleware (LADIS)."},{"key":"e_1_2_2_25_1","volume-title":"9th USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Haeberlen Andreas","year":"2010","unstructured":"Andreas Haeberlen , Paarijaat Aditya , Rodrigo Rodrigues , and Peter Druschelnd . 2010 . Accountable virtual machines . In 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI). Andreas Haeberlen, Paarijaat Aditya, Rodrigo Rodrigues, and Peter Druschelnd. 2010. Accountable virtual machines. In 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_2_2_26_1","unstructured":"Devindra Hardaware. 2011. Apple\u2019s iCloud runs on Microsoft\u2019s Azure and Amazon\u2019s cloud. http:\/\/venturebeat.com\/2011\/ 09\/03\/icloud- azure- amazon\/ . (2011).  Devindra Hardaware. 2011. Apple\u2019s iCloud runs on Microsoft\u2019s Azure and Amazon\u2019s cloud. http:\/\/venturebeat.com\/2011\/ 09\/03\/icloud- azure- amazon\/ . (2011)."},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2076732.2076738"},{"key":"e_1_2_2_28_1","volume-title":"Conf Valley: A systematic configuration validation framework for cloud services. In 10th European Conference on Computer Systems (EuroSys).","author":"Huang Peng","year":"2015","unstructured":"Peng Huang , William J. Bolosky , Abhishek Singh , and Yuanyuan Zhou . 2015 . Conf Valley: A systematic configuration validation framework for cloud services. In 10th European Conference on Computer Systems (EuroSys). Peng Huang, William J. Bolosky, Abhishek Singh, and Yuanyuan Zhou. 2015. Conf Valley: A systematic configuration validation framework for cloud services. In 10th European Conference on Computer Systems (EuroSys)."},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2737924.2737957"},{"key":"e_1_2_2_30_1","volume-title":"Optical Fiber Telecommunications IIIA","author":"Kaminow Ivan P","unstructured":"Ivan P Kaminow and Thomas L Koch . 1997. Optical Fiber Telecommunications IIIA . Academic Press , New York . Ivan P Kaminow and Thomas L Koch. 1997. Optical Fiber Telecommunications IIIA. Academic Press, New York."},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1080173.1080178"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1592568.1592597"},{"key":"e_1_2_2_33_1","volume-title":"IP Fault Localization Via Risk Modeling. In 2nd USENIX Symposium on Networked System Design and Implementation (NSDI).","author":"Kompella Ramana Rao","unstructured":"Ramana Rao Kompella , Jennifer Yates , Albert G. Greenberg , and Alex C. Snoeren . 2005 . IP Fault Localization Via Risk Modeling. In 2nd USENIX Symposium on Networked System Design and Implementation (NSDI). Ramana Rao Kompella, Jennifer Yates, Albert G. Greenberg, and Alex C. Snoeren. 2005. IP Fault Localization Via Risk Modeling. In 2nd USENIX Symposium on Networked System Design and Implementation (NSDI)."},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-31424-7_32"},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2043556.2043583"},{"key":"e_1_2_2_36_1","volume-title":"12th USENIX Symposium on Networked System Design and Implementation (NSDI).","author":"Lopes Nuno P.","year":"2015","unstructured":"Nuno P. Lopes , Nikolaj Bj\u00f8rner , Patrice Godefroid , Karthick Jayaraman , and George Varghese . 2015 . Checking beliefs in dynamic networks . In 12th USENIX Symposium on Networked System Design and Implementation (NSDI). Nuno P. Lopes, Nikolaj Bj\u00f8rner, Patrice Godefroid, Karthick Jayaraman, and George Varghese. 2015. Checking beliefs in dynamic networks. In 12th USENIX Symposium on Networked System Design and Implementation (NSDI)."},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2737924.2737980"},{"key":"e_1_2_2_38_1","unstructured":"Radhika Niranjan Mysore Andreas Pamboris Nathan Farrington Nelson Huang Pardis Miri Sivasankar Radhakrishnan Vikram Subramanya and Amin Vahdat. 2009. PortLand: A Scalable Fault-tolerant Layer 2 Data Center Network Fabric. In ACM SIGCOMM (SIGCOMM).  Radhika Niranjan Mysore Andreas Pamboris Nathan Farrington Nelson Huang Pardis Miri Sivasankar Radhakrishnan Vikram Subramanya and Amin Vahdat. 2009. PortLand: A Scalable Fault-tolerant Layer 2 Data Center Network Fabric. In ACM SIGCOMM (SIGCOMM)."},{"key":"e_1_2_2_39_1","volume-title":"Hutchinson","author":"Natarajan Arun","year":"2012","unstructured":"Arun Natarajan , Peng Ning , Yao Liu , Sushil Jajodia , and Steve E . Hutchinson . 2012 . NSDMiner: Automated discovery of network service dependencies. In 31st IEEE INFOCOM (INFOCOM) . Arun Natarajan, Peng Ning, Yao Liu, Sushil Jajodia, and Steve E. Hutchinson. 2012. NSDMiner: Automated discovery of network service dependencies. In 31st IEEE INFOCOM (INFOCOM)."},{"key":"e_1_2_2_40_1","volume-title":"3rd USENIX\/ACM Symposium on Networked Systems Design and Implementation (NSDI).","author":"Nath Suman","year":"2006","unstructured":"Suman Nath , Haifeng Yu , Phillip B. Gibbons , and Srinivasan Seshan . 2006 . Subtleties in tolerating correlated failures in wide-area storage systems . In 3rd USENIX\/ACM Symposium on Networked Systems Design and Implementation (NSDI). Suman Nath, Haifeng Yu, Phillip B. Gibbons, and Srinivasan Seshan. 2006. Subtleties in tolerating correlated failures in wide-area storage systems. In 3rd USENIX\/ACM Symposium on Networked Systems Design and Implementation (NSDI)."},{"key":"e_1_2_2_41_1","volume-title":"On the Accurate Identification of Network Service Dependencies in Distributed Systems. In 26th Large Installation System Administration Conference (LISA).","author":"Peng Ning Barry Peddycord III","year":"2012","unstructured":"Barry Peddycord III , Peng Ning , and Sushil Jajodia . 2012 . On the Accurate Identification of Network Service Dependencies in Distributed Systems. In 26th Large Installation System Administration Conference (LISA). Barry Peddycord III, Peng Ning, and Sushil Jajodia. 2012. On the Accurate Identification of Network Service Dependencies in Distributed Systems. In 26th Large Installation System Administration Conference (LISA)."},{"key":"e_1_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2837614.2837657"},{"key":"e_1_2_2_43_1","volume-title":"3rd Symposium on Networked Systems Design and Implementation (NSDI).","author":"Reynolds Patrick","year":"2006","unstructured":"Patrick Reynolds , Charles Edwin Killian , Janet L. Wiener , Jeffrey C. Mogul , Mehul A. Shah , and Amin Vahdat . 2006 . Pip: Detecting the unexpected in distributed systems . In 3rd Symposium on Networked Systems Design and Implementation (NSDI). Patrick Reynolds, Charles Edwin Killian, Janet L. Wiener, Jeffrey C. Mogul, Mehul A. Shah, and Amin Vahdat. 2006. Pip: Detecting the unexpected in distributed systems. In 3rd Symposium on Networked Systems Design and Implementation (NSDI)."},{"key":"e_1_2_2_44_1","unstructured":"Lorenzo Saino Cosmin Cocora and George Pavlou. 2013. Fast Network Simulation Setup. https:\/\/github.com\/fnss\/fnss . (2013).  Lorenzo Saino Cosmin Cocora and George Pavlou. 2013. Fast Network Simulation Setup. https:\/\/github.com\/fnss\/fnss . (2013)."},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.5555\/1361397.1361408"},{"key":"e_1_2_2_46_1","volume-title":"Rackspace Outage Nov 12th","author":"Steven Rew","unstructured":"Rew Steven . 2014. Rackspace Outage Nov 12th . http:\/\/www.realestatewebmasters.com\/blogs\/rew- steven\/ rackspace- outage- nov- 12th\/show\/ . (2014). Online; accessed Feb 24 2017. Rew Steven. 2014. Rackspace Outage Nov 12th. http:\/\/www.realestatewebmasters.com\/blogs\/rew- steven\/ rackspace- outage- nov- 12th\/show\/ . (2014). Online; accessed Feb 24 2017."},{"key":"e_1_2_2_47_1","volume-title":"Summary of the","author":"Team The AWS","year":"2012","unstructured":"The AWS Team . 2012. Summary of the October 22, 2012 AWS Service Event in the US-East Region. https:\/\/aws.amazon. com\/message\/680342\/ . (2012). Online ; accessed Feb 24 2017. The AWS Team. 2012. Summary of the October 22, 2012 AWS Service Event in the US-East Region. https:\/\/aws.amazon. com\/message\/680342\/ . (2012). Online; accessed Feb 24 2017."},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/2666356.2594310"},{"key":"e_1_2_2_49_1","doi-asserted-by":"crossref","unstructured":"Xin Wu Daniel Turner Chao-Chih Chen David A. Maltz Xiaowei Yang Lihua Yuan and Ming Zhang. 2012. NetPilot: Automating datacenter network failure mitigation. In ACM SIGCOMM (SIGCOMM).  Xin Wu Daniel Turner Chao-Chih Chen David A. Maltz Xiaowei Yang Lihua Yuan and Ming Zhang. 2012. NetPilot: Automating datacenter network failure mitigation. In ACM SIGCOMM (SIGCOMM).","DOI":"10.1145\/2342356.2342438"},{"key":"e_1_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/2619239.2626335"},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517488.2517493"},{"key":"e_1_2_2_52_1","volume-title":"23rd Annual Symposium on Foundations of Computer Science (FOCS).","author":"Chi-Chih Yao Andrew","year":"1982","unstructured":"Andrew Chi-Chih Yao . 1982 . Protocols for Secure Computations (Extended Abstract) . In 23rd Annual Symposium on Foundations of Computer Science (FOCS). Andrew Chi-Chih Yao. 1982. Protocols for Secure Computations (Extended Abstract). In 23rd Annual Symposium on Foundations of Computer Science (FOCS)."},{"key":"e_1_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/2524224.2524231"},{"key":"e_1_2_2_54_1","volume-title":"11th USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Zhai Ennan","year":"2014","unstructured":"Ennan Zhai , Ruichuan Chen , David Isaac Wolinsky , and Bryan Ford . 2014 . Heading off correlated failures through Independence-as-a-service . In 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI). Ennan Zhai, Ruichuan Chen, David Isaac Wolinsky, and Bryan Ford. 2014. Heading off correlated failures through Independence-as-a-service. In 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICWS.2015.94"},{"key":"e_1_2_2_56_1","volume-title":"Hongda Xiao, Hongqiang Liu, Xueyuan Su, and Bryan Ford.","author":"Zhai Ennan","year":"2013","unstructured":"Ennan Zhai , David Isaac Wolinsky , Hongda Xiao, Hongqiang Liu, Xueyuan Su, and Bryan Ford. 2013 . Auditing the Structural Reliability of the Clouds. Technical Report YALEU\/DCS\/TR-1479. Department of Computer Science, Yale University . Available at http:\/\/cpsc.yale.edu\/sites\/default\/files\/files\/tr1479.pdf . Ennan Zhai, David Isaac Wolinsky, Hongda Xiao, Hongqiang Liu, Xueyuan Su, and Bryan Ford. 2013. Auditing the Structural Reliability of the Clouds. Technical Report YALEU\/DCS\/TR-1479. Department of Computer Science, Yale University. Available at http:\/\/cpsc.yale.edu\/sites\/default\/files\/files\/tr1479.pdf ."},{"key":"e_1_2_2_57_1","volume-title":"12th USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Zhao Xu","year":"2016","unstructured":"Xu Zhao , Kirk Rodrigues , Yu Luo , Ding Yuan , and Michael Stumm . 2016 . Non-intrusive performance profiling for entire software stacks based on the flow reconstruction principle . In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI). Xu Zhao, Kirk Rodrigues, Yu Luo, Ding Yuan, and Michael Stumm. 2016. Non-intrusive performance profiling for entire software stacks based on the flow reconstruction principle. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_2_2_58_1","volume-title":"11th USENIX Symposium on Operating Systems Design and Implementation (OSDI).","author":"Zhao Xu","year":"2014","unstructured":"Xu Zhao , Yongle Zhang , David Lion , Muhammad Faizan Ullah , Yu Luo , Ding Yuan , and Michael Stumm . 2014 . lprof: A Non-intrusive Request Flow Profiler for Distributed Systems . In 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI). Xu Zhao, Yongle Zhang, David Lion, Muhammad Faizan Ullah, Yu Luo, Ding Yuan, and Michael Stumm. 2014. lprof: A Non-intrusive Request Flow Profiler for Distributed Systems. In 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI)."},{"key":"e_1_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/2043556.2043584"},{"key":"e_1_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989488"}],"container-title":["Proceedings of the ACM on Programming Languages"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3133921","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3133921","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3133921","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:13:25Z","timestamp":1750212805000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3133921"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,10,12]]},"references-count":60,"journal-issue":{"issue":"OOPSLA","published-print":{"date-parts":[[2017,10,12]]}},"alternative-id":["10.1145\/3133921"],"URL":"https:\/\/doi.org\/10.1145\/3133921","relation":{},"ISSN":["2475-1421"],"issn-type":[{"value":"2475-1421","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,10,12]]},"assertion":[{"value":"2017-10-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}