{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,7]],"date-time":"2026-02-07T10:42:26Z","timestamp":1770460946356,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,4,26]],"date-time":"2021-04-26T00:00:00Z","timestamp":1619395200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Horizon 2020 Framework Programme","award":["894204"],"award-info":[{"award-number":["894204"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,4,26]]},"DOI":"10.1145\/3447851.3458738","type":"proceedings-article","created":{"date-parts":[[2021,4,25]],"date-time":"2021-04-25T09:52:49Z","timestamp":1619344369000},"page":"18-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Frisbee"],"prefix":"10.1145","author":[{"given":"Fotis","family":"Nikolaidis","sequence":"first","affiliation":[{"name":"Institute of Computer Science, FORTH, Heraklion, Greece"}]},{"given":"Antony","family":"Chazapis","sequence":"additional","affiliation":[{"name":"Institute of Computer Science, FORTH, Heraklion, Greece"}]},{"given":"Manolis","family":"Marazakis","sequence":"additional","affiliation":[{"name":"Institute of Computer Science, FORTH, Heraklion, Greece"}]},{"given":"Angelos","family":"Bilas","sequence":"additional","affiliation":[{"name":"Institute of Computer Science, FORTH, Heraklion, Greece"}]}],"member":"320","published-online":{"date-parts":[[2021,4,26]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3030207.3030229"},{"key":"e_1_3_2_1_2_1","unstructured":"Algirdas Avizienis, Jean-Claude Laprie, Brian Randell, et al. 2001. Fundamental concepts of dependability. 
University of Newcastle upon Tyne Computing Science."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/MS.2016.60"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/FTDCS.1990.138293"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807128.1807152"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/DSN.2004.1311898"},{"key":"e_1_3_2_1_7_1","volume-title":"Reliability engineering","author":"Elsayed Elsayed","unstructured":"Elsayed Elsayed. 2021. Reliability engineering. Wiley, Hoboken, NJ."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.dss.2006.06.011"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2934872.2934891"},{"key":"e_1_3_2_1_10_1","unstructured":"grafana. 2020. The open and composable observability and data visualization platform. https:\/\/github.com\/grafana\/grafana"},{"key":"e_1_3_2_1_11_1","volume-title":"Failure as a service (faas): A cloud service for large-scale, online failure drills","author":"Gunawi Haryadi S","year":"2011","unstructured":"Haryadi S Gunawi, Thanh Do, Joseph M Hellerstein, Ion Stoica, Dhruba Borthakur, and Jesse Robbins. 2011. Failure as a service (faas): A cloud service for large-scale, online failure drills. 
University of California, Berkeley, Berkeley 3 (2011)."},{"key":"e_1_3_2_1_12_1","volume-title":"Improving Availability in Distributed Systems with Failure Informers. In 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13)","author":"Gupta Trinabh","year":"2013","unstructured":"Trinabh Gupta, Joshua B. Leners, Marcos K. Aguilera, and Michael Walfish. 2013. Improving Availability in Distributed Systems with Failure Informers. In 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13). USENIX Association, Lombard, IL, 427--441. https:\/\/www.usenix.org\/conference\/nsdi13\/technical-sessions\/presentation\/leners"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3236332"},{"key":"e_1_3_2_1_15_1","unstructured":"Jepsen. 2020. A framework for distributed systems verification with fault injection. https:\/\/github.com\/jepsen-io\/jepsen"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2885497"},{"key":"e_1_3_2_1_17_1","unstructured":"Henrique Madeira and Philip Koopman. 2001. Dependability Benchmarking: making choices in an n-dimensional problem space. (2001)."},{"key":"e_1_3_2_1_18_1","unstructured":"Fabrizio Montesi and Janine Weber. 
2016. Circuit Breakers Discovery and API Gateways in Microservices. arXiv:1609.05830 [cs.SE]"},{"key":"e_1_3_2_1_19_1","volume-title":"Proceedings of the 4th Conference on USENIX Symposium on Internet Technologies and Systems -","volume":"4","author":"Nagaraja Kiran","unstructured":"Kiran Nagaraja, Xiaoyan Li, Ricardo Bianchini, Richard P. Martin, and Thu D. Nguyen. 2003. Using Fault Injection and Modeling to Evaluate the Performability of Cluster-Based Services. In Proceedings of the 4th Conference on USENIX Symposium on Internet Technologies and Systems - Volume 4 (Seattle, WA) (USITS'03). USENIX Association, USA, 2."},{"key":"e_1_3_2_1_20_1","volume-title":"Ndbench: Benchmarking microservices at scale. arXiv preprint arXiv:1807.10792","author":"Papapanagiotou Ioannis","year":"2018","unstructured":"Ioannis Papapanagiotou and Vinay Chella. 2018. Ndbench: Benchmarking microservices at scale. arXiv preprint arXiv:1807.10792 (2018)."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2038916.2038925"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.5555\/1050517.1050538"},{"key":"e_1_3_2_1_23_1","unstructured":"PingCAP. 2020. A Chaos Engineering Platform for Kubernetes. https:\/\/github.com\/chaos-mesh\/chaos-mesh"},{"key":"e_1_3_2_1_24_1","volume-title":"TIKV: A distributed transactional key-value database. 
https:\/\/tikv.org\/","author":"PingCAP","year":"2020","unstructured":"PingCAP. 2020. TIKV: A distributed transactional key-value database. https:\/\/tikv.org\/"},{"key":"e_1_3_2_1_25_1","unstructured":"HBM Prenscia. 2007. Availability and the Different Ways to Calculate It. https:\/\/www.weibull.com\/hotwire\/issue79\/relbasics79.htm"},{"key":"e_1_3_2_1_26_1","unstructured":"prometheus. 2020. The Prometheus monitoring system and time series database. https:\/\/github.com\/prometheus\/prometheus"},{"key":"e_1_3_2_1_27_1","first-page":"43","article-title":"A Survey on Self-Healing Systems: Approaches and Systems","volume":"91","author":"Psaier Harald","year":"2011","unstructured":"Harald Psaier and Schahram Dustdar. 2011. A Survey on Self-Healing Systems: Approaches and Systems. Computing 91, 1 (Jan. 2011), 43--73. https:\/\/doi.org\/10.1007\/s00607-010-0107-y","journal-title":"Computing"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920902"},{"key":"e_1_3_2_1_29_1","unstructured":"telegraf. 2020. The plugin-driven server agent for collecting & reporting metrics. 
https:\/\/github.com\/influxdata\/telegraf"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/FTCS.1996.534616"},{"key":"e_1_3_2_1_31_1","unstructured":"Enrique Vargas and Sun BluePrints. 2000. High availability fundamentals. Sun Blueprints series (2000) 1--7."},{"key":"e_1_3_2_1_32_1","volume-title":"2017 47th Annual IEEE\/IFIP International Conference on Dependable Systems and Networks (DSN). IEEE Computer Society","author":"Wang G.","year":"2017","unstructured":"G. Wang, L. Zhang, and W. Xu. 2017. What Can We Learn from Four Years of Data Center Hardware Failures? In 2017 47th Annual IEEE\/IFIP International Conference on Dependable Systems and Networks (DSN). IEEE Computer Society, Los Alamitos, CA, USA, 25--36. https:\/\/doi.org\/10.1109\/DSN.2017.26"},{"key":"e_1_3_2_1_33_1","volume-title":"Proc. DSN 2002 Workshop on Dependability Benchmarking. IEEE Computer Society","author":"Wilson Don","year":"2002","unstructured":"Don Wilson, Brendan Murphy, and Lisa Spainhower. 2002. Progress on defining standardized classes for comparing the dependability of computer systems. In Proc. DSN 2002 Workshop on Dependability Benchmarking. 
IEEE Computer Society, Los Alamitos, CA, USA."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2019.2954871"}],"event":{"name":"EuroSys '21: Sixteenth European Conference on Computer Systems","location":"Online United Kingdom","acronym":"EuroSys '21","sponsor":["SIGOPS ACM Special Interest Group on Operating Systems"]},"container-title":["Proceedings of the 1st Workshop on High Availability and Observability of Cloud Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447851.3458738","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3447851.3458738","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T17:49:27Z","timestamp":1750268967000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447851.3458738"}},"subtitle":["A Suite for Benchmarking Systems Recovery"],"short-title":[],"issued":{"date-parts":[[2021,4,26]]},"references-count":33,"alternative-id":["10.1145\/3447851.3458738","10.1145\/3447851"],"URL":"https:\/\/doi.org\/10.1145\/3447851.3458738","relation":{},"subject":[],"published":{"date-parts":[[2021,4,26]]},"assertion":[{"value":"2021-04-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}