{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T06:55:34Z","timestamp":1760597734945,"version":"3.41.0"},"reference-count":30,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2017,5,10]],"date-time":"2017-05-10T00:00:00Z","timestamp":1494374400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGMETRICS Perform. Eval. Rev."],"published-print":{"date-parts":[[2017,5,10]]},"abstract":"<jats:p>We quantify the resiliency of large-scale systems upon changes encountered beyond the normal system behavior. Formal definitions for resiliency and change are provided together with general steps for resiliency quantification and a set of resiliency metrics that can be used to quantify the effects of changes. A formalization of the approach is also shown in the form of a set of four algorithms that can be applied when large-scale systems are modeled through stochastic analytic state space models (monolithic models or interacting submodels). In particular, in the case of interacting submodels, since resiliency quantification involves understanding the transient behavior of the system, fixed-point variables evolve with time, leading to non-homogeneous Markov chains. To the best of our knowledge, this is the first paper to address this problem in a general way. The proposed approach is applied to an Infrastructure-as-a-Service (IaaS) Cloud use case. 
Specifically, we assess the impact of changes in demand and available capacity on Cloud resiliency, and we show that the proposed approach scales to a real-sized Cloud without significantly compromising accuracy.<\/jats:p>","DOI":"10.1145\/3092819.3092825","type":"journal-article","created":{"date-parts":[[2017,5,10]],"date-time":"2017-05-10T18:08:53Z","timestamp":1494439733000},"page":"37-48","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["An Approach for Resiliency Quantification of Large Scale Systems"],"prefix":"10.1145","volume":"44","author":[{"given":"Francesco","family":"Longo","sequence":"first","affiliation":[{"name":"Universit\u00e0 degli Studi di Messina, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rahul","family":"Ghosh","sequence":"additional","affiliation":[{"name":"Xerox Research Center, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vijay K.","family":"Naik","sequence":"additional","affiliation":[{"name":"IBM T. J. Watson Research Center, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andrew J.","family":"Rindos","sequence":"additional","affiliation":[{"name":"IBM, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kishor S.","family":"Trivedi","sequence":"additional","affiliation":[{"name":"Duke University, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,5,10]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFCOM.2011.5934942"},{"key":"e_1_2_1_2_1","volume-title":"Reliability assessment of wireless sensor nodes with non-linear battery discharge","author":"Bruneo D.","year":"2010","unstructured":"D. Bruneo , S. Distefano , F. Longo , A. Puliafito , and M. Scarpa . Reliability assessment of wireless sensor nodes with non-linear battery discharge . 2010 . D. 
Bruneo, S. Distefano, F. Longo, A. Puliafito, and M. Scarpa. Reliability assessment of wireless sensor nodes with non-linear battery discharge. 2010."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342009347767"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.14529\/jsfi140101"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/12.257705"},{"key":"e_1_2_1_6_1","volume-title":"DeBardeleben et al. High-End Computing Resilience: Analysis of Issues Facing the HEC Community and Path-Forward for Research and Development","author":"N.","year":"2010","unstructured":"N. DeBardeleben et al. High-End Computing Resilience: Analysis of Issues Facing the HEC Community and Path-Forward for Research and Development . 2010 . N. DeBardeleben et al. High-End Computing Resilience: Analysis of Issues Facing the HEC Community and Path-Forward for Research and Development. 2010."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/HASE.2010.28"},{"key":"e_1_2_1_8_1","volume-title":"National HPC Workshop on Resilience","author":"Engelmann C.","year":"2009","unstructured":"C. Engelmann and C. Leangsuksun . Modeling techniques towards resilience . In National HPC Workshop on Resilience , 2009 . C. Engelmann and C. Leangsuksun. Modeling techniques towards resilience. In National HPC Workshop on Resilience, 2009."},{"key":"e_1_2_1_9_1","volume-title":"ICT","author":"Erdene-Ochir O.","year":"2010","unstructured":"O. Erdene-Ochir , M. Minier , F. Valois , and A. Kountouris . Resiliency of wireless sensor networks: Definitions and analyses . In ICT , 2010 . O. Erdene-Ochir, M. Minier, F. Valois, and A. Kountouris. Resiliency of wireless sensor networks: Definitions and analyses. In ICT, 2010."},{"key":"e_1_2_1_10_1","volume-title":"InfQ 2016 workshop","author":"Ghosh R.","year":"2016","unstructured":"R. Ghosh , F. Longo , V. Naik , R. A.J., and K. Trivedi . 
Resiliency quantification for large scale systems: An iaas cloud use case . In InfQ 2016 workshop , 2016 . R. Ghosh, F. Longo, V. Naik, R. A.J., and K. Trivedi. Resiliency quantification for large scale systems: An iaas cloud use case. In InfQ 2016 workshop, 2016."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2012.06.005"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2012.06.005"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/PRDC.2010.30"},{"key":"e_1_2_1_14_1","volume-title":"Performability Modeling Tools and Techniques","author":"Haverkort B.","year":"2001","unstructured":"B. Haverkort , R. Marie , G. Rubino , and K. S. Trivedi (eds.). Performability Modeling Tools and Techniques . Wiley , 2001 . B. Haverkort, R. Marie, G. Rubino, and K. S. Trivedi (eds.). Performability Modeling Tools and Techniques. Wiley, 2001."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.comnet.2009.02.014"},{"key":"e_1_2_1_16_1","series-title":"Lecture Notes in Computer Science","volume-title":"SPNP: stochastic Petri nets. version 6","author":"Hirel C.","year":"2000","unstructured":"C. Hirel , B. Tuffin , and K. S. Trivedi . SPNP: stochastic Petri nets. version 6 . In Lecture Notes in Computer Science , 2000 . C. Hirel, B. Tuffin, and K. S. Trivedi. SPNP: stochastic Petri nets. version 6. In Lecture Notes in Computer Science, 2000."},{"key":"e_1_2_1_17_1","volume-title":"Computer and Information Security Handbook","author":"Jhawar R.","year":"2013","unstructured":"R. Jhawar and V. Piuri . Fault tolerance and resilience in cloud computing environments . In J. R. Vacca, editor, Computer and Information Security Handbook , 2 nd Edition. Morgan Kaufmann , MA , USA, 2013 . R. Jhawar and V. Piuri. Fault tolerance and resilience in cloud computing environments. In J. R. Vacca, editor, Computer and Information Security Handbook, 2nd Edition. 
Morgan Kaufmann, MA, USA, 2013.","edition":"2"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.2307\/1427109"},{"key":"e_1_2_1_19_1","volume-title":"DSN","author":"Laprie J. C.","year":"2008","unstructured":"J. C. Laprie . From dependability to resilience . In DSN , 2008 . J. C. Laprie. From dependability to resilience. In DSN, 2008."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLOUD.2011.72"},{"issue":"1","key":"e_1_2_1_21_1","first-page":"29","article-title":"Survivability quantification: The analytical modeling approach","volume":"2","author":"Liu Y.","year":"2006","unstructured":"Y. Liu and K. S. Trivedi . Survivability quantification: The analytical modeling approach . Intl. Journal of Performability Eng. , 2 ( 1 ): 29 -- 44 , 2006 . Y. Liu and K. S. Trivedi. Survivability quantification: The analytical modeling approach. Intl. Journal of Performability Eng., 2(1):29--44, 2006.","journal-title":"Journal of Performability Eng."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/12.45203"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ESCIW.2009.5407992"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCNC.2012.6167569"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2009.5160867"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.comnet.2010.03.005"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICNP.2004.1348112"},{"key":"e_1_2_1_28_1","volume-title":"Probability and Statistics with Reliability, Queuing and Computer Science Applications","author":"Trivedi K. S.","year":"2001","unstructured":"K. S. Trivedi . Probability and Statistics with Reliability, Queuing and Computer Science Applications . Wiley , 2001 . K. S. Trivedi. Probability and Statistics with Reliability, Queuing and Computer Science Applications. 
Wiley, 2001."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1530873.1530884"},{"key":"e_1_2_1_30_1","volume-title":"Jie Wu","author":"Xuan D.","year":"2005","unstructured":"D. Xuan , S. Chellappan , and X. Wang . Resilience of structured peer to peer systems: Analysis and enhancement . In Jie Wu , editor, Handbook On Theoretical And Algorithmic Aspects Of Sensor, Ad Hoc Wireless, and Peer-to-Peer Networks. Auerbach Publications , 2005 D. Xuan, S. Chellappan, and X. Wang. Resilience of structured peer to peer systems: Analysis and enhancement. In Jie Wu, editor, Handbook On Theoretical And Algorithmic Aspects Of Sensor, Ad Hoc Wireless, and Peer-to-Peer Networks. Auerbach Publications, 2005"}],"container-title":["ACM SIGMETRICS Performance Evaluation Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3092819.3092825","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3092819.3092825","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:37:27Z","timestamp":1750217847000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3092819.3092825"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,5,10]]},"references-count":30,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2017,5,10]]}},"alternative-id":["10.1145\/3092819.3092825"],"URL":"https:\/\/doi.org\/10.1145\/3092819.3092825","relation":{},"ISSN":["0163-5999"],"issn-type":[{"type":"print","value":"0163-5999"}],"subject":[],"published":{"date-parts":[[2017,5,10]]},"assertion":[{"value":"2017-05-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}