{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,13]],"date-time":"2026-05-13T19:39:31Z","timestamp":1778701171198,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":35,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,11,2]],"date-time":"2020-11-02T00:00:00Z","timestamp":1604275200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,11,2]]},"DOI":"10.1145\/3415958.3433046","type":"proceedings-article","created":{"date-parts":[[2020,11,27]],"date-time":"2020-11-27T23:10:58Z","timestamp":1606518658000},"page":"149-156","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["Event Management and Monitoring Framework for HPC Environments using ServiceNow and Prometheus"],"prefix":"10.1145","author":[{"given":"Nitin","family":"Sukhija","sequence":"first","affiliation":[{"name":"Department of Computer Science, Slippery Rock University of Pennsylvania, Slippery Rock, PA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Elizabeth","family":"Bautista","sequence":"additional","affiliation":[{"name":"NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Owen","family":"James","sequence":"additional","affiliation":[{"name":"NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Gens","sequence":"additional","affiliation":[{"name":"NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Siqi","family":"Deng","sequence":"additional","affiliation":[{"name":"NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yulok","family":"Lam","sequence":"additional","affiliation":[{"name":"NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tony","family":"Quan","sequence":"additional","affiliation":[{"name":"NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Basil","family":"Lalli","sequence":"additional","affiliation":[{"name":"NERSC, Lawrence Berkeley National Laboratory, Berkeley, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,11,27]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"[n. d.]. Apache Kafka. https:\/\/kafka.apache.org\/  [n. d.]. Apache Kafka. https:\/\/kafka.apache.org\/"},{"key":"e_1_3_2_1_2_1","unstructured":"[n. d.]. Cori: NERSC's newest supercomputer. https:\/\/www.nersc.gov\/users\/computational- systems\/cori\/  [n. d.]. Cori: NERSC's newest supercomputer. https:\/\/www.nersc.gov\/users\/computational- systems\/cori\/"},{"key":"e_1_3_2_1_3_1","unstructured":"[n. d.]. Elasticsearch: Distributed RESTful Engine. https:\/\/www.elastic.co\/products\/elasticsearch  [n. d.]. Elasticsearch: Distributed RESTful Engine. https:\/\/www.elastic.co\/products\/elasticsearch"},{"key":"e_1_3_2_1_4_1","unstructured":"[n. d.]. Grafana. https:\/\/grafana.com\/  [n. d.]. Grafana. https:\/\/grafana.com\/"},{"key":"e_1_3_2_1_5_1","unstructured":"[n. d.]. Icinga. https:\/\/icinga.com\/  [n. d.]. Icinga. https:\/\/icinga.com\/"},{"key":"e_1_3_2_1_6_1","unstructured":"[n. d.]. Kibana:Your Window into the Elastic Stack. https:\/\/www.elastic.co\/products\/kibana  [n. d.]. Kibana:Your Window into the Elastic Stack. https:\/\/www.elastic.co\/products\/kibana"},{"key":"e_1_3_2_1_7_1","unstructured":"[n. d.]. Logstash: Centralize Transform and Stash Your Data. https:\/\/www.elastic.co\/products\/logstash  [n. d.]. Logstash: Centralize Transform and Stash Your Data. https:\/\/www.elastic.co\/products\/logstash"},{"key":"e_1_3_2_1_8_1","unstructured":"[n. d.]. Nagios. https:\/\/www.nagios.org\/  [n. d.]. Nagios. https:\/\/www.nagios.org\/"},{"key":"e_1_3_2_1_9_1","unstructured":"[n. d.]. OpenLorenz: Web-Based HPC Dashboard and More. https:\/\/software.llnl.gov\/repo\/#\/hpc\/OpenLorenz  [n. d.]. OpenLorenz: Web-Based HPC Dashboard and More. https:\/\/software.llnl.gov\/repo\/#\/hpc\/OpenLorenz"},{"key":"e_1_3_2_1_10_1","unstructured":"[n. d.]. Paessler PRTG Network Monitor. https:\/\/www.paessler.com\/prtg  [n. d.]. Paessler PRTG Network Monitor. https:\/\/www.paessler.com\/prtg"},{"key":"e_1_3_2_1_11_1","unstructured":"[n. d.]. Perlmutter: NERSC's Next Supercomputer. https:\/\/www.nersc.gov\/systems\/perlmutter\/  [n. d.]. Perlmutter: NERSC's Next Supercomputer. https:\/\/www.nersc.gov\/systems\/perlmutter\/"},{"key":"e_1_3_2_1_12_1","unstructured":"[n. d.]. Prometheus. https:\/\/prometheus.io\/  [n. d.]. Prometheus. https:\/\/prometheus.io\/"},{"key":"e_1_3_2_1_13_1","unstructured":"[n. d.]. ServiceNow. https:\/\/www.servicenow.com\/  [n. d.]. ServiceNow. https:\/\/www.servicenow.com\/"},{"key":"e_1_3_2_1_14_1","unstructured":"[n. d.]. Spiceworks Network Monitoring Management Software. https:\/\/www.spiceworks.com\/  [n. d.]. Spiceworks Network Monitoring Management Software. https:\/\/www.spiceworks.com\/"},{"key":"e_1_3_2_1_15_1","unstructured":"[n. d.]. VictoriaMetrics. https:\/\/victoriametrics.com\/  [n. d.]. VictoriaMetrics. https:\/\/victoriametrics.com\/"},{"key":"e_1_3_2_1_16_1","unstructured":"[n. d.]. Zabbix. https:\/\/www.zabbix.com\/  [n. d.]. Zabbix. https:\/\/www.zabbix.com\/"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/NBiS.2015.61"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/513\/6\/062032"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3339186.3339213"},{"key":"e_1_3_2_1_20_1","volume-title":"Conquering Big Data with High Performance Computing","author":"Bautista Elizabeth","unstructured":"Elizabeth Bautista , Cary Whitney , and Thomas Davis . 2016. Big data behind big data . In Conquering Big Data with High Performance Computing . Springer , 163--189. Elizabeth Bautista, Cary Whitney, and Thomas Davis. 2016. Big data behind big data. In Conquering Big Data with High Performance Computing. Springer, 163--189."},{"key":"e_1_3_2_1_21_1","unstructured":"Keren Bergman Shekhar Borkar Dan Campbell William Carlson William Dally Monty Denneau Paul Franzon William Harrod Kerry Hill Jon Hiller etal 2008. Exascale computing study: Technology challenges in achieving exascale systems. Defense Advanced Research Projects Agency Information Processing Techniques Office (DARPA IPTO) Tech. Rep 15 (2008).  Keren Bergman Shekhar Borkar Dan Campbell William Carlson William Dally Monty Denneau Paul Franzon William Harrod Kerry Hill Jon Hiller et al. 2008. Exascale computing study: Technology challenges in achieving exascale systems. Defense Advanced Research Projects Agency Information Processing Techniques Office (DARPA IPTO) Tech. Rep 15 (2008)."},{"key":"e_1_3_2_1_22_1","unstructured":"J Davin JD Case M Fedor and ML Schoffstall. 1989. Simple network management protocol (SNMP). (1989).  J Davin JD Case M Fedor and ML Schoffstall. 1989. Simple network management protocol (SNMP). (1989)."},{"key":"e_1_3_2_1_23_1","volume-title":"Anthony Michael Agelastos, et al","author":"DeConinck Adam","year":"2016","unstructured":"Adam DeConinck , A Bonnie , K Kelly , S Sanchez , C Martin , M Mason , James M Brandt , Ann C Gentile , Benjamin A Allan , Anthony Michael Agelastos, et al . 2016 . Design and Implementation of a Scalable Monitoring System for Trinity. Technical Report. Sandia National Lab.(SNL-NM), Albuquerque, NM (United States) . Adam DeConinck, A Bonnie, K Kelly, S Sanchez, C Martin, M Mason, James M Brandt, Ann C Gentile, Benjamin A Allan, Anthony Michael Agelastos, et al. 2016. Design and Implementation of a Scalable Monitoring System for Trinity. Technical Report. Sandia National Lab.(SNL-NM), Albuquerque, NM (United States)."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASC48083.2019.8946279"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/1208311"},{"key":"e_1_3_2_1_26_1","volume-title":"PaaS, and IaaS)","author":"Kavis Michael J","unstructured":"Michael J Kavis . 2014. Architecting the cloud: design decisions for cloud computing service models (SaaS , PaaS, and IaaS) . John Wiley & Sons . Michael J Kavis. 2014. Architecting the cloud: design decisions for cloud computing service models (SaaS, PaaS, and IaaS). John Wiley & Sons."},{"key":"e_1_3_2_1_27_1","volume-title":"2nd International Industry\/University Workshop on Data-center Automation, Analytics, and Control (DAAC","author":"Libri Antonio","year":"2018","unstructured":"Antonio Libri , Andrea Bartolini , and Luca Benini . 2018 . Dig: Enabling out-of-band scalable high-resolution monitoring for data-center analytics, automation and control . In 2nd International Industry\/University Workshop on Data-center Automation, Analytics, and Control (DAAC 2018). Data-center Automation, Analytics, and Control (DAAC). Antonio Libri, Andrea Bartolini, and Luca Benini. 2018. Dig: Enabling out-of-band scalable high-resolution monitoring for data-center analytics, automation and control. In 2nd International Industry\/University Workshop on Data-center Automation, Analytics, and Control (DAAC 2018). Data-center Automation, Analytics, and Control (DAAC)."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.3390\/en11092478"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2934872.2934879"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.14529\/jsfi160205"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2016.167"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5555\/1973333.1973349"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.5555\/1964238.1964240"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Nitin Sukhija and Elizabeth Bautista. 2019. Towards a Framework for Monitoring and Analyzing High Performance Computing Environments Using Kubernetes and Prometheus. In 2019 IEEE SmartWorld Ubiquitous Intelligence & Computing Advanced & Trusted Computing Scalable Computing & Communications Cloud & Big Data Computing Internet of People and Smart City Innovation (SmartWorld\/SCALCOM\/UIC\/ATC\/CBDCom\/IOP\/SCI). IEEE 257--262.  Nitin Sukhija and Elizabeth Bautista. 2019. Towards a Framework for Monitoring and Analyzing High Performance Computing Environments Using Kubernetes and Prometheus. In 2019 IEEE SmartWorld Ubiquitous Intelligence & Computing Advanced & Trusted Computing Scalable Computing & Communications Cloud & Big Data Computing Internet of People and Smart City Innovation (SmartWorld\/SCALCOM\/UIC\/ATC\/CBDCom\/IOP\/SCI). IEEE 257--262.","DOI":"10.1109\/SmartWorld-UIC-ATC-SCALCOM-IOP-SCI.2019.00087"},{"key":"e_1_3_2_1_35_1","unstructured":"Alvaro Videla and Jason JW Williams. 2012. RabbitMQ in action: distributed messaging for everyone. Manning.  Alvaro Videla and Jason JW Williams. 2012. RabbitMQ in action: distributed messaging for everyone. Manning."}],"event":{"name":"MEDES '20: 12th International Conference on Management of Digital EcoSystems","location":"Virtual Event United Arab Emirates","acronym":"MEDES '20","sponsor":["SIGAPP ACM Special Interest Group on Applied Computing"]},"container-title":["Proceedings of the 12th International Conference on Management of Digital EcoSystems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3415958.3433046","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3415958.3433046","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:31:53Z","timestamp":1750195913000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3415958.3433046"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,2]]},"references-count":35,"alternative-id":["10.1145\/3415958.3433046","10.1145\/3415958"],"URL":"https:\/\/doi.org\/10.1145\/3415958.3433046","relation":{},"subject":[],"published":{"date-parts":[[2020,11,2]]},"assertion":[{"value":"2020-11-27","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}