{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T18:30:58Z","timestamp":1771612258284,"version":"3.50.1"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2025,1,29]],"date-time":"2025-01-29T00:00:00Z","timestamp":1738108800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,1,29]],"date-time":"2025-01-29T00:00:00Z","timestamp":1738108800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["CNS-2011252"],"award-info":[{"award-number":["CNS-2011252"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Netw Syst Manage"],"published-print":{"date-parts":[[2025,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Misconfiguration is a known and increasingly serious problem in enterprise systems due to frequent code updates and retuning of the configuration parameters. Diagnosing complex, residual misconfiguration problems that lead to inaccessible services or failed transactions often starts with either a user complaint or observation by administrators, followed by a largely manual process of deciding what tests to run and how to proceed with further testing based on the test results. The goal of this paper is to automate this process and thereby make root-cause analysis of accessibility related misconfigurations much speedier and much more effective. We explore a domain-knowledge-driven methodology, called ConfExp using a network emulator that runs real enterprise networking protocols. Thus, by using commonly used tests, we show that the root-cause can be determined in all cases where discriminative tests exist. The methodology also highlights areas where more discriminative tests are needed to pinpoint the precise configuration variables at fault.<\/jats:p>","DOI":"10.1007\/s10922-024-09886-w","type":"journal-article","created":{"date-parts":[[2025,1,29]],"date-time":"2025-01-29T16:15:27Z","timestamp":1738167327000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["ConfExp: Root-Cause Analysis of Service Misconfigurations in Enterprise Systems"],"prefix":"10.1007","volume":"33","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2171-1691","authenticated-orcid":false,"given":"Negar","family":"Mohammadi Koushki","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2159-0981","authenticated-orcid":false,"given":"Ibrahim","family":"El-Shekeil","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Krishna","family":"Kant","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,1,29]]},"reference":[{"key":"9886_CR1","volume-title":"Building Microservices","author":"S Newman","year":"2021","unstructured":"Newman, S.: Building Microservices. O\u2019Reilly Media, Inc, Sebastopol (2021)"},{"key":"9886_CR2","volume-title":"Continuous Delivery: Reliable Software Releases Through Build, Test, and Deployment Automation","author":"J Humble","year":"2010","unstructured":"Humble, J., Farley, D.: Continuous Delivery: Reliable Software Releases Through Build, Test, and Deployment Automation. Pearson Education, Pearson (2010)"},{"key":"9886_CR3","unstructured":"Williams, M., Vance, A.: Microsoft takes blame for web site access failures. http:\/\/www.computerworld.com\/article\/2590639\/networking\/microsoft-takes-blame-for-web-site-access-failures.html (2001). Accessed 3 July 2023"},{"key":"9886_CR4","unstructured":"Misconfiguration brings down entire .se domain in Sweden. http:\/\/www.circleid.com\/posts\/misconfiguration_brings_down_entire_se_domain_in_sweden (2009). Accessed 3 July 2023"},{"key":"9886_CR5","unstructured":"Apple blames itunes outage on dns error. what does that mean? https:\/\/www.csmonitor.com\/Technology\/2015\/0311\/Apple-blames-iTunes-outage-on-DNS-error.-What-does-that-mean (2015). Accessed 3 July 2023"},{"key":"9886_CR6","unstructured":"Kurtz, A.: Delta malfunction on land keeps a fleet of planes from the sky. https:\/\/www.nytimes.com\/2016\/08\/09\/business\/delta-air-lines-delays-computer-failure.html (2016). Accessed 3 July 2023"},{"key":"9886_CR7","unstructured":"Southwest airlines\u2019 router grounds 2,300 flights. https:\/\/availabilitydigest.com\/public_articles\/1108\/southwest_airlines.pdf (2016). Accessed 3 July 2023"},{"key":"9886_CR8","doi-asserted-by":"crossref","unstructured":"Du, W., Zeng, H., Won, K.: Seed emulator: an internet emulator for research and education. In: Proceedings of the 21st ACM Workshop on Hot Topics in Networks, pp. 101\u2013107 (2022)","DOI":"10.1145\/3563766.3564097"},{"key":"9886_CR9","doi-asserted-by":"publisher","first-page":"148","DOI":"10.1016\/j.jocs.2017.03.003","volume":"22","author":"P-W Tsai","year":"2017","unstructured":"Tsai, P.-W., Piccialli, F., Tsai, C.-W., Luo, M.-Y., Yang, C.-S.: Control frameworks in network emulation testbeds: a survey. J. Comput. Sci. 22, 148\u2013161 (2017)","journal-title":"J. Comput. Sci."},{"key":"9886_CR10","doi-asserted-by":"crossref","unstructured":"Ahrenholz, J., Danilov, C., Henderson, T. R., Kim, J.\u00a0H.: CORE: a real-time network emulator. In: MILCOM 2008-2008 IEEE Military Communications Conference. IEEE, pp. 1\u20137 (2008)","DOI":"10.1109\/MILCOM.2008.4753614"},{"key":"9886_CR11","doi-asserted-by":"crossref","unstructured":"Du, W., Zeng, H.: The seed internet emulator and its applications in cybersecurity education. (2022)","DOI":"10.1145\/3563766.3564097"},{"key":"9886_CR12","doi-asserted-by":"crossref","unstructured":"Case, J., Fedor, M., Schoffstall, M., Davin, J.: Simple network management protocol (SNMP). Internet Engineering Task Force (IETF) RFC, vol. 1157. https:\/\/datatracker.ietf.org\/doc\/rfc1157\/ (1990)","DOI":"10.17487\/rfc1157"},{"key":"9886_CR13","doi-asserted-by":"crossref","unstructured":"Enns, R.: Netconf configuration protocol. Internet Engineering Task Force (IETF) RFC, vol. 4741 (2006). https:\/\/datatracker.ietf.org\/doc\/rfc4741\/","DOI":"10.17487\/rfc4741"},{"key":"9886_CR14","volume-title":"Nagios: System and Network Monitoring. minus","author":"W Barth","year":"2008","unstructured":"Barth, W.: Nagios: System and Network Monitoring. minus. No Starch Press, San Francisco (2008)"},{"issue":"1","key":"9886_CR15","first-page":"88","volume":"24","author":"S-M Lamraoui","year":"2016","unstructured":"Lamraoui, S.-M., Nakajima, S.: A formula-based approach for automatic fault localization of multi-fault programs. J. Inf. Process. 24(1), 88\u201398 (2016)","journal-title":"J. Inf. Process."},{"issue":"1","key":"9886_CR16","first-page":"1","volume":"1","author":"C Chen","year":"2023","unstructured":"Chen, C., Yu, H., Lei, Z., Li, J., Ren, S., Zhang, T., Hu, S., Wang, J., Shi, W.: Balance: Bayesian linear attribution for root cause localization. Proc. ACM Manag. Data 1(1), 1\u201326 (2023)","journal-title":"Proc. ACM Manag. Data"},{"issue":"8","key":"9886_CR17","doi-asserted-by":"publisher","first-page":"707","DOI":"10.1109\/TSE.2016.2521368","volume":"42","author":"WE Wong","year":"2016","unstructured":"Wong, W.E., Gao, R., Li, Y., Abreu, R., Wotawa, F.: A survey on software fault localization. IEEE Trans. Softw. Eng. 42(8), 707\u2013740 (2016)","journal-title":"IEEE Trans. Softw. Eng."},{"key":"9886_CR18","doi-asserted-by":"crossref","unstructured":"Zhang, M., Li, Z., Dahhou, B., Cabassud, M., Volosencu, C.: Root cause analysis of actuator fault. In: Actuators. IntechOpen. p. 131 (2018)","DOI":"10.5772\/intechopen.76211"},{"key":"9886_CR19","doi-asserted-by":"crossref","unstructured":"Brodie, M., Rish, I., Ma, S.: Optimizing probe selectionfor fault localization. In: Proceedings of the 12th IFIP\/IEEE International Workshop on Distributed Systems: Operations and Management (DSOM01) (2001)","DOI":"10.3990\/2.9"},{"key":"9886_CR20","doi-asserted-by":"crossref","unstructured":"Rish, I., Brodie, M., Odintsova, N., Ma, S., Grabarnik, G.: Real-time problem determination in distributed systems using active probing. In: IEEE\/IFIP Network Operations and Management Symposium (IEEE Cat. No. 04CH37507), vol. 1. IEEE 2004, pp. 133\u2013146 (2004)","DOI":"10.1109\/NOMS.2004.1317650"},{"issue":"5","key":"9886_CR21","doi-asserted-by":"publisher","first-page":"1088","DOI":"10.1109\/TNN.2005.853423","volume":"16","author":"I Rish","year":"2005","unstructured":"Rish, I., Brodie, M., Ma, S., Odintsova, N., Beygelzimer, A., Grabarnik, G., Hernandez, K.: Adaptive diagnosis in distributed systems. IEEE Trans. Neural Netw. 16(5), 1088\u20131109 (2005)","journal-title":"IEEE Trans. Neural Netw."},{"key":"9886_CR22","unstructured":"Zheng, A.X., Rish, I.: Efficient test selection in active diagnosis via entropy approximation. arXiv:1207.1418 (2012)"},{"key":"9886_CR23","doi-asserted-by":"crossref","unstructured":"Natu, M., Sethi, A. S.: Probe station placement for fault diagnosis. In: IEEE GLOBECOM 2007-IEEE Global Telecommunications Conference, IEEE. pp. 113\u2013117 (2007)","DOI":"10.1109\/GLOCOM.2007.29"},{"key":"9886_CR24","doi-asserted-by":"publisher","first-page":"950","DOI":"10.1007\/s10922-014-9330-8","volume":"23","author":"D Jeswani","year":"2015","unstructured":"Jeswani, D., Natu, M., Ghosh, R.K.: Adaptive monitoring: application of probing to adapt passive monitoring. J. Netw. Syst. Manag. 23, 950\u2013977 (2015)","journal-title":"J. Netw. Syst. Manag."},{"key":"9886_CR25","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1016\/j.ress.2012.12.015","volume":"114","author":"L Lu","year":"2013","unstructured":"Lu, L., Xu, Z., Wang, W., Sun, Y.: A new fault detection method for computer networks. Reliab. Eng. Syst. Saf. 114, 45\u201351 (2013)","journal-title":"Reliab. Eng. Syst. Saf."},{"key":"9886_CR26","doi-asserted-by":"crossref","unstructured":"Natu, M., Sethi, A. S.: Active probing approach for fault localization in computer networks, In: 4th IEEE\/IFIP Workshop on End-to-End Monitoring Techniques and Services. IEEE 2006, pp. 25\u201333 (2006)","DOI":"10.1109\/E2EMON.2006.1651276"},{"key":"9886_CR27","unstructured":"Tang, Y., Al-Shaer, E. S., Boutaba, R.: Active integrated fault localization in communication networks. In: 9th IFIP\/IEEE International Symposium on Integrated Network Management, 2005. IM 2005. IEEE vol. 2005, pp. 543\u2013556 (2005)"},{"key":"9886_CR28","doi-asserted-by":"crossref","unstructured":"do\u00a0Carmo, R., Hoffmann, J., Willert, V., Hollick, M.: Making active-probing-based network intrusion detection in wireless multihop networks practical: a Bayesian inference approach to probe selection. In: 39th Annual IEEE Conference on LCN. IEEE, pp. 345\u2013353 (2014)","DOI":"10.1109\/LCN.2014.6925790"},{"key":"9886_CR29","doi-asserted-by":"publisher","first-page":"205","DOI":"10.1016\/j.ress.2016.03.017","volume":"152","author":"MS Garshasbi","year":"2016","unstructured":"Garshasbi, M.S.: Fault localization based on combines active and passive measurements in computer networks by ant colony optimization. Reliab. Eng. Syst. Saf. 152, 205\u2013212 (2016)","journal-title":"Reliab. Eng. Syst. Saf."},{"key":"9886_CR30","doi-asserted-by":"crossref","unstructured":"Patil, B., Kinger, S., Pathak, V.\u00a0K.: Probe station placement algorithm for probe set reduction in network fault localization. In: 2013 International Conference on Information Systems and Computer Networks. IEEE, pp. 164\u2013169 (2013)","DOI":"10.1109\/ICISCON.2013.6524195"},{"key":"9886_CR31","doi-asserted-by":"crossref","unstructured":"Salhi, E., Lahoud, S., Cousin, B.: Localization of single link-level network anomalies. In: 2012 21st International Conference on Computer Communications and Networks (ICCCN). IEEE, pp. 1\u20139 (2012)","DOI":"10.1109\/ICCCN.2012.6289247"},{"key":"9886_CR32","doi-asserted-by":"crossref","unstructured":"Traverso, S., Tego, E., Kowallik, E., Raffaglio, S., Fregosi, A., Mellia, M., Matera, F.: Exploiting hybrid measurements for network troubleshooting, In: 16th International Telecommunications Network Strategy and Planning Symposium (Networks). IEEE vol. 2014, pp. 1\u20136 (2014)","DOI":"10.1109\/NETWKS.2014.6959212"},{"issue":"4","key":"9886_CR33","doi-asserted-by":"publisher","first-page":"3030","DOI":"10.1109\/COMST.2016.2570599","volume":"18","author":"A Dusia","year":"2016","unstructured":"Dusia, A., Sethi, A.S.: Recent advances in fault localization in computer networks. IEEE Commun. Surv. Tutor. 18(4), 3030\u20133051 (2016)","journal-title":"IEEE Commun. Surv. Tutor."},{"issue":"4","key":"9886_CR34","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1007\/s10922-022-09671-7","volume":"30","author":"GV Maia","year":"2022","unstructured":"Maia, G.V., Coutinho, T.M., Gon\u00e7alves, E.B., Silva, G.R., Mendes, E.M., Mendes, M.M., Caetano, S.R., Mitt, G.M., Braga, A.P.: One class density estimation approach for fault detection and rootcause analysis in computer networks. J. Netw. Syst. Manag. 30(4), 69 (2022)","journal-title":"J. Netw. Syst. Manag."},{"key":"9886_CR35","doi-asserted-by":"crossref","unstructured":"Yen, C.-C., Sun, W., Purmehdi, H., Park, W., Deshmukh, K.\u00a0R., Thakrar, N., Nassef, O., Jacobs, A.: Graph neural network based root cause analysis using multivariate time-series kpis for wireless networks. In: NOMS 2022-2022 IEEE\/IFIP Network Operations and Management Symposium, IEEE. pp. 1\u20137 (2022)","DOI":"10.1109\/NOMS54207.2022.9789858"},{"key":"9886_CR36","doi-asserted-by":"crossref","unstructured":"Casas, P., Vanerio, J., Fukuda, K.: GML learning, a generic machine learning model for network measurements analysis. In: 2017 13th International Conference on Network and Service Management (CNSM), pp. 1\u20139. IEEE (2017)","DOI":"10.23919\/CNSM.2017.8255998"},{"issue":"2","key":"9886_CR37","doi-asserted-by":"publisher","first-page":"905","DOI":"10.1109\/TNSM.2022.3151249","volume":"19","author":"W Wang","year":"2022","unstructured":"Wang, W., Tang, L., Wang, C., Chen, Q.: Real-time analysis of multiple root causes for anomalies assisted by digital twin in NFV environment. IEEE Trans. Netw. Service Manag. 19(2), 905\u2013921 (2022)","journal-title":"IEEE Trans. Netw. Service Manag."},{"key":"9886_CR38","doi-asserted-by":"crossref","unstructured":"El-Shekeil, I., Pal, A., Kant, K.: CloudMiner: A systematic failure diagnosis framework in enterprise cloud environments. Proc. of CLOUDCOM, Nicosia, Greece (2018)","DOI":"10.1109\/CloudCom2018.2018.00028"},{"key":"9886_CR39","doi-asserted-by":"crossref","unstructured":"Athamnah, M., Pal, A., Kant, K.: A framework for misconfiguration diagnosis in interconnected multi-party systems. In: Proc. of ICCCN 2018 (2018)","DOI":"10.1109\/ICCCN.2018.8487411"},{"key":"9886_CR40","doi-asserted-by":"crossref","unstructured":"Athamnah, M., Kant, K.: Multiparty database sharing with generalized access rules. In: Proc. of CloudCom, Luxemburg, pp. 198\u2013205 (2016)","DOI":"10.1109\/CloudCom.2016.0043"}],"container-title":["Journal of Network and Systems Management"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10922-024-09886-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10922-024-09886-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10922-024-09886-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,18]],"date-time":"2025-04-18T11:48:50Z","timestamp":1744976930000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10922-024-09886-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,29]]},"references-count":40,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2025,4]]}},"alternative-id":["9886"],"URL":"https:\/\/doi.org\/10.1007\/s10922-024-09886-w","relation":{},"ISSN":["1064-7570","1573-7705"],"issn-type":[{"value":"1064-7570","type":"print"},{"value":"1573-7705","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1,29]]},"assertion":[{"value":"5 August 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 November 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 November 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 January 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing Interests"}}],"article-number":"27"}}