{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T01:28:56Z","timestamp":1760232536880,"version":"build-2065373602"},"reference-count":43,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2022,11,11]],"date-time":"2022-11-11T00:00:00Z","timestamp":1668124800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Software"],"abstract":"<jats:p>The identification of the appropriate distribution of faults is important for ensuring the reliability of a software system and its maintenance. It has been observed that different distributions explain faults in different types of software. Faults in large and complex software systems are best represented by Pareto distribution, whereas Weibull distribution fits enterprise software well. An analysis of faults in open-source software endorses generalized Pareto distribution. This paper presents a model, called the Tsallis distribution, derived using the maximum-entropy principle, which explains faults in many diverse software systems. The effectiveness of Tsallis distribution is ascertained by carrying out experiments on many real data sets from enterprise and open-source software systems. It is found that Tsallis distribution describes software faults better and more precisely than Weibull and generalized Pareto distributions, in both cases. The applications of the Tsallis distribution in (i) software fault-prediction using the Bayesian inference method, and (ii) the Goal and Okumoto software-reliability model, are discussed.<\/jats:p>","DOI":"10.3390\/software1040020","type":"journal-article","created":{"date-parts":[[2022,11,14]],"date-time":"2022-11-14T04:30:52Z","timestamp":1668400252000},"page":"473-484","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Analysis of Faults in Software Systems Using Tsallis Distribution: A Unified Approach"],"prefix":"10.3390","volume":"1","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4295-3769","authenticated-orcid":false,"given":"Shachi","family":"Sharma","sequence":"first","affiliation":[{"name":"Department of Computer Science, South Asian University, New Delhi 110021, India"}]}],"member":"1968","published-online":{"date-parts":[[2022,11,11]]},"reference":[{"key":"ref_1","unstructured":"Jalote, P. (2005). An Integrated Approach to Software Engineering, Springer."},{"key":"ref_2","first-page":"6339","article-title":"An empirical assessment of threshold techniques to discriminate the fault status of software","volume":"34","author":"Kaur","year":"2022","journal-title":"J. King Saud Univ. Comput. Inf. Sci."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1109\/TR.2013.2285056","article-title":"Evaluation and application of bounded generalized pareto analysis to fault distributions inopen-source software","volume":"63","author":"Huang","year":"2014","journal-title":"IEEE Trans. Rel."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1109\/2.962984","article-title":"Software defect reduction top 10 list","volume":"34","author":"Boehm","year":"2001","journal-title":"Computer"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1016\/j.jss.2018.06.025","article-title":"Early software defect prediction: A systematic map and review","volume":"144","author":"Ozakinci","year":"2018","journal-title":"J. Syst. Softw."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Tanaka, K., and Tsuda, K. (2016, January 10\u201314). Methods to predict the number of software faults using Weibull distribution. Proceedings of the IEEE 40th Annual Computer Software and Applications Conference, Atlanta, GA, USA.","DOI":"10.1109\/COMPSAC.2016.154"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1145\/566171.566181","article-title":"The distribution of faults in a large industrial software system","volume":"27","author":"Ostrand","year":"2002","journal-title":"ACM SIGSOFT Softw. Eng. Notes"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"797","DOI":"10.1109\/32.879815","article-title":"Quantitative analysis of faults and failures in a complex software system","volume":"26","author":"Fenton","year":"2000","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_9","unstructured":"Vrankovi, A., and Grbac, T.G. (2018, January 27\u201330). Replication of quantitative analysis of bug distributions on open-source software systems. Proceedings of the 7th Workshop of Software Quality Analysis, Monitoring, Improvement, and Applications, Novi Sad, Serbia."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1109\/TSE.2007.70771","article-title":"On the distribution of software faults","volume":"34","author":"Zhang","year":"2008","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"462","DOI":"10.1109\/TSE.2012.46","article-title":"A second replicated quantitative analysis of bug distributions in complex software systems","volume":"39","author":"Grbac","year":"2013","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1142\/S0218194018500055","article-title":"Empirical study on the distribution of faults in software systems","volume":"28","author":"Sriram","year":"2018","journal-title":"Int. J. Softw. Eng. Knowl. Eng."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"250","DOI":"10.1016\/j.infsof.2014.06.014","article-title":"On the probability distribution of faults in complex software systems","volume":"58","author":"Grbac","year":"2015","journal-title":"Inf. Softw. Technol."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1007\/s10479-017-2486-3","article-title":"A generalized software-reliability model with stochastic fault-detection rate","volume":"277","author":"Pham","year":"2019","journal-title":"Ann. Oper. Res."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"159","DOI":"10.18576\/amis\/120116","article-title":"Using ordered Probit model to study the effects of component quality on reusability","volume":"12","author":"Thapar","year":"2018","journal-title":"Appl. Math. Inf. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"810","DOI":"10.1109\/TSE.2011.63","article-title":"Does software process improvement reduce the severity of defects? A longitudinal field study","volume":"38","author":"Harter","year":"2012","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1109\/TSE.2007.1005","article-title":"A replicated quantitative analysis of fault distributions in complex software systems","volume":"33","author":"Andersson","year":"2007","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"998","DOI":"10.1109\/32.177369","article-title":"A practical view of software measurement and implementation experiences within motorola","volume":"18","author":"Daskalantonakis","year":"1992","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"872","DOI":"10.1109\/TSE.2011.54","article-title":"On the distribution of bugs in the eclipse system","volume":"37","author":"Concas","year":"2011","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_20","unstructured":"Hribar, L., and Dula, D. (2010, January 15\u201317). Weibull distribution in modeling component faults. Proceedings of the 52nd 52nd International Symposium ELMAR, Zadar, Croatia."},{"key":"ref_21","unstructured":"Hunt, F., and Johnson, P. (2002, January 19\u201325). On the Pareto distribution of sourceforge projects. Proceedings of the International Workshop open-source software Develop, Orlando, FL, USA."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Zimmermann, T., Premraj, R., and Zeller, A. (2007, January 20\u201326). Predicting defects for eclipse. Proceedings of the Third International Workshop on Predictor Models in Software Engineering, Minneapolis, MN, USA.","DOI":"10.1109\/PROMISE.2007.10"},{"key":"ref_23","unstructured":"(2022, October 30). Equinox. Available online: https:\/\/bug.inf.usi.ch\/download.php."},{"key":"ref_24","unstructured":"(2020, February 18). KAA Platform. Available online: https:\/\/www.kaaproject.org\/."},{"key":"ref_25","unstructured":"(2020, February 18). GCC. Available online: https:\/\/gcc.gnu.org\/bugzilla\/."},{"key":"ref_26","unstructured":"(2020, February 18). Samba. Available online: https:\/\/bugzilla.samba.org\/."},{"key":"ref_27","unstructured":"(2020, February 18). Available online: https:\/\/bugs.python.org\/."},{"key":"ref_28","unstructured":"(2020, February 18). Available online: https:\/\/bugzilla.mozilla.org\/."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Kuo, C., Huang, C., and Luan, S. (2012, January 20\u201322). A study of using two-parameter generalized Pareto model to analyze the fault distribution of open-source software. Proceedings of the IEEE Sixth International Conference on Software Security and Reliability, Gaithersburg, MD, USA.","DOI":"10.1109\/SERE.2012.21"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1002\/j.1538-7305.1948.tb01338.x","article-title":"A mathematical theory of communication","volume":"27","author":"Shannon","year":"1948","journal-title":"Bell Syst. Tech. J."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Gell-mann, M., and Tsallis, C. (2004). Nonextensive Entropy: Interdisciplinary Applications, Oxford University Press.","DOI":"10.1093\/oso\/9780195159769.001.0001"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"566","DOI":"10.1109\/TSE.2008.105","article-title":"Power-law distributions of component size in general software systems","volume":"35","author":"Hatton","year":"2009","journal-title":"IEEE Trans. Softw. Eng."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"20380","DOI":"10.1073\/pnas.1320578110","article-title":"A maximum entropy framework for nonexponential distributions","volume":"110","author":"Peterson","year":"2013","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Sharma, S., Pendharkar, P.C., and Karmeshu, K. (2021). Learning component size distributions for software cost estimation: Models based on arithmetic and shifted geometric means rules. IEEE Trans. Softw. Eng.","DOI":"10.1109\/TSE.2021.3139216"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Karmeshu, K., and Sharma, S. (2006). Power law and Tsallis entropy: Network traffic and applications. Chaos, Nonlinearity, Complexity, Springer.","DOI":"10.1007\/3-540-31757-0_5"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1109\/LCOMM.2006.1665118","article-title":"q-Exponential product-form solution of packet distribution in queueing networks: Maximization of Tsallis entropy","volume":"10","author":"Karmeshu","year":"2006","journal-title":"IEEE Comm. Lett."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"1530","DOI":"10.1109\/TCOMM.2008.060404","article-title":"Bimodal packet distribution in loss systems using maximum Tsallis entropy principle","volume":"56","author":"Sharma","year":"2008","journal-title":"IEEE Trans. Comm."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"971","DOI":"10.1109\/LCOMM.2009.12.091768","article-title":"Power law characteristics and loss probability: Finite buffer queueing systems","volume":"13","author":"Sharma","year":"2009","journal-title":"IEEE Comm. Lett."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"e2417","DOI":"10.1002\/smr.2417","article-title":"On the analysis of power law distribution in software component sizes","volume":"34","author":"Sharma","year":"2022","journal-title":"J. Softw. Evol. Process"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1080\/01621459.1951.10500769","article-title":"The Kolmogrov-Smirnov test for goodness of fit","volume":"46","author":"Massey","year":"1951","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1137\/070710111","article-title":"Power-law distributions in empirical data","volume":"51","author":"Clauset","year":"2009","journal-title":"SIAM Rev."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1016\/j.jss.2016.02.015","article-title":"Analyzing defect inflow distribution and applying Bayesian inference method for software defect prediction in large software projects","volume":"117","author":"Rana","year":"2016","journal-title":"J. Syst. Softw."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1109\/TR.1979.5220566","article-title":"Time-dependent error-detection rate model for software reliability and other performance measures","volume":"28","author":"Goel","year":"1979","journal-title":"IEEE Trans. Rel."}],"container-title":["Software"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2674-113X\/1\/4\/20\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:16:48Z","timestamp":1760145408000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2674-113X\/1\/4\/20"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,11]]},"references-count":43,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["software1040020"],"URL":"https:\/\/doi.org\/10.3390\/software1040020","relation":{},"ISSN":["2674-113X"],"issn-type":[{"type":"electronic","value":"2674-113X"}],"subject":[],"published":{"date-parts":[[2022,11,11]]}}}