{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T01:31:17Z","timestamp":1760059877680,"version":"build-2065373602"},"reference-count":54,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2025,7,23]],"date-time":"2025-07-23T00:00:00Z","timestamp":1753228800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Axioms"],"abstract":"<jats:p>The article investigates the accuracy of nonparametric univariate density estimation methods applied to various Gaussian mixture models. A comprehensive comparative analysis is performed for four popular estimation approaches: adaptive kernel density estimation, projection pursuit, log-spline estimation, and wavelet-based estimation. The study is extended with modified versions of these methods, where the sample is first clustered using the EM algorithm based on Gaussian mixture components prior to density estimation. Estimation accuracy is quantitatively evaluated using MAE and MAPE criteria, with simulation experiments conducted over 100,000 replications for various sample sizes. The results show that estimation accuracy strongly depends on the density structure, sample size, and degree of component overlap. Clustering before density estimation significantly improves accuracy for multimodal and asymmetric densities. Although no formal statistical tests are conducted, the performance improvement is validated through non-overlapping confidence intervals obtained from 100,000 simulation replications. In addition, several decision-making systems are compared for automatically selecting the most appropriate estimation method based on the sample\u2019s statistical features. Among the tested systems, kernel discriminant analysis yielded the lowest error rates, while neural networks and hybrid methods showed competitive but more variable performance depending on the evaluation criterion. The findings highlight the importance of using structurally adaptive estimators and automation of method selection in nonparametric statistics. The article concludes with recommendations for method selection based on sample characteristics and outlines future research directions, including extensions to multivariate settings and real-time decision-making systems.<\/jats:p>","DOI":"10.3390\/axioms14080551","type":"journal-article","created":{"date-parts":[[2025,7,23]],"date-time":"2025-07-23T10:49:17Z","timestamp":1753267757000},"page":"551","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Comparative Evaluation of Nonparametric Density Estimators for Gaussian Mixture Models with Clustering Support"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2292-9852","authenticated-orcid":false,"given":"Tomas","family":"Ruzgas","sequence":"first","affiliation":[{"name":"Department of Applied Mathematics, Kaunas University of Technology, LT-51368 Kaunas, Lithuania"}]},{"given":"Gintaras","family":"Stankevi\u010dius","sequence":"additional","affiliation":[{"name":"Department of Applied Mathematics, Kaunas University of Technology, LT-51368 Kaunas, Lithuania"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9042-7168","authenticated-orcid":false,"given":"Birut\u0117","family":"Narijauskait\u0117","sequence":"additional","affiliation":[{"name":"Department of Applied Mathematics, Kaunas University of Technology, LT-51368 Kaunas, Lithuania"}]},{"given":"Jurgita","family":"Arnastauskait\u0117 Zencevi\u010dien\u0117","sequence":"additional","affiliation":[{"name":"Department of Computer Sciences, Kaunas University of Technology, LT-51368 Kaunas, Lithuania"}]}],"member":"1968","published-online":{"date-parts":[[2025,7,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1080\/01621459.1996.10476701","article-title":"A Brief Survey of Bandwidth Selection for Density Estimation","volume":"91","author":"Jones","year":"1996","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"712","DOI":"10.1214\/aos\/1176348653","article-title":"Exact Mean Integrated Squared Error","volume":"20","author":"Marron","year":"1992","journal-title":"Ann. Stat."},{"key":"ref_3","unstructured":"Silverman, B.W. (1986). Density Estimation for Statistics and Data Analysis, Springer."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1264","DOI":"10.1214\/aos\/1013203453","article-title":"Convergence Rates for Density Estimation with Bernstein Polynomials","volume":"29","author":"Ghosal","year":"2001","journal-title":"Ann. Stat."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1499","DOI":"10.1109\/TPAMI.2003.1240123","article-title":"Efficient Kernel Density Estimation Using the Fast Gauss Transform with Applications to Color Modeling and Tracking","volume":"25","author":"Elgammal","year":"2003","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1080\/01621459.1987.10478427","article-title":"Exploratory Projection Pursuit","volume":"82","author":"Friedman","year":"1987","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_7","unstructured":"Herrick, D.R.M., Nason, G.P., and Silverman, B.W. (2025, June 13). Some New Methods for Wavelet Density Estimation. Available online: https:\/\/www.jstor.org\/stable\/25051366?seq=1."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1016\/0167-9473(91)90115-I","article-title":"A Study of Logspline Density Estimation","volume":"12","author":"Kooperberg","year":"1991","journal-title":"Comput. Stat. Data Anal."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1080\/10485250212374","article-title":"Nonparametric Estimation and Symmetry Tests for Conditional Density Functions","volume":"14","author":"Hyndman","year":"2002","journal-title":"J. Nonparametr. Stat."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1602","DOI":"10.1214\/aos\/1032298287","article-title":"Local Likelihood Density Estimation","volume":"24","author":"Loader","year":"1996","journal-title":"Ann. Stat."},{"key":"ref_11","unstructured":"Delicado, P., and del R\u00edo, M. (2025, June 12). A Generalization of Histogram Type Estimators. Available online: https:\/\/repositori.upf.edu\/items\/07ef9af1-c209-4297-956e-b839ad6eef4f."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1080\/01621459.1981.10477594","article-title":"Monte Carlo Study of Three Data-Based Nonparametric Probability Density Estimators","volume":"76","author":"Scott","year":"1981","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2795","DOI":"10.1109\/78.324744","article-title":"Nonparametric Multivariate Density Estimation: A Comparative Study","volume":"42","author":"Hwang","year":"1994","journal-title":"IEEE Trans. Signal Process."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1016\/0304-4076(95)01752-6","article-title":"Qualitative and Asymptotic Performance of SNP Density Estimators","volume":"74","author":"Fenton","year":"1996","journal-title":"J. Econom."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1051\/aas:1998355","article-title":"Density Estimation with Non\u2013Parametric Methods","volume":"127","author":"Fadda","year":"1998","journal-title":"Astron. Astrophys. Suppl. Ser."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1137\/S1064827595290462","article-title":"Efficient Nonparametric Density Estimation on the Sphere with Applications in Fluid Mechanics","volume":"22","author":"Eugeciouglu","year":"2000","journal-title":"SIAM J. Sci. Comput."},{"key":"ref_17","first-page":"2795","article-title":"Nonparametric Density Estimation: A Comparative Study","volume":"3","author":"Takada","year":"2001","journal-title":"Econ. Bull."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"363","DOI":"10.2307\/1913241","article-title":"Semi-Nonparametric Maximum Likelihood Estimation","volume":"55","author":"Gallant","year":"1987","journal-title":"Econometrica"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1007\/BF01413829","article-title":"Multivariate Density Estimation: A Comparative Study","volume":"6","author":"Koronacki","year":"1997","journal-title":"Neural Comput. Appl."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1109\/97.817377","article-title":"Wavelet-Based Method for Nonparametric Estimation of HMMs","volume":"7","author":"Couvreur","year":"2000","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"349","DOI":"10.1016\/0893-6080(91)90071-C","article-title":"On the Relations between Discriminant Analysis and Multilayer Perceptrons","volume":"4","author":"Gallinari","year":"1991","journal-title":"Neural Netw."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1016\/S0304-3800(99)00113-1","article-title":"Comparing Discriminant Analysis, Neural Networks and Logistic Regression for Predicting Species Distributions: A Case Study with a Himalayan River Bird","volume":"120","author":"Manel","year":"1999","journal-title":"Ecol. Model."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0377-2217(98)00037-X","article-title":"Decision Making Using Multiple Models","volume":"114","author":"Malhotra","year":"1999","journal-title":"Eur. J. Oper. Res."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Ge, H., Dong, L., Huang, M., Zang, W., and Zhou, L. (2022). Adaptive Kernel Density Estimation for Traffic Accidents Based on Improved Bandwidth Research on Black Spot Identification Model. Electronics, 11.","DOI":"10.3390\/electronics11213604"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Amador Luna, D., Alonso-Chaves, F.M., and Fern\u00e1ndez, C. (2024). Kernel Density Estimation for the Interpretation of Seismic Big Data in Tectonics Using QGIS: The T\u00fcrkiye\u2013Syria Earthquakes (2023). Remote Sens., 16.","DOI":"10.3390\/rs16203849"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"4124","DOI":"10.1109\/TSP.2022.3198169","article-title":"Generalized Likelihood Ratio Test for Adversarially Robust Hypothesis Testing","volume":"70","author":"Puranik","year":"2022","journal-title":"IEEE Trans. Signal Process."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Helali, S., Masmoudi, A., and Slaoui, Y. (2022). Semi-Parametric Estimation Using Bernstein Polynomial and a Finite Gaussian Mixture Model. Entropy, 24.","DOI":"10.3390\/e24030315"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"622","DOI":"10.3846\/mma.2020.10505","article-title":"Accuracy of Nonparametric Density Estimation for Univariate Gaussian Mixture Models: A Comparative Study","volume":"25","author":"Ruzgas","year":"2020","journal-title":"Math. Model. Anal."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1080\/01621459.1984.10478086","article-title":"Projection Pursuit Density Estimation","volume":"79","author":"Friedman","year":"1984","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"508","DOI":"10.1214\/aos\/1032894451","article-title":"Density Estimation by Wavelet Thresholding","volume":"24","author":"Donoho","year":"1996","journal-title":"Ann. Stat."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"905","DOI":"10.1214\/aos\/1176324628","article-title":"Formulae for Mean Integrated Squared Error of Nonlinear Wavelet-Based Density Estimators","volume":"23","author":"Hall","year":"1995","journal-title":"Ann. Stat."},{"key":"ref_32","first-page":"435","article-title":"Projection Pursuit","volume":"13","author":"Huber","year":"1985","journal-title":"Ann. Stat."},{"key":"ref_33","unstructured":"(2025, June 13). Polynomial Spline Routines. Available online: https:\/\/cran.r-project.org\/web\/packages\/polspline\/."},{"key":"ref_34","unstructured":"(2025, June 13). WaveThresh4. Available online: https:\/\/people.maths.bris.ac.uk\/~wavethresh\/."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1016\/j.medengphy.2004.02.006","article-title":"Gaussian Mixture Models of ECoG Signal Features for Improved Detection of Epileptic Seizures","volume":"26","author":"Meng","year":"2004","journal-title":"Med. Eng. Phys."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Gao, M., Tai-hua, C., and Gao, X. (2010, January 29\u201331). Application of Gaussian Mixture Model Genetic Algorithm in Data Stream Clustering Analysis. Proceedings of the 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems, Xiamen, China.","DOI":"10.1109\/ICICISYS.2010.5658322"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1007\/s00521-005-0471-2","article-title":"Application of the Gaussian Mixture Model to Drug Dissolution Profiles Prediction","volume":"14","author":"Lim","year":"2005","journal-title":"Neural Comput. Appl."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"2348","DOI":"10.1086\/117248","article-title":"Detecting Bimodality in Astronomical Datasets","volume":"108","author":"Ashman","year":"1994","journal-title":"Astron. J."},{"key":"ref_39","unstructured":"Beaven, S.G., Stein, D., and Hoff, L.E. (2000, January 24\u201328). Comparison of Gaussian Mixture and Linear Mixture Models for Classification of Hyperspectral Data. Proceedings of the IGARSS 2000. IEEE 2000 International Geoscience and Remote Sensing Symposium. Taking the Pulse of the Planet: The Role of Remote Sensing in Managing the Environment. Proceedings (Cat. No.00CH37120), Honolulu, HI, USA."},{"key":"ref_40","unstructured":"Smidtaite, R. (2008). Application of Nonlinear Statistics for Distribution Density Estimation of Random Vectors. [Master\u2019s Thesis, Kaunas University of Technology]."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1007\/BF00992613","article-title":"Statistical Estimation of a Mixture of Gaussian Distributions","volume":"38","author":"Rudzkis","year":"1995","journal-title":"Acta Appl. Math."},{"key":"ref_42","unstructured":"Hall, P. (2025, June 13). The Bootstrap and Edgeworth Expansion. Available online: https:\/\/www.abebooks.com\/9780387945088\/Bootstrap-Edgeworth-Expansion-Springer-Series-0387945083\/plp."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1080\/00949658508810837","article-title":"A Bootstrap Testing Procedure for Investigating the Number of Subpopulations","volume":"22","author":"Wong","year":"1985","journal-title":"J. Stat. Comput. Simul."},{"key":"ref_44","unstructured":"Allison, P.D. (2008). Logistic Regression Using SAS: Theory and Application, Wiley."},{"key":"ref_45","unstructured":"Cox, D.R., and Snell, E.J. (1989). Analysis of Binary Data, Chapman and Hall. [2nd ed.]."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Rao, C.R. (1973). Linear Statistical Inference and Its Applications, Wiley. [2nd ed.]. Wiley Series in Probability and Statistics: 1st ed.","DOI":"10.1002\/9780470316436"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"683","DOI":"10.1111\/j.2517-6161.1991.tb01857.x","article-title":"A reliable data-based bandwidth selection method for kernel density-estimation","volume":"53","author":"JONES","year":"1991","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1214\/009053604000000959","article-title":"Bandwidth Choice for Nonparametric Classification","volume":"33","author":"Hall","year":"2005","journal-title":"Ann. Stat."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1016\/j.eswa.2007.10.005","article-title":"Neural Networks and Statistical Techniques: A Review of Applications","volume":"36","author":"Paliwal","year":"2009","journal-title":"Expert. Syst. Appl."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1016\/0167-9473(95)00032-1","article-title":"Neural Networks and Logistic Regression: Part I","volume":"21","author":"Schumacher","year":"1996","journal-title":"Comput. Stat. Data Anal."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"127.e1","DOI":"10.1016\/j.forsciint.2009.07.014","article-title":"A Comparison between Neural Network and Other Metric Methods to Determine Sex from the Upper Femur in a Modern French Population","volume":"192","author":"Jardin","year":"2009","journal-title":"Forensic Sci. Int."},{"key":"ref_52","unstructured":"Shewhart, W.A. (1939). Statistical Method from the Viewpoint of Quality Control."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"782","DOI":"10.1080\/01621459.1993.10476339","article-title":"The Identification of Multiple Outliers","volume":"88","author":"Davies","year":"1993","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"754","DOI":"10.1145\/30401.315746","article-title":"Programming pearls: A sample of brilliance","volume":"30","author":"Bentley","year":"1987","journal-title":"Commun. ACM"}],"container-title":["Axioms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2075-1680\/14\/8\/551\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T18:14:23Z","timestamp":1760033663000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2075-1680\/14\/8\/551"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,23]]},"references-count":54,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2025,8]]}},"alternative-id":["axioms14080551"],"URL":"https:\/\/doi.org\/10.3390\/axioms14080551","relation":{},"ISSN":["2075-1680"],"issn-type":[{"type":"electronic","value":"2075-1680"}],"subject":[],"published":{"date-parts":[[2025,7,23]]}}}