{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:41:58Z","timestamp":1760146918329,"version":"build-2065373602"},"reference-count":109,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2024,12,26]],"date-time":"2024-12-26T00:00:00Z","timestamp":1735171200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>Species distribution modeling is fundamental to biodiversity, evolution, conservation science, and the study of invasive species. Given environmental data and species distribution data, model selection techniques are frequently used to help identify relevant features. Existing studies aim to find the relevant features by selecting the best models using different criteria, and they deem the predictors in the best models as the relevant features. However, they mostly consider only a given model family, making them vulnerable to model family misspecification. To address this issue, this paper introduces the Bayesian information-theoretic minimum message length (MML) principle to species distribution model selection. In particular, we provide a framework that allows the message length of models from multiple model families to be calculated and compared, and by doing so, the model selection is both accurate and robust against model family misspecification and data aggregation. To find the relevant features efficiently, we further develop a novel search algorithm that does not require calculating the message length for all possible subsets of features. Experimental results demonstrate that our proposed method outperforms competing methods by selecting the best models on both artificial and real-world datasets. More specifically, there was one test on artificial data that all methods got wrong. On the other 10 tests on artificial data, the MML method got everything correct, but the alternative methods all failed on a variety of tests. Our real-world data pertained to two plant species from Barro Colorado Island, Panama. Compared to the alternative methods, for both the plant species, the MML method selects the simplest model while also having the overall best predictions.<\/jats:p>","DOI":"10.3390\/e27010006","type":"journal-article","created":{"date-parts":[[2024,12,26]],"date-time":"2024-12-26T03:10:49Z","timestamp":1735182649000},"page":"6","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Model Selection from Multiple Model Families in Species Distribution Modeling Using Minimum Message Length"],"prefix":"10.3390","volume":"27","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1902-2851","authenticated-orcid":false,"given":"Zihao","family":"Wen","sequence":"first","affiliation":[{"name":"College of Mathematics and Informatics, South China Agricultural University, No. 483, Wushan Road, Tianhe District, Guangzhou 510642, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0583-5918","authenticated-orcid":false,"given":"David L.","family":"Dowe","sequence":"additional","affiliation":[{"name":"Department of Data Science and Artificial Intelligence, Faculty of Information Technology, Monash University, Clayton, VIC 3800, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,12,26]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1424","DOI":"10.1111\/ele.12189","article-title":"Predicting species distributions for conservation decisions","volume":"16","author":"Guisan","year":"2013","journal-title":"Ecol. Lett."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1111\/conl.12260","article-title":"Dealing with Cumulative Biodiversity Impacts in Strategic Environmental Assessment: A New Frontier for Conservation Planning","volume":"10","author":"Whitehead","year":"2017","journal-title":"Conserv. Lett."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"e03336","DOI":"10.1002\/ecy.3336","article-title":"A practical guide to selecting models for exploration, inference, and prediction in ecology","volume":"102","author":"Tredennick","year":"2021","journal-title":"Ecology"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"e06022","DOI":"10.1111\/ecog.06022","article-title":"Habitats as predictors in species distribution models: Shall we use continuous or binary data?","volume":"2022","author":"Keil","year":"2022","journal-title":"Ecography"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"e01237-20","DOI":"10.1128\/AEM.01237-20","article-title":"Nested species distribution models of chlamydiales in Ixodes ricinus (Tick) hosts in Switzerland","volume":"87","author":"Rochat","year":"2020","journal-title":"Appl. Environ. Microbiol."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/s00265-010-1029-6","article-title":"AIC model selection and multimodel inference in behavioral ecology: Some background, observations, and comparisons","volume":"65","author":"Burnham","year":"2011","journal-title":"Behav. Ecol. Sociobiol."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1007\/s00265-010-1035-8","article-title":"Model selection and model averaging in behavioural ecology: The utility of the IT-AIC framework","volume":"65","author":"Richards","year":"2011","journal-title":"Behav. Ecol. Sociobiol."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1890\/13-1452.1","article-title":"Model selection for ecologists: The worldviews of AIC and BIC","volume":"95","author":"Aho","year":"2014","journal-title":"Ecology"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1111\/2041-210X.12541","article-title":"The relative performance of AIC, AICc and BIC in the presence of unobserved heterogeneity","volume":"7","author":"Brewer","year":"2016","journal-title":"Methods Ecol. Evol."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1093\/molbev\/msz228","article-title":"On the use of information criteria for model selection in phylogenetics","volume":"37","author":"Susko","year":"2020","journal-title":"Mol. Biol. Evol."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"937","DOI":"10.1093\/biomet\/92.4.937","article-title":"Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation","volume":"92","author":"Yang","year":"2005","journal-title":"Biometrika"},{"key":"ref_12","unstructured":"Wallace, C.S. (1998, January 6\u201310). On the selection of the order of a polynomial model. Proceedings of the 14th Biennial Australian Statistical Conference, Queensland, QLD, Australia."},{"key":"ref_13","unstructured":"Wallace, C.S. (1997). On the Selection of the Order of a Polynomial Model, Royal Holloway College. Available online: https:\/\/allisons.org\/ll\/Images\/People\/Wallace\/Polynomial\/."},{"key":"ref_14","unstructured":"Fitzgibbon, L.J., Dowe, D.L., and Allison, L. (2002, January 8\u201312). Univariate polynomial inference by Monte Carlo message length approximation. Proceedings of the ICML, Sydney, NSW, Australia."},{"key":"ref_15","unstructured":"Fitzgibbon, L.J., Dowe, D.L., and Vahid, F. (2004, January 4\u20137). Minimum message length autoregressive model order selection. Proceedings of the International Conference on Intelligent Sensing and Information Processing (ICISIP), Chennai, India."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Makalic, E., and Schmidt, D.F. (2012, January 4\u20137). MML logistic regression with translation and rotation invariant priors. Proceedings of the Australasian Joint Conference on Artificial Intelligence, Sydney, Australia.","DOI":"10.1007\/978-3-642-35101-3_74"},{"key":"ref_17","unstructured":"Dowe, D.L. (2013). Minimum message length order selection and parameter estimation of moving average models. Algorithmic Probability and Friends. Bayesian Prediction and Artificial Intelligence: Papers from the Ray Solomonoff 85th Memorial Conference, Melbourne, VIC, Australia, 30 November\u20132 December 2011, Springer."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"709","DOI":"10.1093\/bjps\/axm033","article-title":"Bayes not bust! Why simplicity is no problem for Bayesians","volume":"58","author":"Dowe","year":"2007","journal-title":"Br. J. Philos. Sci."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1111\/1749-4877.12000","article-title":"Applying various algorithms for species distribution modelling","volume":"8","author":"Li","year":"2013","journal-title":"Integr. Zool."},{"key":"ref_20","unstructured":"Burt, J.E., Barber, G.M., and Rigby, D.L. (2009). Elementary Statistics for Geographers, Guilford Press."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1016\/S0167-7152(99)00198-4","article-title":"A Bayesian analysis for estimating the number of species in a population using nonhomogeneous Poisson process","volume":"48","author":"Leite","year":"2000","journal-title":"Stat. Probab. Lett."},{"key":"ref_22","first-page":"1383","article-title":"Poisson point process models solve the \u201cpseudo-absence problem\u201d for presence-only data in ecology","volume":"4","author":"Warton","year":"2010","journal-title":"Ann. Appl. Stat."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1472","DOI":"10.1111\/geb.12216","article-title":"Accounting for imperfect detection and survey bias in statistical analysis of presence-only data","volume":"23","author":"Dorazio","year":"2014","journal-title":"Glob. Ecol. Biogeogr."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1917","DOI":"10.1214\/13-AOAS667","article-title":"Finite-sample equivalence in statistical models for presence-only data","volume":"7","author":"Fithian","year":"2013","journal-title":"Ann. Appl. Stat."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"424","DOI":"10.1111\/2041-210X.12242","article-title":"Bias correction in species distribution models: Pooling survey and collection data for multiple species","volume":"6","author":"Fithian","year":"2015","journal-title":"Methods Ecol. Evol."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"934","DOI":"10.1038\/s41467-019-08822-w","article-title":"Model selection may not be a mandatory step for phylogeny reconstruction","volume":"10","author":"Abadi","year":"2019","journal-title":"Nat. Commun."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Baddeley, A., Rubak, E., and Turner, R. (2015). Spatial Point Patterns: Methodology and Applications with R, Chapman and Hall\/CRC Press.","DOI":"10.1201\/b19708"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v033.i01","article-title":"Regularization Paths for Generalized Linear Models via Coordinate Descent","volume":"33","author":"Friedman","year":"2010","journal-title":"J. Stat. Softw."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1093\/comjnl\/bxm117","article-title":"Foreword re C. S. Wallace","volume":"51","author":"Dowe","year":"2008","journal-title":"Comput. J."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.1093\/jrsssb\/qkae067","article-title":"Zihao Wen and David L. Dowe\u2019s contribution to the Discussion of \u2018Safe testing\u2019 by Gr\u00fcnwald, de Heide, and Koolen","volume":"86","author":"Wen","year":"2024","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1037\/h0071325","article-title":"Analysis of a Complex of Statistical Variables into Principal Components","volume":"24","author":"Hotelling","year":"1933","journal-title":"J. Educ. Psychol."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"504","DOI":"10.1126\/science.1127647","article-title":"Reducing the Dimensionality of Data with Neural Networks","volume":"313","author":"Hinton","year":"2006","journal-title":"Science"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random Forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/j.ecolmodel.2005.03.026","article-title":"Maximum entropy modeling of species geographic distributions","volume":"190","author":"Phillips","year":"2006","journal-title":"Ecol. Model."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1093\/comjnl\/11.2.185","article-title":"An information measure for classification","volume":"11","author":"Wallace","year":"1968","journal-title":"Comput. J."},{"key":"ref_36","unstructured":"Needham, S., and Dowe, D.L. (2001, January 4\u20137). Message length as an effective Ockham\u2019s razor in decision tree induction. Proceedings of the International Workshop on Artificial Intelligence and Statistics, Key West, FL, USA."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1093\/comjnl\/42.4.270","article-title":"Minimum message length and Kolmogorov complexity","volume":"42","author":"Wallace","year":"1999","journal-title":"Comput. J."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"338","DOI":"10.1093\/comjnl\/42.4.338","article-title":"Discussion of the papers by Rissanen, and by Wallace and Dowe","volume":"42","author":"Clarke","year":"1999","journal-title":"Comput. J."},{"key":"ref_39","unstructured":"Wallace, C.S. (2005). Statistical and Inductive Inference by Minimum Message Length, Springer."},{"key":"ref_40","unstructured":"Solomonoff, R.J. (1960). A Preliminary Report on a General Theory of Inductive Inference, Zator Company. Technical Report."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0019-9958(64)90223-2","article-title":"A Formal Theory of Inductive Inference. Part I","volume":"7","author":"Solomonoff","year":"1964","journal-title":"Inf. Control"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1016\/S0019-9958(64)90131-7","article-title":"A Formal Theory of Inductive Inference. Part II","volume":"7","author":"Solomonoff","year":"1964","journal-title":"Inf. Control"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"422","DOI":"10.1109\/TIT.1978.1055913","article-title":"Complexity-Based Induction Systems: Comparisons and Convergence Theorems","volume":"24","author":"Solomonoff","year":"1978","journal-title":"IEEE Trans. Inf. Theory"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"256","DOI":"10.1093\/comjnl\/42.4.256","article-title":"Two Kinds of Probabilistic Induction","volume":"42","author":"Solomonoff","year":"1999","journal-title":"Comput. J."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1145\/321356.321363","article-title":"On the Length of Programs for Computing Finite Binary Sequences","volume":"13","author":"Chaitin","year":"1966","journal-title":"J. ACM (JACM)"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1145\/321495.321506","article-title":"On the Length of Programs for Computing Finite Binary Sequences: Statistical Considerations","volume":"16","author":"Chaitin","year":"1969","journal-title":"J. ACM (JACM)"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1111\/j.2517-6161.1987.tb01695.x","article-title":"Estimation and inference by compact coding","volume":"49","author":"Wallace","year":"1987","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Fitzgibbon, L.J., Dowe, D.L., and Allison, L. (2002, January 18\u201322). Change-point estimation using new minimum message length approximations. Proceedings of the Pacific Rim International Conference on Artificial Intelligence, Tokyo, Japan.","DOI":"10.1007\/3-540-45683-X_28"},{"key":"ref_49","unstructured":"Agusta, Y., and Dowe, D.L. (2003, January 3\u20135). Unsupervised learning of correlated multivariate Gaussian mixture models using MML. Proceedings of the AI 2003: Advances in Artificial Intelligence: 16th Australian Conference on AI, Perth, Australia. Proceedings 16."},{"key":"ref_50","unstructured":"Fitzgibbon, L.J. (2004). Message from Monte Carlo: A Framework for Minimum Message Length Inference Using Markov Chain Monte Carlo Methods. [Ph.D. Thesis, School of Computer Science and Software Engineering, Clayton School of I.T., Monash University]."},{"key":"ref_51","unstructured":"Agusta, Y. (2005). Minimum message length mixture modelling for uncorrelated and correlated continuous data applied to mutual funds classification. [Ph.D. Thesis, School of Computer Science and Software Engineering, Clayton School of I.T., Monash University]."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Dowe, D.L. (2011). MML, hybrid Bayesian network graphical models, statistical consistency, invariance and uniqueness. Philosophy of Statistics, North-Holland.","DOI":"10.1016\/B978-0-444-51862-0.50030-7"},{"key":"ref_53","unstructured":"Schmidt, D.F. (2008). Minimum Message Length Inference of Autoregressive Moving Average models. [Ph.D. Thesis, Monash University]."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Fang, Z., Dowe, D.L., Peiris, S., and Rosadi, D. (2021). Minimum Message Length in Hybrid ARMA and LSTM Model Forecasting. Entropy, 23.","DOI":"10.20944\/preprints202110.0049.v2"},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1007\/3-540-40992-0_5","article-title":"Minimum message length grouping of ordered data","volume":"Volume 11","author":"Fitzgibbon","year":"2000","journal-title":"Proceedings of the Algorithmic Learning Theory: 11th International Conference, ALT 2000"},{"key":"ref_56","first-page":"307","article-title":"Minimum Message Length Segmentation","volume":"Volume 1394","author":"Wu","year":"1998","journal-title":"Research and Development in Knowledge Discovery and Data Mining"},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Viswanathan, M., Wallace, C.S., Dowe, D.L., and Korb, K.B. (1999, January 6\u201310). Finding Outpoints in Noisy Binary Sequences\u2014A Revised Empirical Evaluation. Proceedings of the Advanced Topics in Artificial Intelligence: 12th Australian Joint Conference on Artificial Intelligence, AI\u201999, Sydney, Australia.","DOI":"10.1007\/3-540-46695-9_34"},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1023\/A:1022646101185","article-title":"Coding decision trees","volume":"11","author":"Wallace","year":"1993","journal-title":"Mach. Learn."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Tan, P.J., and Dowe, D.L. (2006, January 13\u201317). Decision forests with oblique decision trees. Proceedings of the Mexican International Conference on Artificial Intelligence, Apizaco, Mexico.","DOI":"10.1007\/11925231_56"},{"key":"ref_60","unstructured":"Oliver, J.J., and Wallace, C.S. (1991, January 24\u201330). Inferring Decision Graphs. Proceedings of the IJCAI \u201991 Workshop 8, Sydney, Australia."},{"key":"ref_61","unstructured":"Oliver, J.J. (1992). Decision Graphs: An Extension of Decision Trees, Monash University, Department of Computer Science. Technical Report."},{"key":"ref_62","unstructured":"Oliver, J.J., Dowe, D.L., and Wallace, C.S. (1992, January 16\u201318). Inferring Decision Graphs Using the Minimum Message Length Principle. Proceedings of the 5th Australian Joint Conference on Artificial Intelligence, Hobart, TAS, Australia."},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Tan, P.J., and Dowe, D.L. (2002, January 2\u20136). MML Inference of Decision Graphs with Multi-Way Joins. Proceedings of the AI 2002: Advances in Artificial Intelligence: 15th Australian Joint Conference on Artificial Intelligence, Canberra, Australia. Lecture Notes in Computer Science.","DOI":"10.1007\/3-540-36187-1_12"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Tan, P.J., and Dowe, D.L. (2003, January 3\u20135). MML Inference of Decision Graphs with Multi-Way Joins and Dynamic Attributes. Proceedings of the Australasian Joint Conference on Artificial Intelligence, Perth, Australia.","DOI":"10.1007\/978-3-540-24581-0_23"},{"key":"ref_65","unstructured":"Comley, J.W., and Dowe, D.L. (2003, January 5\u20138). General Bayesian Networks and Asymmetric Languages. Proceedings of the Hawaii International Conference on Statistics and Related Fields, Honolulu, HI, USA."},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Gr\u00fcnwald, P., Myung, I.J., and Pitt, M.A. (2005). Minimum Message Length and Generalized Bayesian Nets with Asymmetric Languages. Advances in Minimum Description Length: Theory and Applications, MIT Press. Chapter 11.","DOI":"10.7551\/mitpress\/1114.001.0001"},{"key":"ref_67","unstructured":"Makalic, E., and Schmidt, D.F. (2023). MML Probabilistic Principal Component Analysis. arXiv."},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Hlav\u00e1\u010dkov\u00e1-Schindler, K., and Plant, C. (2020). Heterogeneous Graphical Granger Causality by Minimum Message Length. Entropy, 22.","DOI":"10.3390\/e22121400"},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Makalic, E., and Schmidt, D.F. (2021). Minimum Message Length Inference of the Exponential Distribution with Type I Censoring. Entropy, 23.","DOI":"10.3390\/e23111439"},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Dowe, D.L., and Zaidi, N.A. (2010, January 7\u201310). Database Normalization as a By-Product of Minimum Message Length Inference. Proceedings of the 23rd Australian Joint Conference on Artificial Intelligence (AI\u20192010), Adelaide, Australia. Lecture Notes in Artificial Intelligence.","DOI":"10.1007\/978-3-642-17432-2_9"},{"key":"ref_71","unstructured":"Torsello, A., and Dowe, D.L. (2008, January 3\u20135). Learning a Generative Model for Structural Representations. Proceedings of the 21st Australasian Joint Conference on Artificial Intelligence (AI08), Auckland, New Zealand. Lecture Notes in Artificial Intelligence (LNAI) 5360."},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Torsello, A., and Dowe, D.L. (2008, January 8\u201311). Supervised Learning of a Generative Model for Edge-Weighted Graphs. Proceedings of the 19th International Conference on Pattern Recognition (ICPR08), Tampa, FL, USA.","DOI":"10.1109\/ICPR.2008.4761285"},{"key":"ref_73","unstructured":"Wallace, C.S., and Dowe, D.L. (1994, January 21\u201325). Intrinsic classification by MML\u2014The Snob program. Proceedings of the 7th Australian Joint Conference on Artificial Intelligence, Armidale, Australia."},{"key":"ref_74","unstructured":"Edwards, R.T., and Dowe, D.L. (1998, January 15\u201317). Single Factor Analysis in MML Mixture Modelling. Proceedings of the 2nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 1998), Melbourne, Australia. Lecture Notes in Artificial Intelligence."},{"key":"ref_75","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1023\/A:1008992619036","article-title":"MML clustering of multi-state, Poisson, von Mises circular and Gaussian distributions","volume":"10","author":"Wallace","year":"2000","journal-title":"Stat. Comput."},{"key":"ref_76","unstructured":"Edgoose, T., Allison, L., and Dowe, D.L. (1998, January 4\u20139). An MML classification of protein structure that knows about angles and sequence. Proceedings of the Pacific Symposium on Biocomputing, Maui, HI, USA."},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"1429","DOI":"10.1109\/TPAMI.2008.155","article-title":"A Hybrid Feature Extraction Selection Approach for High-Dimensional Non-Gaussian Data Clustering","volume":"31","author":"Boutemedjet","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_78","doi-asserted-by":"crossref","unstructured":"Araya, T.W., Azam, M., Bouguila, N., and Bentahar, J. (2024, January 22\u201325). Multivariate Bounded Support Kotz Mixture Model with Minimum Message Length Criterion. Proceedings of the 2024 International Symposium on Networks, Computers and Communications (ISNCC), Washington, DC, USA.","DOI":"10.1109\/ISNCC62547.2024.10759059"},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Saikrishna, V., Dowe, D.L., and Ray, S. (2022). MML learning and inference of hierarchical Probabilistic Finite State Machines. Applied Data Analytics\u2014Principles and Applications, River Publishers. Chapter 18.","DOI":"10.1201\/9781003337225-18"},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Dowe, D.L., Oliver, J.J., and Wallace, C.S. (1996, January 23\u201325). MML estimation of the parameters of the spherical Fisher distribution. Proceedings of the International Workshop on Algorithmic Learning Theory, Sydney, NSW, Australia.","DOI":"10.1007\/3-540-61863-5_48"},{"key":"ref_81","doi-asserted-by":"crossref","unstructured":"Bickerstaffe, A.C., and Makalic, E. (2003, January 3\u20135). MML classification of music genres. Proceedings of the Australasian Joint Conference on Artificial Intelligence, Perth, Australia.","DOI":"10.1007\/978-3-540-24581-0_91"},{"key":"ref_82","first-page":"242","article-title":"Circular clustering of protein dihedral angles by minimum message length","volume":"Volume 96","author":"Dowe","year":"1996","journal-title":"Pacific Symposium on Biocomputing"},{"key":"ref_83","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1038\/s41586-024-08002-x","article-title":"A Prenatal Skin Atlas Reveals Immune Regulation of Human Skin Morphogenesis","volume":"635","author":"Gopee","year":"2024","journal-title":"Nature"},{"key":"ref_84","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1093\/biomet\/asq017","article-title":"The horseshoe estimator for sparse signals","volume":"97","author":"Carvalho","year":"2010","journal-title":"Biometrika"},{"key":"ref_85","first-page":"105","article-title":"Shrink globally, act locally: Sparse Bayesian regularization and prediction","volume":"9","author":"Polson","year":"2010","journal-title":"Bayesian Stat."},{"key":"ref_86","unstructured":"Xu, Z., Schmidt, D.F., Makalic, E., Qian, G., and Hopper, J.L. (2017). Bayesian sparse global-local shrinkage regression for grouped variables. arXiv."},{"key":"ref_87","doi-asserted-by":"crossref","unstructured":"Schmidt, D.F., and Makalic, E. (2013, January 1\u20136). Minimum message length ridge regression for generalized linear models. Proceedings of the Australasian Joint Conference on Artificial Intelligence, Dunedin, New Zealand.","DOI":"10.1007\/978-3-319-03680-9_41"},{"key":"ref_88","doi-asserted-by":"crossref","first-page":"2100259","DOI":"10.1002\/andp.202100259","article-title":"The optimal lattice quantizer in nine dimensions","volume":"533","author":"Allen","year":"2021","journal-title":"Ann. Der Phys."},{"key":"ref_89","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1137\/0604005","article-title":"The optimal lattice quantizer in three dimensions","volume":"4","author":"Barnes","year":"1983","journal-title":"SIAM J. Algebr. Discret. Methods"},{"key":"ref_90","unstructured":"Conway, J.H., and Sloane, N.J.A. (2013). Sphere Packings, Lattices and Groups, Springer."},{"key":"ref_91","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1214\/09-STS284D","article-title":"Bayes, Jeffreys, prior distributions and the philosophy of statistics","volume":"24","author":"Gelman","year":"2009","journal-title":"Stat. Sci."},{"key":"ref_92","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1093\/comjnl\/42.4.330","article-title":"Refinements of MDL and MML coding","volume":"42","author":"Wallace","year":"1999","journal-title":"Comput. J."},{"key":"ref_93","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1214\/14-BA915","article-title":"Overall Objective Priors","volume":"10","author":"Berger","year":"2015","journal-title":"Bayesian Anal."},{"key":"ref_94","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1111\/j.2517-6161.1979.tb01066.x","article-title":"Reference posterior distributions for Bayesian inference","volume":"41","author":"Bernardo","year":"1979","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."},{"key":"ref_95","doi-asserted-by":"crossref","unstructured":"Schmidt, D.F., and Makalic, E. (2012, January 4\u20137). Minimum message length inference and mixture modelling of inverse Gaussian distributions. Proceedings of the Australasian Joint Conference on Artificial Intelligence, Sydney, Australia.","DOI":"10.1007\/978-3-642-35101-3_57"},{"key":"ref_96","unstructured":"Leininger, T.J. (2014). Bayesian Analysis of Spatial Point Patterns. [Ph.D. Thesis, Department of Statistical Science, Duke University]."},{"key":"ref_97","unstructured":"Dalling, J., John, R., Harms, K., Stallard, R., and Yavitt, J. (2018, June 18). Soil Maps of Barro Colorado Island 50 ha Plot; ForestGEO. Available online: http:\/\/ctfs.si.edu\/webatlas\/datasets\/bci\/soilmaps\/BCIsoil.html."},{"key":"ref_98","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1016\/j.stamet.2013.08.001","article-title":"Variable selection for spatial Poisson point processes via a regularization method","volume":"17","author":"Thurman","year":"2014","journal-title":"Stat. Methodol."},{"key":"ref_99","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1080\/02691720802576291","article-title":"Minimum Message Length and statistically consistent invariant (objective?) Bayesian probabilistic inference\u2014From (medical) \u201cevidence\u201d","volume":"22","author":"Dowe","year":"2008","journal-title":"Soc. Epistemol."},{"key":"ref_100","doi-asserted-by":"crossref","unstructured":"Dowe, D.L. (2013). Introduction to Ray Solomonoff 85th Memorial Conference. Algorithmic Probability and Friends. Bayesian Prediction and Artificial Intelligence: Papers from the Ray Solomonoff 85th Memorial Conference, Melbourne, VIC, Australia, 30 November\u20132 December 2011, Springer.","DOI":"10.1007\/978-3-642-44958-1_1"},{"key":"ref_101","doi-asserted-by":"crossref","first-page":"1360","DOI":"10.1214\/08-AOAS191","article-title":"A weakly informative default prior distribution for logistic and other regression models","volume":"2","author":"Gelman","year":"2008","journal-title":"Ann. Appl. Stat."},{"key":"ref_102","doi-asserted-by":"crossref","unstructured":"Venables, W., and Ripley, B. (2003). Modern Applied Statistics with S, Springer. Statistics and Computing.","DOI":"10.1007\/978-0-387-21706-2"},{"key":"ref_103","first-page":"1717","article-title":"A Generalization of the Neyman-Scott Process","volume":"22","author":"Yau","year":"2012","journal-title":"Stat. Sin."},{"key":"ref_104","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1111\/j.2517-6161.1984.tb01290.x","article-title":"Monte Carlo methods of inference for implicit statistical models","volume":"46","author":"Diggle","year":"1984","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."},{"key":"ref_105","unstructured":"Fern, K. (2023, July 29). Tropical Plants Database; tropical.theferns.info. Available online: https:\/\/tropical.theferns.info\/viewtropical.php?id=Nymphaea+lotus."},{"key":"ref_106","unstructured":"Condit, R., P\u00e9rez, R., Aguilar, S., Lao, S., Foster, R., and Hubbell, S. (2019). Complete data from the Barro Colorado 50-ha plot: 423617 trees, 35 years, 2019 version. Dryad."},{"key":"ref_107","unstructured":"Condit, R., Perez, R., Aguilar, S., Lao, S., Foster, R., and Hubbell, S. (2019). BCI 50-ha Plot Taxonomy, v4. Dataset. Dryad."},{"key":"ref_108","doi-asserted-by":"crossref","unstructured":"S\u00f8rbye, S.H., Illian, J.B., Simpson, D.P., and Burslem, D. (2017). Careful prior specification avoids incautious inference for log-Gaussian Cox point processes. arXiv.","DOI":"10.1111\/rssc.12321"},{"key":"ref_109","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1002\/nav.3800260304","article-title":"Simulation of nonhomogeneous Poisson processes by thinning","volume":"26","author":"Lewis","year":"1979","journal-title":"Nav. Res. Logist. Q."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/27\/1\/6\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T17:00:21Z","timestamp":1760115621000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/27\/1\/6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,26]]},"references-count":109,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,1]]}},"alternative-id":["e27010006"],"URL":"https:\/\/doi.org\/10.3390\/e27010006","relation":{},"ISSN":["1099-4300"],"issn-type":[{"type":"electronic","value":"1099-4300"}],"subject":[],"published":{"date-parts":[[2024,12,26]]}}}