{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T14:06:49Z","timestamp":1776866809659,"version":"3.51.2"},"reference-count":37,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2022,11,24]],"date-time":"2022-11-24T00:00:00Z","timestamp":1669248000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Appl. Math. Stat."],"abstract":"<jats:p>In this paper, we present a novel validated penalization method for bias reduction to estimate parameters for the logistic model when data are missing at random (MAR). Specific focus was given to address the data missingness problem among categorical model covariates. We penalize a logit log-likelihood with a novel prior distribution based on the family of the LogF(m,m) generalized distribution. The principle of expectation-maximization with weights was employed with the Louis' method to derive an information matrix, while a closed form for the exact bias was derived following the Cox and Snell's equation. A combination of simulation studies and real life data were used to validate the proposed method. Findings from the validation studies show that our model's standard errors are consistently lower than those derived from other bias reduction methods for the missing at random data mechanism. Consequently, we conclude that in most cases, our method's performance in parameter estimation is superior to the other classical methods for bias reduction when data are MAR.<\/jats:p>","DOI":"10.3389\/fams.2022.1052752","type":"journal-article","created":{"date-parts":[[2022,11,24]],"date-time":"2022-11-24T09:54:40Z","timestamp":1669283680000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Bias reduction in the logistic model parameters with the LogF(1,1) penalty under MAR assumption"],"prefix":"10.3389","volume":"8","author":[{"given":"Muna","family":"Al-Shaaibi","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ronald","family":"Wesonga","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1965","published-online":{"date-parts":[[2022,11,24]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"2563","DOI":"10.1007\/s00180-021-01090-7","article-title":"Reduced-bias estimation of spatial autoregressive models with incompletely geocoded data","volume":"36","author":"Santi","year":"2021","journal-title":"Comput Stat"},{"key":"B2","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1016\/j.jspi.2021.01.005","article-title":"Validation likelihood estimation method for a zero-inflated Bernoulli regression model with missing covariates","volume":"457","author":"Lee","year":"2021","journal-title":"J Stat Plann Infer"},{"key":"B3","doi-asserted-by":"publisher","first-page":"541","DOI":"10.1007\/s00180-020-01012-z","article-title":"Penalized weighted composite quantile regression for partially linear varying coefficient models with missing covariates","volume":"36","author":"Jin","year":"2021","journal-title":"Comput Stat"},{"key":"B4","doi-asserted-by":"publisher","first-page":"563","DOI":"10.1007\/s00180-017-0755-x","article-title":"Logistic regression diagnostics in ridge regression","volume":"33","author":"\u00d6zkale","year":"2018","journal-title":"Comput Stat"},{"key":"B5","doi-asserted-by":"publisher","first-page":"1023","DOI":"10.1002\/sim.1688","article-title":"Increasing the sample size when the unblinded interim result is promising","volume":"23","author":"Chen","year":"2004","journal-title":"Stat Med"},{"key":"B6","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1093\/biomet\/80.1.27","article-title":"Bias reduction of maximum likelihood estimates","volume":"80","author":"Firth","year":"1993","journal-title":"Biometrika"},{"key":"B7","doi-asserted-by":"publisher","first-page":"4216","DOI":"10.1002\/sim.2687","article-title":"A comparative investigation of methods for logistic regression with separated or nearly separated data","volume":"25","author":"Heinze","year":"2006","journal-title":"Stat Med"},{"key":"B8","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1016\/j.jspi.2021.04.003","article-title":"Maximum likelihood estimation of sparse networks with missing observations","volume":"215","author":"Gaucher","year":"2021","journal-title":"J Stat Plann Infer"},{"key":"B9","doi-asserted-by":"publisher","first-page":"1097","DOI":"10.1214\/10-EJS579","article-title":"A generic algorithm for reducing bias in parametric estimation","volume":"4","author":"Kosmidis","year":"2010","journal-title":"Electron J Stat"},{"key":"B10","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1093\/pan\/mpw014","article-title":"Dealing with separation in logistic regression models","volume":"24","author":"Rainey","year":"2016","journal-title":"Polit Anal"},{"key":"B11","volume-title":"Statistical Analysis With Missing Data","author":"Little","year":"2019"},{"key":"B12","doi-asserted-by":"publisher","first-page":"765","DOI":"10.1080\/01621459.1990.10474938","article-title":"Incomplete data in generalized linear models","volume":"85","author":"Ibrahim","year":"1990","journal-title":"J Am Stat Assoc"},{"key":"B13","doi-asserted-by":"publisher","first-page":"1071","DOI":"10.2307\/2533068","article-title":"Parameter estimation from incomplete data in binomial regression when the missing data mechanism is nonignorable","volume":"52","author":"Ibrahim","year":"1996","journal-title":"Biometrics"},{"key":"B14","doi-asserted-by":"publisher","first-page":"2478","DOI":"10.1016\/j.jspi.2010.02.018","article-title":"Bias correction in logistic regression with missing categorical covariates","volume":"140","author":"Das","year":"2010","journal-title":"J Stat Plann Infer"},{"key":"B15","doi-asserted-by":"publisher","first-page":"340","DOI":"10.1080\/00031305.2017.1407359","article-title":"Bias reduction in logistic regression with missing responses when the missing data mechanism is nonignorable","volume":"73","author":"Maity","year":"2018","journal-title":"Am Stat"},{"key":"B16","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1016\/j.jspi.2020.06.004","article-title":"A diagnostic for bias in linear mixed model estimators induced by dependence between the random effects and the corresponding model matrix","volume":"211","author":"Karl","year":"2021","journal-title":"J Stat Plann Infer"},{"key":"B17","doi-asserted-by":"publisher","first-page":"3133","DOI":"10.1002\/sim.6537","article-title":"Penalization, bias reduction, and default priors in logistic and related categorical and survival regressions","volume":"34","author":"Greenland","year":"2015","journal-title":"Stat Med"},{"key":"B18","doi-asserted-by":"publisher","first-page":"e301","DOI":"10.5001\/omj.2021.127","article-title":"Epidemiological risk factors for acquiring severe COVID-19; prospective cohort study","volume":"36","author":"Al Awaidy","year":"2021","journal-title":"Oman Med J"},{"key":"B19","doi-asserted-by":"publisher","first-page":"248","DOI":"10.1111\/j.2517-6161.1968.tb00724.x","article-title":"A general definition of residuals","volume":"30","author":"Cox","year":"1968","journal-title":"J R Stat Soc Ser B"},{"key":"B20","doi-asserted-by":"publisher","first-page":"226","DOI":"10.1111\/j.2517-6161.1982.tb01203.x","article-title":"Finding the observed information matrix when using the EM algorithm","volume":"44","author":"Louis","year":"1982","journal-title":"J R Stat Soc Ser B"},{"key":"B21","doi-asserted-by":"publisher","first-page":"581","DOI":"10.1093\/biomet\/63.3.581","article-title":"Inference and missing data","volume":"63","author":"Rubin","year":"1976","journal-title":"Biometrika"},{"key":"B22","doi-asserted-by":"publisher","DOI":"10.1201\/b13981","author":"Kim","year":"2013","journal-title":"Statistical Methods for Handling Incomplete Data"},{"key":"B23","doi-asserted-by":"publisher","first-page":"246","DOI":"10.1016\/j.jspi.2020.01.001","article-title":"Robust estimation for moment condition models with data missing not at random","volume":"207","author":"Li","year":"2020","journal-title":"J Stat Plann Infer"},{"key":"B24","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1080\/01621459.2020.1764849","article-title":"A penalized regression framework for building polygenic risk models based on summary statistics from genome-wide association studies and incorporating external information","volume":"116","author":"Chen","year":"2021","journal-title":"J Am Stat Assoc"},{"key":"B25","doi-asserted-by":"publisher","first-page":"607","DOI":"10.1093\/biomet\/62.3.607","article-title":"Discrimination among some parametric models","volume":"62","author":"Prentice","year":"1975","journal-title":"Biometrika"},{"key":"B26","author":"Kalbfleisch","year":"2011","journal-title":"The Statistical Analysis of Failure Time Data"},{"key":"B27","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1007\/s001800200098","article-title":"The log F: a distribution for all seasons","volume":"17","author":"Brown","year":"2002","journal-title":"Comput Stat"},{"key":"B28","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1214\/aoms\/1177731681","article-title":"A study of RA Fisher's z distribution and the related F distribution","volume":"12","author":"Aroian","year":"1941","journal-title":"Ann Math Stat"},{"key":"B29","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1080\/10618600.1992.10474576","article-title":"A simple method for computing the observed information matrix when using the EM algorithm with categorical data","volume":"1","author":"Baker","year":"1992","journal-title":"J Comput Graph Stat"},{"key":"B30","doi-asserted-by":"publisher","DOI":"10.1155\/2013\/487062","article-title":"A new second-order iteration method for solving nonlinear equations","author":"Kang","year":"2013","journal-title":"Abstract and Applied Analysis. Vol. 2013"},{"key":"B31","doi-asserted-by":"publisher","first-page":"885","DOI":"10.1080\/02331888.2018.1467419","article-title":"Robust confidence regions for the semi-parametric regression model with responses missing at random","volume":"52","author":"Bindele","year":"2018","journal-title":"Statistics"},{"key":"B32","doi-asserted-by":"publisher","first-page":"1214","DOI":"10.1080\/02331888.2017.1336170","article-title":"Wavelet estimation of density for censored data with censoring indicator missing at random","volume":"51","author":"Zou","year":"2017","journal-title":"Statistics"},{"key":"B33","doi-asserted-by":"publisher","first-page":"568","DOI":"10.1016\/j.jspi.2006.10.017","article-title":"Probability density estimation with data missing at random when covariables are present","volume":"138","author":"Wang","year":"2008","journal-title":"J Stat Plann Infer"},{"key":"B34","doi-asserted-by":"publisher","first-page":"191","DOI":"10.2307\/2347628","article-title":"Ridge estimators in logistic regression","volume":"41","author":"Le Cessie","year":"1992","journal-title":"J R Stat Soc Ser C"},{"key":"B35","doi-asserted-by":"publisher","first-page":"252","DOI":"10.1093\/aje\/kwt245","article-title":"Maximum likelihood, profile likelihood, and penalized likelihood: a primer","volume":"179","author":"Cole","year":"2014","journal-title":"Am J Epidemiol"},{"key":"B36","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1002\/rmv.2146","article-title":"Predictors of COVID-19 severity: a literature review","volume":"31","author":"Gallo Marin","year":"2021","journal-title":"Rev Med Virol"},{"key":"B37","doi-asserted-by":"publisher","first-page":"865845","DOI":"10.3389\/fimmu.2022.865845","article-title":"Identifying immunological and clinical predictors of COVID-19 severity and sequelae by mathematical modeling","volume":"13","author":"Elemam","year":"2022","journal-title":"Front Immunol"}],"container-title":["Frontiers in Applied Mathematics and Statistics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fams.2022.1052752\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,1]],"date-time":"2022-12-01T16:46:41Z","timestamp":1669913201000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fams.2022.1052752\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,24]]},"references-count":37,"alternative-id":["10.3389\/fams.2022.1052752"],"URL":"https:\/\/doi.org\/10.3389\/fams.2022.1052752","relation":{},"ISSN":["2297-4687"],"issn-type":[{"value":"2297-4687","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,11,24]]},"article-number":"1052752"}}