{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,11]],"date-time":"2025-09-11T19:09:04Z","timestamp":1757617744885,"version":"3.44.0"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T00:00:00Z","timestamp":1744243200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T00:00:00Z","timestamp":1744243200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"AlmaLaurea"},{"name":"AlmaLaurea"},{"DOI":"10.13039\/501100010607","name":"Universit\u00e0 degli Studi di Perugia","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100010607","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Adv Data Anal Classif"],"published-print":{"date-parts":[[2025,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Since 1998, AlmaLaurea\u2014a consortium of 80 Italian universities and a member of the Italian National Statistical System\u2014has conducted an annual census on graduates\u2019 employment status. The survey provides estimates of descriptive indicators at both the population level and for specific subpopulations (domains) of interest, such as degree programmes. Some domains have very few observations due to a small population size and non-response. In this paper, we address this estimation problem within a Small Area Estimation framework. Specifically, we propose using generalized linear mixed models that incorporate two variables as proxies for graduates\u2019 response propensity, making the assumption of non-informative non-response more plausible. Degree programme estimates of employment rates are derived as (semi-parametric) empirical best predictions using a finite mixture of logistic regression models, with their mean squared error estimated via a second-order, bias-corrected, analytical estimator. Sensitivity analysis is conducted to assess the explanatory power of variables modelling response propensity and to evaluate potential correlations between area-specific random effects and observed heterogeneity.<\/jats:p>","DOI":"10.1007\/s11634-025-00630-z","type":"journal-article","created":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T04:57:52Z","timestamp":1744261072000},"page":"515-543","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["When non-response makes estimates from a census a small area estimation problem: the case of the survey on graduates\u2019 employment status in Italy"],"prefix":"10.1007","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7005-8572","authenticated-orcid":false,"given":"Maria Giovanna","family":"Ranalli","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fulvia","family":"Pennoni","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Francesco","family":"Bartolucci","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Antonietta","family":"Mira","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,4,10]]},"reference":[{"key":"630_CR1","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1007\/BF00140869","volume":"6","author":"M Aitkin","year":"1996","unstructured":"Aitkin M (1996) A general maximum likelihood analysis of overdispersion in generalized linear models. Stat Comput 6:251\u2013262","journal-title":"Stat Comput"},{"key":"630_CR2","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1111\/j.0006-341X.1999.00117.x","volume":"55","author":"M Aitkin","year":"1999","unstructured":"Aitkin M (1999) A general maximum likelihood analysis of variance components in generalized linear models. Biometrics 55:117\u2013128","journal-title":"Biometrics"},{"key":"630_CR3","first-page":"267","volume-title":"Second international symposium of information theory","author":"H Akaike","year":"1973","unstructured":"Akaike H (1973) Information theory and an extension of the maximum likelihood principle. In: Petrov BN, Csaki F (eds) Second international symposium of information theory. Akademiai Kiado, Budapest, pp 267\u2013281"},{"key":"630_CR4","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1111\/rssc.12452","volume":"70","author":"M Alf\u00f2","year":"2021","unstructured":"Alf\u00f2 M, Marino MF, Ranalli MG, Salvati N, Tzavidis N (2021) M-quantile regression for multivariate longitudinal data with an application to the millennium cohort study. J R Stat Soc Ser C 70:122\u2013146","journal-title":"J R Stat Soc Ser C"},{"key":"630_CR5","unstructured":"AlmaLaurea (2022) 24th Report Occupational Condition of Graduates, 2022 Summary Report. https:\/\/www.almalaurea.it\/sites\/default\/files\/2022-09\/sintesi_occupazione_rapporto_2022_en.pdf. Accessed 27 Feb 2024"},{"key":"630_CR6","unstructured":"AlmaLaurea (2023) Graduates\u2019 employment status, data. https:\/\/www2.almalaurea.it\/cgi-php\/universita\/statistiche\/tendine.php?anno=2021&LANG=en&config=occupazione. Accessed 27 Feb 2024"},{"key":"630_CR7","volume-title":"Latent Markov models for longitudinal data","author":"F Bartolucci","year":"2013","unstructured":"Bartolucci F, Farcomeni A, Pennoni F (2013) Latent Markov models for longitudinal data. Chapman and Hall\/CRC Press, Boca Raton"},{"key":"630_CR8","doi-asserted-by":"publisher","first-page":"425","DOI":"10.1146\/annurev-statistics-040220-091910","volume":"9","author":"F Bartolucci","year":"2022","unstructured":"Bartolucci F, Pandolfi S, Pennoni F (2022) Discrete latent variable models. Annu Rev Stat Its Appl 9:425\u2013452","journal-title":"Annu Rev Stat Its Appl"},{"key":"630_CR9","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1080\/01621459.1988.10478561","volume":"83","author":"GE Battese","year":"1988","unstructured":"Battese GE, Harter RM, Fuller WA (1988) An error-components model for prediction of county crop areas using survey and satellite data. J Am Stat Assoc 83:28\u201336","journal-title":"J Am Stat Assoc"},{"key":"630_CR10","doi-asserted-by":"publisher","first-page":"1051","DOI":"10.1007\/s11135-018-0802-x","volume":"53","author":"A Bell","year":"2019","unstructured":"Bell A, Fairbrother M, Jones K (2019) Fixed and random effects models: making an informed choice. Qual Quant 53:1051\u20131074","journal-title":"Qual Quant"},{"key":"630_CR11","first-page":"167","volume":"44","author":"G Bertarelli","year":"2018","unstructured":"Bertarelli G, Ranalli MG, Bartolucci F, D\u2019Al\u00f2 M, Solari F (2018) Small area estimation for unemployment using latent Markov models. Surv Methodol 44:167\u2013192","journal-title":"Surv Methodol"},{"key":"630_CR12","unstructured":"Brown G, Chambers R, Heady P, Heasman D (2001) Evaluation of small area estimation methods: an application to unemployment estimates from the UK LFS. In: Proceedings of statistics Canada symposium. Statistics Canada, vol 2001, pp 1\u201310"},{"key":"630_CR13","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","volume":"39","author":"A Dempster","year":"1977","unstructured":"Dempster A, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39:1\u201338","journal-title":"J R Stat Soc B"},{"key":"630_CR14","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1080\/01621459.1979.10482505","volume":"74","author":"RE Fay","year":"1979","unstructured":"Fay RE, Herriot RA (1979) Estimates of income for small places: an application of James-Stein procedures to census data. J Am Stat Assoc 74:269\u2013277","journal-title":"J Am Stat Assoc"},{"key":"630_CR15","volume-title":"Sampling statistics","author":"WA Fuller","year":"2011","unstructured":"Fuller WA (2011) Sampling statistics. Wiley, New Jersey"},{"key":"630_CR16","doi-asserted-by":"publisher","first-page":"2720","DOI":"10.1016\/j.csda.2006.01.012","volume":"51","author":"W Gonz\u00e1lez-Manteiga","year":"2007","unstructured":"Gonz\u00e1lez-Manteiga W, Lombard\u00eda M, Molina I, Morales D, Santamar\u00eda L (2007) Estimation of the mean squared error of predictors of small area linear parameters under a logistic mixed model. Comput Stat Data Anal 51:2720\u20132733","journal-title":"Comput Stat Data Anal"},{"key":"630_CR17","doi-asserted-by":"publisher","first-page":"661","DOI":"10.1515\/jos-2016-0034","volume":"32","author":"T Hobza","year":"2016","unstructured":"Hobza T, Morales D (2016) Empirical best prediction under unit-level logit mixed models. J Off Stat 32:661\u2013692","journal-title":"J Off Stat"},{"key":"630_CR18","doi-asserted-by":"publisher","first-page":"720","DOI":"10.1080\/01621459.1998.10473724","volume":"93","author":"J Jiang","year":"1998","unstructured":"Jiang J (1998) Consistent estimators in generalized linear mixed models. J Am Stat Assoc 93:720\u2013729","journal-title":"J Am Stat Assoc"},{"key":"630_CR19","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1016\/S0378-3758(02)00293-8","volume":"111","author":"J Jiang","year":"2003","unstructured":"Jiang J (2003) Empirical best prediction for small-area inference based on generalized linear mixed models. J Stat Plan Inference 111:117\u2013127","journal-title":"J Stat Plan Inference"},{"key":"630_CR20","doi-asserted-by":"publisher","first-page":"217","DOI":"10.1023\/A:1012410420337","volume":"53","author":"J Jiang","year":"2001","unstructured":"Jiang J, Lahiri P (2001) Empirical best prediction for small area inference with binary data. Ann Inst Stat Math 53:217\u2013243","journal-title":"Ann Inst Stat Math"},{"key":"630_CR21","doi-asserted-by":"publisher","first-page":"941","DOI":"10.1111\/rssa.12696","volume":"184","author":"JK Kim","year":"2021","unstructured":"Kim JK, Park S, Chen Y, Wu C (2021) Combining non-probability and probability survey samples through mass imputation. J R Stat Soc Ser A Stat Soc 184:941\u2013963","journal-title":"J R Stat Soc Ser A Stat Soc"},{"key":"630_CR22","first-page":"133","volume":"32","author":"PS Kott","year":"2006","unstructured":"Kott PS (2006) Using calibration weighting to adjust for nonresponse and coverage errors. Surv Methodol 32:133\u2013142","journal-title":"Surv Methodol"},{"key":"630_CR23","doi-asserted-by":"publisher","first-page":"805","DOI":"10.1080\/01621459.1978.10480103","volume":"73","author":"N Laird","year":"1978","unstructured":"Laird N (1978) Nonparametric maximum likelihood estimation of a mixing distribution. J Am Stat Assoc 73:805\u2013811","journal-title":"J Am Stat Assoc"},{"key":"630_CR24","doi-asserted-by":"publisher","first-page":"963","DOI":"10.2307\/2529876","volume":"38","author":"N Laird","year":"1982","unstructured":"Laird N, Ware J (1982) Random-effects models for longitudinal data. Biometrics 38:963\u2013974","journal-title":"Biometrics"},{"key":"630_CR25","doi-asserted-by":"publisher","first-page":"1166","DOI":"10.1214\/18-AOAS1226","volume":"13","author":"MF Marino","year":"2019","unstructured":"Marino MF, Ranalli MG, Salvati N, Alf\u00f2 M (2019) Semiparametric empirical best prediction for small area estimation of unemployment indicators. Ann Appl Stat 13:1166\u20131197","journal-title":"Ann Appl Stat"},{"key":"630_CR26","first-page":"145","volume":"41","author":"A Matei","year":"2015","unstructured":"Matei A, Ranalli MG (2015) Dealing with non-ignorable nonresponse in survey sampling: a latent modeling approach. Surv Methodol 41:145\u2013165","journal-title":"Surv Methodol"},{"key":"630_CR27","doi-asserted-by":"publisher","first-page":"355","DOI":"10.1146\/annurev-statistics-031017-100325","volume":"6","author":"GJ McLachlan","year":"2019","unstructured":"McLachlan GJ, Lee SX, Rathnayake SI (2019) Finite mixture models. Annu Rev Stat Its Appl 6:355\u2013378","journal-title":"Annu Rev Stat Its Appl"},{"key":"630_CR28","doi-asserted-by":"crossref","unstructured":"Morales D, Esteban MD, P\u00e9rez A, Hobza T (2021) A course on small area estimation and mixed models. Methods, Theory and Applications in R. Springer Nature","DOI":"10.1007\/978-3-030-63757-6"},{"key":"630_CR29","doi-asserted-by":"crossref","unstructured":"Mundlak Y (1978) On the pooling of time series and cross section data. Econom J Econom Soc 69\u201385","DOI":"10.2307\/1913646"},{"key":"630_CR30","doi-asserted-by":"publisher","first-page":"479","DOI":"10.1111\/1467-9868.00188","volume":"61","author":"D Oakes","year":"1999","unstructured":"Oakes D (1999) Direct calculation of the information matrix via the EM. J R Stat Soc B 61:479\u2013482","journal-title":"J R Stat Soc B"},{"key":"630_CR31","doi-asserted-by":"publisher","DOI":"10.1002\/9781118735855","volume-title":"Small area estimation","author":"JNK Rao","year":"2015","unstructured":"Rao JNK, Molina I (2015) Small area estimation. Wiley, New Jersey"},{"key":"630_CR32","unstructured":"Saei A, Chambers R (2003) Small area estimation under linear and generalized linear mixed models with time and area effects, S3RI Methodology Working Papers. Southampton: Southampton Statistical Sciences Research Institute, pp 1\u201335"},{"key":"630_CR33","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4612-4378-6","volume-title":"Model assisted survey sampling","author":"CE S\u00e4rndal","year":"1992","unstructured":"S\u00e4rndal CE, Swensson B, Wretman J (1992) Model assisted survey sampling. Springer-Verlag, New York"},{"key":"630_CR34","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1093\/jrsssa\/qnae115","volume":"188","author":"F Schirripa Spagnolo","year":"2025","unstructured":"Schirripa Spagnolo F, Bertarelli G, Summa D, Scannapieco M, Pratesi M, Marchetti S, Salvati N (2025) Inference for big data assisted by small area methods: an application on sustainable development goals sensitivity of enterprises in Italy. J R Stat Soc Ser A 188:27\u201345","journal-title":"J R Stat Soc Ser A"},{"key":"630_CR35","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1214\/aos\/1176344136","volume":"6","author":"G Schwarz","year":"1978","unstructured":"Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461\u2013464","journal-title":"Ann Stat"},{"key":"630_CR36","unstructured":"Statistics Canada (2016) Guide to the labour force survey. Technical report, Statistics Canada, Catalogue 71-543-G. http:\/\/www.statcan.gc.ca\/pub\/71-543-g\/71-543-g2016001-eng.pdf"},{"key":"630_CR37","doi-asserted-by":"publisher","first-page":"981","DOI":"10.1111\/rssa.12362","volume":"181","author":"M Sverchkov","year":"2018","unstructured":"Sverchkov M, Pfeffermann D (2018) Small area estimation under informative sampling and not missing at random non-response. J R Stat Soc Ser A Stat Soc 181:981\u20131008","journal-title":"J R Stat Soc Ser A Stat Soc"},{"key":"630_CR38","doi-asserted-by":"publisher","first-page":"625","DOI":"10.1007\/s42081-020-00093-w","volume":"3","author":"S Yang","year":"2020","unstructured":"Yang S, Kim JK (2020) Statistical data integration in survey sampling: a review. Jpn J Stat Data Sci 3:625\u2013650","journal-title":"Jpn J Stat Data Sci"}],"container-title":["Advances in Data Analysis and Classification"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-025-00630-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11634-025-00630-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-025-00630-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,6]],"date-time":"2025-09-06T10:46:48Z","timestamp":1757155608000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11634-025-00630-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,4,10]]},"references-count":38,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2025,6]]}},"alternative-id":["630"],"URL":"https:\/\/doi.org\/10.1007\/s11634-025-00630-z","relation":{},"ISSN":["1862-5347","1862-5355"],"issn-type":[{"type":"print","value":"1862-5347"},{"type":"electronic","value":"1862-5355"}],"subject":[],"published":{"date-parts":[[2025,4,10]]},"assertion":[{"value":"29 February 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 January 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 February 2025","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 April 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}