IDEAS home Printed from https://ideas.repec.org/a/spr/advdac/v19y2025i2d10.1007_s11634-025-00630-z.html
   My bibliography  Save this article

When non-response makes estimates from a census a small area estimation problem: the case of the survey on graduates’ employment status in Italy

Author

Listed:
  • Maria Giovanna Ranalli

    (University of Perugia)

  • Fulvia Pennoni

    (University of Milano-Bicocca)

  • Francesco Bartolucci

    (University of Perugia)

  • Antonietta Mira

    (Università della Svizzera italiana
    University of Insubria)

Abstract

Since 1998, AlmaLaurea—a consortium of 80 Italian universities and a member of the Italian National Statistical System—has conducted an annual census on graduates’ employment status. The survey provides estimates of descriptive indicators at both the population level and for specific subpopulations (domains) of interest, such as degree programmes. Some domains have very few observations due to a small population size and non-response. In this paper, we address this estimation problem within a Small Area Estimation framework. Specifically, we propose using generalized linear mixed models that incorporate two variables as proxies for graduates’ response propensity, making the assumption of non-informative non-response more plausible. Degree programme estimates of employment rates are derived as (semi-parametric) empirical best predictions using a finite mixture of logistic regression models, with their mean squared error estimated via a second-order, bias-corrected, analytical estimator. Sensitivity analysis is conducted to assess the explanatory power of variables modelling response propensity and to evaluate potential correlations between area-specific random effects and observed heterogeneity.

Suggested Citation

  • Maria Giovanna Ranalli & Fulvia Pennoni & Francesco Bartolucci & Antonietta Mira, 2025. "When non-response makes estimates from a census a small area estimation problem: the case of the survey on graduates’ employment status in Italy," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 19(2), pages 515-543, June.
  • Handle: RePEc:spr:advdac:v:19:y:2025:i:2:d:10.1007_s11634-025-00630-z
    DOI: 10.1007/s11634-025-00630-z
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11634-025-00630-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11634-025-00630-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Murray Aitkin, 1999. "A General Maximum Likelihood Analysis of Variance Components in Generalized Linear Models," Biometrics, The International Biometric Society, vol. 55(1), pages 117-128, March.
    2. Jae Kwang Kim & Seho Park & Yilin Chen & Changbao Wu, 2021. "Combining non‐probability and probability survey samples through mass imputation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(3), pages 941-963, July.
    3. Marco Alfò & Maria Francesca Marino & Maria Giovanna Ranalli & Nicola Salvati & Nikos Tzavidis, 2021. "M‐quantile regression for multivariate longitudinal data with an application to the Millennium Cohort Study," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(1), pages 122-146, January.
    4. Gonzalez-Manteiga, W. & Lombardia, M.J. & Molina, I. & Morales, D. & Santamaria, L., 2007. "Estimation of the mean squared error of predictors of small area linear parameters under a logistic mixed model," Computational Statistics & Data Analysis, Elsevier, vol. 51(5), pages 2720-2733, February.
    5. D. Oakes, 1999. "Direct calculation of the information matrix via the EM," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(2), pages 479-482, April.
    6. Andrew Bell & Malcolm Fairbrother & Kelvyn Jones, 2019. "Fixed and random effects models: making an informed choice," Quality & Quantity: International Journal of Methodology, Springer, vol. 53(2), pages 1051-1074, March.
    7. Michael Sverchkov & Danny Pfeffermann, 2018. "Small area estimation under informative sampling and not missing at random non‐response," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 981-1008, October.
    8. Jiming Jiang & P. Lahiri, 2001. "Empirical Best Prediction for Small Area Inference with Binary Data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 53(2), pages 217-243, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Miguel Boubeta & María José Lombardía & Domingo Morales, 2016. "Empirical best prediction under area-level Poisson mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(3), pages 548-569, September.
    2. Ralf Münnich & Jan Pablo Burgard & Siegfried Gabler & Matthias Ganninger & Jan-Philipp Kolb, 2016. "Small Area Estimation In The German Census 2011," Statistics in Transition New Series, Polish Statistical Association, vol. 17(1), pages 25-40, March.
    3. Angelo Moretti, 2023. "Estimation of small area proportions under a bivariate logistic mixed model," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3663-3684, August.
    4. repec:csb:stintr:v:17:y:2016:i:1:p:25-40 is not listed on IDEAS
    5. María Dolores Esteban & María José Lombardía & Esther López‐Vizcaíno & Domingo Morales & Agustín Pérez, 2022. "Empirical best prediction of small area bivariate parameters," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(4), pages 1699-1727, December.
    6. Boubeta, Miguel & Lombardía, María José & Morales, Domingo, 2017. "Poisson mixed models for studying the poverty in small areas," Computational Statistics & Data Analysis, Elsevier, vol. 107(C), pages 32-47.
    7. Tomáš Hobza & Domingo Morales & Laureano Santamaría, 2018. "Small area estimation of poverty proportions under unit-level temporal binomial-logit mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 270-294, June.
    8. M. Giovanna Ranalli & Giorgio E. Montanari & Cecilia Vicarelli, 2018. "Estimation of small area counts with the benchmarking property," METRON, Springer;Sapienza Università di Roma, vol. 76(3), pages 349-378, December.
    9. Herwig Friedl & Göran Kauermann, 2000. "Standard Errors for EM Estimates in Generalized Linear Models with Random Effects," Biometrics, The International Biometric Society, vol. 56(3), pages 761-767, September.
    10. Münnich Ralf & Burgard Jan Pablo & Gabler Siegfried & Ganninger Matthias & Kolb Jan-Philipp, 2016. "Small Area Estimation in the German Census 2011," Statistics in Transition New Series, Statistics Poland, vol. 17(1), pages 25-40, March.
    11. Joscha Krause & Jan Pablo Burgard & Domingo Morales, 2022. "$$\ell _2$$ ℓ 2 -penalized approximate likelihood inference in logit mixed models for regional prevalence estimation under covariate rank-deficiency," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 85(4), pages 459-489, May.
    12. K. Shuvo Bakar & Nicholas Biddle & Philip Kokic & Huidong Jin, 2020. "A Bayesian spatial categorical model for prediction to overlapping geographical areas in sample surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 535-563, February.
    13. Shu Yang & Jae Kwang Kim, 2016. "Likelihood-based Inference with Missing Data Under Missing-at-Random," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(2), pages 436-454, June.
    14. KonShik Kim, 2023. "The impact of job quality on organizational commitment and job satisfaction: The moderating role of socioeconomic status," Economic and Industrial Democracy, Department of Economic History, Uppsala University, Sweden, vol. 44(3), pages 773-797, August.
    15. Baron, Opher & Callen, Jeffrey L. & Segal, Dan, 2023. "Does the bullwhip matter economically? A cross-sectional firm-level analysis," International Journal of Production Economics, Elsevier, vol. 259(C).
    16. Julian P. T. Higgins & Simon G. Thompson & David J. Spiegelhalter, 2009. "A re‐evaluation of random‐effects meta‐analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 172(1), pages 137-159, January.
    17. Araki, Satoshi & Olivos, Francisco, 2025. "Gender inequality, well-being, and ill-being: A macro analysis of human and societal flourishing," Social Science & Medicine, Elsevier, vol. 366(C).
    18. María Dolores Esteban & María José Lombardía & Esther López-Vizcaíno & Domingo Morales & Agustín Pérez, 2020. "Small area estimation of proportions under area-level compositional mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 793-818, September.
    19. Getachew A. Dagne, 2016. "A growth mixture Tobit model: application to AIDS studies," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(7), pages 1174-1185, July.
    20. Wenlu Yuan & Xiaoying Li & Collin Brehmer & Talia Sternbach & Xiang Zhang & Ellison Carter & Yuanxun Zhang & Guofeng Shen & Shu Tao & Jill Baumgartner & Sam Harper, 2025. "Effects of Outdoor and Household Air Pollution on Hand Grip Strength in a Longitudinal Study of Rural Beijing Adults," IJERPH, MDPI, vol. 22(8), pages 1-19, August.
    21. G. Bertarelli & R. Chambers & N. Salvati, 2021. "Outlier robust small domain estimation via bias correction and robust bootstrapping," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 331-357, March.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:advdac:v:19:y:2025:i:2:d:10.1007_s11634-025-00630-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.