IDEAS home Printed from https://ideas.repec.org/a/vrs/offsta/v31y2015i2p177-203n3.html
   My bibliography  Save this article

ReGenesees: an Advanced R System for Calibration, Estimation and Sampling Error Assessment in Complex Sample Surveys

Author

Listed:
  • Zardetto Diego

    (Istat – Italian National Institute of Statistics, Via Cesare Balbo, 16 Rome, Rome 00184, Italy)

Abstract

ReGenesees is a new software system for design-based and model-assisted analysis of complex sample surveys, based on R. As compared to traditional estimation platforms, it ensures easier and safer usage and achieves a dramatic reduction in user workload for both the calibration and the variance estimation tasks. Indeed, ReGenesees allows the specification of calibration models in a symbolic way, using R model formulae. Driven by this symbolic metadata, the system automatically and transparently generates the right values and formats for the auxiliary variables at the sample level, and assists the user in defining and calculating the corresponding population totals. Moreover, ReGenesees can handle arbitrary complex estimators, provided they can be expressed as differentiable functions of Horvitz-Thompson or calibration estimators of totals. Complex estimators can be defined in a completely free fashion: the user only needs to provide the system with the symbolic expression of the estimator as a mathematical function. ReGenesees is in fact able to automatically linearize such complex estimators, so that the estimation of their variance comes at no cost at all to the user. Remarkably, all the innovative features sketched above leverage a particular strong point of the R programming language, namely its ability to process symbolic information.

Suggested Citation

  • Zardetto Diego, 2015. "ReGenesees: an Advanced R System for Calibration, Estimation and Sampling Error Assessment in Complex Sample Surveys," Journal of Official Statistics, Sciendo, vol. 31(2), pages 177-203, June.
  • Handle: RePEc:vrs:offsta:v:31:y:2015:i:2:p:177-203:n:3
    DOI: 10.1515/jos-2015-0013
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jos-2015-0013
    Download Restriction: no

    File URL: https://libkey.io/10.1515/jos-2015-0013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Lumley, Thomas, 2004. "Analysis of Complex Survey Samples," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 9(i08).
    2. G. N. Wilkinson & C. E. Rogers, 1973. "Symbolic Description of Factorial Models for Analysis of Variance," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 22(3), pages 392-399, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Maciej Berk{e}sewicz & Herman Cherniaiev & Robert Pater, 2021. "Estimating the number of entities with vacancies using administrative and online data," Papers 2106.03263, arXiv.org.
    2. J. Michael Brick & Michael E. Jones, 2008. "Propensity to respond and nonresponse bias," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(1), pages 51-73.
    3. Payne, Roger W., 1998. "Design keys, pseudo-factors and general balance," Computational Statistics & Data Analysis, Elsevier, vol. 29(2), pages 217-229, December.
    4. Jacques Muthusi & Samuel Mwalili & Peter Young, 2019. "%svy_logistic_regression: A generic SAS macro for simple and multiple logistic regression and creating quality publication-ready tables using survey or non-survey data," PLOS ONE, Public Library of Science, vol. 14(9), pages 1-14, September.
    5. Jonathan Wakefield & Taylor Okonek & Jon Pedersen, 2020. "Small Area Estimation for Disease Prevalence Mapping," International Statistical Review, International Statistical Institute, vol. 88(2), pages 398-418, August.
    6. Alex Fenton, 2013. "Small-area measures of income poverty," CASE Papers case173, Centre for Analysis of Social Exclusion, LSE.
    7. repec:cep:sticas:/173 is not listed on IDEAS
    8. Mei‐Chih Meg Tseng & Yi‐Ping Lin & Fu‐Chang Hu & Tsun‐Jen Cheng, 2013. "Risks Perception of Electromagnetic Fields in Taiwan: The Influence of Psychopathology and the Degree of Sensitivity to Electromagnetic Fields," Risk Analysis, John Wiley & Sons, vol. 33(11), pages 2002-2012, November.
    9. Raphael Nishimura & James Wagner & Michael Elliott, 2016. "Alternative Indicators for the Risk of Non-response Bias: A Simulation Study," International Statistical Review, International Statistical Institute, vol. 84(1), pages 43-62, April.
    10. Bent Nielsen, 2014. "Deviance analysis of age-period-cohort models," Economics Papers 2014-W03, Economics Group, Nuffield College, University of Oxford.
    11. Camelia Herman & Colleen M. Leonard & Perpetua Uhomoibhi & Mark Maire & Delynn Moss & Uwem Inyang & Ado Abubakar & Abiodun Ogunniyi & Nwando Mba & Stacie M. Greby & McPaul I. Okoye & Nnaemeka C. Iriem, 2023. "Non-falciparum malaria infection and IgG seroprevalence among children under 15 years in Nigeria, 2018," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    12. Elijah O. Onsomu & DaKysha Moore & Benta A. Abuya & Peggy Valentine & Vanessa Duren-Winfield, 2013. "Importance of the Media in Scaling-Up HIV Testing in Kenya," SAGE Open, , vol. 3(3), pages 21582440134, July.
    13. Vinas-Forcade, Jennifer & Seijas, María Noé, 2021. "To teach or not to teach: Negative selection into the teaching profession in Uruguay," International Journal of Educational Development, Elsevier, vol. 84(C).
    14. Lenis, David & Ackerman, Benjamin & Stuart, Elizabeth A., 2018. "Measuring model misspecification: Application to propensity score methods with complex survey data," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 48-57.
    15. Sofia Gil-Clavel & Emilio Zagheni & Valeria Bordone, 2022. "Close Social Networks Among Older Adults: The Online and Offline Perspectives," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(3), pages 1111-1135, June.
    16. Matthew R. Williams & Terrance D. Savitsky, 2021. "Uncertainty Estimation for Pseudo‐Bayesian Inference Under Complex Sampling," International Statistical Review, International Statistical Institute, vol. 89(1), pages 72-107, April.
    17. Jeanna Parsons Leigh & Kirsten Fiest & Rebecca Brundin-Mather & Kara Plotnikoff & Andrea Soo & Emma E Sypes & Liam Whalen-Browne & Sofia B Ahmed & Karen E A Burns & Alison Fox-Robichaud & Shelly Kupsc, 2020. "A national cross-sectional survey of public perceptions of the COVID-19 pandemic: Self-reported beliefs, knowledge, and behaviors," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-18, October.
    18. Wang, Jianqiang C., 2012. "Sample distribution function based goodness-of-fit test for complex surveys," Computational Statistics & Data Analysis, Elsevier, vol. 56(3), pages 664-679.
    19. Alejandro Aybar-Flores & Alvaro Talavera & Elizabeth Espinoza-Portilla, 2023. "Predicting the HIV/AIDS Knowledge among the Adolescent and Young Adult Population in Peru: Application of Quasi-Binomial Logistic Regression and Machine Learning Algorithms," IJERPH, MDPI, vol. 20(7), pages 1-29, March.
    20. Joseph R Starnes & Chiara Di Gravio & Rebecca Irlmeier & Ryan Moore & Vincent Okoth & Ash Rogers & Daniele J Ressler & Troy D Moon, 2021. "Characterizing multidimensional poverty in Migori County, Kenya and its association with depression," PLOS ONE, Public Library of Science, vol. 16(11), pages 1-10, November.
    21. Christian A. Maino Vieytes & Ruoqing Zhu & Francesca Gany & Amirah Burton-Obanla & Anna E. Arthur, 2022. "Empirical Dietary Patterns Associated with Food Insecurity in U.S. Cancer Survivors: NHANES 1999–2018," IJERPH, MDPI, vol. 19(21), pages 1-21, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:offsta:v:31:y:2015:i:2:p:177-203:n:3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.sciendo.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.