IDEAS home Printed from https://ideas.repec.org/a/csb/stintr/v16y2015i4p523-540.html

Covariate selection for small area estimation in repeated sample surveys

Author

Listed:
  • Bart Buelens

  • Jan A. van den Brakel

Abstract

If the implementation of small area estimation methods to multiple editions of a repeated sample survey is considered, then the question arises which covariates to use in the models. Applying standard model selection procedures independently to the different editions of the survey may identify different sets of covariates for each edition. If the small area predictions are sensitive to the different models, this is undesirable in official statistics since monitoring change over time of statistical quantities is of utmost importance. Therefore, potential confounding of true change and methodological alterations should be avoided. An approach to model selection is proposed resulting in a single set of covariates for multiple survey editions. This is achieved through conducting covariate selection simultaneously for all editions, minimizing the average of the edition-specific conditional Akaike Information Criteria. Consecutive editions of the Dutch crime victimization survey are used as a case study. Municipal estimates of three survey variables are obtained using area level models. The proposed averaging strategy is compared to the standard method of considering each edition separately, and to an elementary approach using covariates selected in the first edition. Resulting models, point estimates and MSE estimates are analyzed, indicating no substantial adverse effects of the conceptually attractive averaging strategy.

Suggested Citation

  • Bart Buelens & Jan A. van den Brakel, 2015. "Covariate selection for small area estimation in repeated sample surveys," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 16(4), pages 523-540, December.
  • Handle: RePEc:csb:stintr:v:16:y:2015:i:4:p:523-540
    as

    Download full text from publisher

    File URL: http://index.stat.gov.pl/repec/files/csb/stintr/csb_stintr_v16_2015_i4_n5.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Claeskens,Gerda & Hjort,Nils Lid, 2008. "Model Selection and Model Averaging," Cambridge Books, Cambridge University Press, number 9780521852258, August.
    2. Florin Vaida & Suzette Blanchard, 2005. "Conditional Akaike information for mixed-effects models," Biometrika, Biometrika Trust, vol. 92(2), pages 351-370, June.
    3. Harm Jan Boonstra & Jan A. Van Den Brakel & Bart Buelens & Sabine Krieg & Marc Smeets, 2008. "Towards small area estimation at Statistics Netherlands," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(1), pages 21-49.
    4. Pfeffermann, Danny & Tiller, Richard, 2006. "Small-Area Estimation With StateSpace Models Subject to Benchmark Constraints," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1387-1397, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bijlsma Ineke & van den Brakel Jan & van der Velden Rolf & Allen Jim, 2020. "Estimating Literacy Levels at a Detailed Regional Level: an Application Using Dutch Data," Journal of Official Statistics, Sciendo, vol. 36(2), pages 251-274, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. van den Brakel Jan A. & Buelens Bart, 2015. "Covariate Selection for Small Area Estimation in Repeated Sample Surveys," Statistics in Transition New Series, Statistics Poland, vol. 16(4), pages 523-540, December.
    2. Jan A. van den Brakel & Bart Buelens, 2015. "Covariate Selection For Small Area Estimation In Repeated Sample Surveys," Statistics in Transition New Series, Polish Statistical Association, vol. 16(4), pages 523-540, December.
    3. Bijlsma Ineke & van den Brakel Jan & van der Velden Rolf & Allen Jim, 2020. "Estimating Literacy Levels at a Detailed Regional Level: an Application Using Dutch Data," Journal of Official Statistics, Sciendo, vol. 36(2), pages 251-274, June.
    4. Braun, Julia & Sabanés Bové, Daniel & Held, Leonhard, 2014. "Choice of generalized linear mixed models using predictive crossvalidation," Computational Statistics & Data Analysis, Elsevier, vol. 75(C), pages 190-202.
    5. Xinyu Zhang & Alan T. K. Wan & Sherry Z. Zhou, 2011. "Focused Information Criteria, Model Selection, and Model Averaging in a Tobit Model With a Nonzero Threshold," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 30(1), pages 132-142, June.
    6. Julia Braun & Leonhard Held & Bruno Ledergerber, 2012. "Predictive Cross-validation for the Choice of Linear Mixed-Effects Models with Application to Data from the Swiss HIV Cohort Study," Biometrics, The International Biometric Society, vol. 68(1), pages 53-61, March.
    7. Yan Li & Partha Lahiri, 2019. "A Simple Adaptation of Variable Selection Software for Regression Models to Select Variables in Nested Error Regression Models," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(2), pages 302-317, December.
    8. Wei, Yuting & Wang, Qihua & Duan, Xiaogang & Qin, Jing, 2021. "Bias-corrected Kullback–Leibler distance criterion based model selection with covariables missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    9. Krieg, Sabine & van den Brakel, Jan A., 2012. "Estimation of the monthly unemployment rate for six domains through structural time series modelling with cointegrated trends," Computational Statistics & Data Analysis, Elsevier, vol. 56(10), pages 2918-2933.
    10. Jie Huang & Haiming Zhou & Nader Ebrahimi, 2022. "Bayesian Bivariate Cure Rate Models Using Copula Functions," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 11(3), pages 1-9, May.
    11. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    12. Davide Fiaschi & Andrea Mario Lavezzi & Angela Parenti, 2020. "Deep and Proximate Determinants of the World Income Distribution," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 66(3), pages 677-710, September.
    13. Zhongqi Liang & Qihua Wang & Yuting Wei, 2022. "Robust model selection with covariables missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 539-557, June.
    14. Kurisu, Daisuke & Otsu, Taisuke, 2025. "Model averaging for global Fréchet regression," LSE Research Online Documents on Economics 126533, London School of Economics and Political Science, LSE Library.
    15. Bhattacharya, Debopam & Dupas, Pascaline, 2012. "Inferring welfare maximizing treatment assignment under budget constraints," Journal of Econometrics, Elsevier, vol. 167(1), pages 168-196.
    16. Schomaker Michael & Heumann Christian, 2011. "Model Averaging in Factor Analysis: An Analysis of Olympic Decathlon Data," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(1), pages 1-15, January.
    17. Tumala, Mohammed M & Olubusoye, Olusanya E & Yaaba, Baba N & Yaya, OlaOluwa S & Akanbi, Olawale B, 2017. "Forecasting Nigerian Inflation using Model Averaging methods: Modelling Frameworks to Central Banks," MPRA Paper 88754, University Library of Munich, Germany, revised Feb 2018.
    18. José Manuel Cordero Ferrera & Manuel Muñiz Pérez & Rosa Simancas Rodríguez, 2015. "The influence of socioeconomic factors on cognitive and non-cognitive educational outcomes," Investigaciones de Economía de la Educación volume 10, in: Marta Rahona López & Jennifer Graves (ed.), Investigaciones de Economía de la Educación 10, edition 1, volume 10, chapter 21, pages 413-438, Asociación de Economía de la Educación.
    19. Naoya Sueishi & Arihiro Yoshimura, 2017. "Focused Information Criterion for Series Estimation in Partially Linear Models," The Japanese Economic Review, Japanese Economic Association, vol. 68(3), pages 352-363, September.
    20. Simona Buscemi & Antonella Plaia, 2020. "Model selection in linear mixed-effect models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(4), pages 529-575, December.

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:csb:stintr:v:16:y:2015:i:4:p:523-540. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Beata Witek The email address of this maintainer does not seem to be valid anymore. Please ask Beata Witek to update the entry or send us the correct address (email available below). General contact details of provider: https://edirc.repec.org/data/gusgvpl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.