IDEAS home Printed from https://ideas.repec.org/a/vrs/stintr/v16y2015i4p523-540n4.html
   My bibliography  Save this article

Covariate Selection for Small Area Estimation in Repeated Sample Surveys

Author

Listed:
  • van den Brakel Jan A.

    (Statistics Netherlands, Department of Statistical Methods and Maastricht University, Department of Quantitative Economics, Netherlands, ; Netherlands)

  • Buelens Bart

    (Statistics Netherlands, Department of Statistical Methods, Netherlands, ; Netherlands)

Abstract

If the implementation of small area estimation methods to multiple editions of a repeated sample survey is considered, then the question arises which covariates to use in the models. Applying standard model selection procedures independently to the different editions of the survey may identify different sets of covariates for each edition. If the small area predictions are sensitive to the different models, this is undesirable in official statistics since monitoring change over time of statistical quantities is of utmost importance. Therefore, potential confounding of true change and methodological alterations should be avoided. An approach to model selection is proposed resulting in a single set of covariates for multiple survey editions. This is achieved through conducting covariate selection simultaneously for all editions, minimizing the average of the edition-specific conditional Akaike Information Criteria. Consecutive editions of the Dutch crime victimization survey are used as a case study. Municipal estimates of three survey variables are obtained using area level models. The proposed averaging strategy is compared to the standard method of considering each edition separately, and to an elementary approach using covariates selected in the first edition. Resulting models, point estimates and MSE estimates are analyzed, indicating no substantial adverse effects of the conceptually attractive averaging strategy.

Suggested Citation

  • van den Brakel Jan A. & Buelens Bart, 2015. "Covariate Selection for Small Area Estimation in Repeated Sample Surveys," Statistics in Transition New Series, Polish Statistical Association, vol. 16(4), pages 523-540, December.
  • Handle: RePEc:vrs:stintr:v:16:y:2015:i:4:p:523-540:n:4
    DOI: 10.21307/stattrans-2015-031
    as

    Download full text from publisher

    File URL: https://doi.org/10.21307/stattrans-2015-031
    Download Restriction: no

    File URL: https://libkey.io/10.21307/stattrans-2015-031?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Harm Jan Boonstra & Jan A. Van Den Brakel & Bart Buelens & Sabine Krieg & Marc Smeets, 2008. "Towards small area estimation at Statistics Netherlands," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(1), pages 21-49.
    2. Claeskens,Gerda & Hjort,Nils Lid, 2008. "Model Selection and Model Averaging," Cambridge Books, Cambridge University Press, number 9780521852258.
    3. Pfeffermann, Danny & Tiller, Richard, 2006. "Small-Area Estimation With StateSpace Models Subject to Benchmark Constraints," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1387-1397, December.
    4. Florin Vaida & Suzette Blanchard, 2005. "Conditional Akaike information for mixed-effects models," Biometrika, Biometrika Trust, vol. 92(2), pages 351-370, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jan A. van den Brakel & Bart Buelens, 2015. "Covariate Selection For Small Area Estimation In Repeated Sample Surveys," Statistics in Transition New Series, Polish Statistical Association, vol. 16(4), pages 523-540, December.
    2. Bart Buelens & Jan A. van den Brakel, 2015. "Covariate selection for small area estimation in repeated sample surveys," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 16(4), pages 523-540, December.
    3. Bijlsma Ineke & van den Brakel Jan & van der Velden Rolf & Allen Jim, 2020. "Estimating Literacy Levels at a Detailed Regional Level: an Application Using Dutch Data," Journal of Official Statistics, Sciendo, vol. 36(2), pages 251-274, June.
    4. Yan Li & Partha Lahiri, 2019. "A Simple Adaptation of Variable Selection Software for Regression Models to Select Variables in Nested Error Regression Models," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(2), pages 302-317, December.
    5. Braun, Julia & Sabanés Bové, Daniel & Held, Leonhard, 2014. "Choice of generalized linear mixed models using predictive crossvalidation," Computational Statistics & Data Analysis, Elsevier, vol. 75(C), pages 190-202.
    6. Wei, Yuting & Wang, Qihua & Duan, Xiaogang & Qin, Jing, 2021. "Bias-corrected Kullback–Leibler distance criterion based model selection with covariables missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    7. Xinyu Zhang & Alan T. K. Wan & Sherry Z. Zhou, 2011. "Focused Information Criteria, Model Selection, and Model Averaging in a Tobit Model With a Nonzero Threshold," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 30(1), pages 132-142, June.
    8. Krieg, Sabine & van den Brakel, Jan A., 2012. "Estimation of the monthly unemployment rate for six domains through structural time series modelling with cointegrated trends," Computational Statistics & Data Analysis, Elsevier, vol. 56(10), pages 2918-2933.
    9. Julia Braun & Leonhard Held & Bruno Ledergerber, 2012. "Predictive Cross-validation for the Choice of Linear Mixed-Effects Models with Application to Data from the Swiss HIV Cohort Study," Biometrics, The International Biometric Society, vol. 68(1), pages 53-61, March.
    10. Jie Huang & Haiming Zhou & Nader Ebrahimi, 2022. "Bayesian Bivariate Cure Rate Models Using Copula Functions," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 11(3), pages 1-9, May.
    11. Kubokawa, Tatsuya & Nagashima, Bui, 2012. "Parametric bootstrap methods for bias correction in linear mixed models," Journal of Multivariate Analysis, Elsevier, vol. 106(C), pages 1-16.
    12. Kitagawa, Toru & Muris, Chris, 2016. "Model averaging in semiparametric estimation of treatment effects," Journal of Econometrics, Elsevier, vol. 193(1), pages 271-289.
    13. Myung-Jae Hwang & Jong-Hun Kim & Hae-Kwan Cheong, 2020. "Short-Term Impacts of Ambient Air Pollution on Health-Related Quality of Life: A Korea Health Panel Survey Study," IJERPH, MDPI, vol. 17(23), pages 1-11, December.
    14. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    15. Davide Fiaschi & Andrea Mario Lavezzi & Angela Parenti, 2020. "Deep and Proximate Determinants of the World Income Distribution," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 66(3), pages 677-710, September.
    16. Fabio Canova & Christian Matthes, 2021. "Dealing with misspecification in structural macroeconometric models," Quantitative Economics, Econometric Society, vol. 12(2), pages 313-350, May.
    17. Yu, Jun & Meng, Xiran & Wang, Yaping, 2023. "Optimal designs for semi-parametric dose-response models under random contamination," Computational Statistics & Data Analysis, Elsevier, vol. 178(C).
    18. J. N. K. Rao, 2015. "Inferential issues in model-based small area estimation: some new developments," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 16(4), pages 491-510, December.
    19. Zhongqi Liang & Qihua Wang & Yuting Wei, 2022. "Robust model selection with covariables missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 539-557, June.
    20. HAEDO, Christian & MOUCHART , Michel & ,, 2013. "Specialized agglomerations with areal data: model and detection," LIDAM Discussion Papers CORE 2013060, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:stintr:v:16:y:2015:i:4:p:523-540:n:4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.sciendo.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.