IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v183y2020i1p121-148.html
   My bibliography  Save this article

Data‐driven transformations in small area estimation

Author

Listed:
  • Natalia Rojas‐Perilla
  • Sören Pannier
  • Timo Schmid
  • Nikos Tzavidis

Abstract

Small area models typically depend on the validity of model assumptions. For example, a commonly used version of the empirical best predictor relies on the Gaussian assumptions of the error terms of the linear mixed regression model: a feature rarely observed in applications with real data. The paper tackles the potential lack of validity of the model assumptions by using data‐driven scaled transformations as opposed to ad hoc chosen transformations. Different types of transformations are explored, the estimation of the transformation parameters is studied in detail under the linear mixed regression model and transformations are used in small area prediction of linear and non‐linear parameters. The use of scaled transformations is crucial as it enables fitting the linear mixed regression model with standard software and hence it simplifies the work of the data analyst. Mean‐squared error estimation that accounts for the uncertainty due to the estimation of the transformation parameters is explored by using the parametric and semiparametric (wild) bootstrap. The methods proposed are illustrated by using real survey and census data for estimating income deprivation parameters for municipalities in the Mexican state of Guerrero. Simulation studies and the results from the application show that using carefully selected, data‐driven transformations can improve small area estimation.

Suggested Citation

  • Natalia Rojas‐Perilla & Sören Pannier & Timo Schmid & Nikos Tzavidis, 2020. "Data‐driven transformations in small area estimation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(1), pages 121-148, January.
  • Handle: RePEc:bla:jorssa:v:183:y:2020:i:1:p:121-148
    DOI: 10.1111/rssa.12488
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssa.12488
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssa.12488?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. J. A. John & N. R. Draper, 1980. "An Alternative Family of Transformations," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 29(2), pages 190-197, June.
    2. Chris Elbers & Jean O. Lanjouw & Peter Lanjouw, 2003. "Micro--Level Estimation of Poverty and Inequality," Econometrica, Econometric Society, vol. 71(1), pages 355-364, January.
    3. Foster, James & Greer, Joel & Thorbecke, Erik, 1984. "A Class of Decomposable Poverty Measures," Econometrica, Econometric Society, vol. 52(3), pages 761-766, May.
    4. Nikos Tzavidis & Li‐Chun Zhang & Angela Luna & Timo Schmid & Natalia Rojas‐Perilla, 2018. "From start to finish: a framework for the production of small area official statistics," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 927-979, October.
    5. Xingdong Feng & Xuming He & Jianhua Hu, 2011. "Wild bootstrap for quantile regression," Biometrika, Biometrika Trust, vol. 98(4), pages 995-999.
    6. Timo Schmid & Fabian Bruckschen & Nicola Salvati & Till Zbiranski, 2017. "Constructing sociodemographic indicators for national statistical institutes by using mobile phone data: estimating literacy rates in Senegal," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(4), pages 1163-1190, October.
    7. Elbers, Chris & van der Weide, Roy, 2014. "Estimation of normal mixtures in a nested error model with an application to small area estimation of poverty and inequality," Policy Research Working Paper Series 6962, The World Bank.
    8. Yang, Zhenlin, 2006. "A modified family of power transformations," Economics Letters, Elsevier, vol. 92(1), pages 14-19, July.
    9. Monique Graf & J. Miguel Marín & Isabel Molina, 2019. "A generalized mixed model for skewed distributions applied to small area estimation," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 565-597, June.
    10. Patrick Royston & Paul C. Lambert, 2011. "Flexible Parametric Survival Analysis Using Stata: Beyond the Cox Model," Stata Press books, StataCorp LP, number fpsaus, March.
    11. Fabrizi, Enrico & Trivisano, Carlo, 2016. "Small area estimation of the Gini concentration coefficient," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 223-234.
    12. Cecilia Tortajada, 2006. "Who Has Access to Water? Case Study of Mexico City Metropolitan Area," Human Development Occasional Papers (1992-2007) HDOCPA-2006-16, Human Development Report Office (HDRO), United Nations Development Programme (UNDP).
    13. Hoeting, Jennifer A. & Ibrahim, Joseph G., 1998. "Bayesian predictive simultaneous variable and transformation selection in the linear model," Computational Statistics & Data Analysis, Elsevier, vol. 28(1), pages 87-103, July.
    14. Matthew J. Gurka & Lloyd J. Edwards & Keith E. Muller & Lawrence L. Kupper, 2006. "Extending the Box–Cox transformation to the linear mixed model," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 169(2), pages 273-288, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Isabel Molina & Paul Corral & Minh Nguyen, 2022. "Estimation of poverty and inequality in small areas: review and discussion," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(4), pages 1143-1166, December.
    2. Paul Walter & Marcus Groß & Timo Schmid & Nikos Tzavidis, 2021. "Domain prediction with grouped income data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1501-1523, October.
    3. Guadarrama, María & Morales, Domingo & Molina, Isabel, 2021. "Time stable empirical best predictors under a unit-level model," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rojas-Perilla, Natalia & Pannier, Sören & Schmid, Timo & Tzavidis, Nikos, 2017. "Data-driven transformations in small area estimation," Discussion Papers 2017/30, Free University Berlin, School of Business & Economics.
    2. Paul Walter & Marcus Groß & Timo Schmid & Nikos Tzavidis, 2021. "Domain prediction with grouped income data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1501-1523, October.
    3. Marchetti Stefano & Tzavidis Nikos, 2021. "Robust Estimation of the Theil Index and the Gini Coeffient for Small Areas," Journal of Official Statistics, Sciendo, vol. 37(4), pages 955-979, December.
    4. Isabel Molina & Paul Corral & Minh Nguyen, 2022. "Estimation of poverty and inequality in small areas: review and discussion," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(4), pages 1143-1166, December.
    5. Nikos Tzavidis & Li‐Chun Zhang & Angela Luna & Timo Schmid & Natalia Rojas‐Perilla, 2018. "From start to finish: a framework for the production of small area official statistics," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 927-979, October.
    6. Patrick Krennmair & Timo Schmid, 2022. "Flexible domain prediction using mixed effects random forests," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1865-1894, November.
    7. Walter, Paul & Groß, Markus & Schmid, Timo & Tzavidis, Nikos, 2017. "Estimation of linear and non-linear indicators using interval censored income data," Discussion Papers 2017/22, Free University Berlin, School of Business & Economics.
    8. Huapeng Li & Yukun Liu & Riquan Zhang, 2019. "Small area estimation under transformed nested-error regression models," Statistical Papers, Springer, vol. 60(4), pages 1397-1418, August.
    9. Aldo Gardini & Enrico Fabrizi & Carlo Trivisano, 2022. "Poverty and inequality mapping based on a unit‐level log‐normal mixture model," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 2073-2096, October.
    10. Kreutzmann, Ann-Kristin & Marek, Philipp & Salvati, Nicola & Schmid, Timo, 2019. "Estimating regional wealth in Germany: How different are East and West really?," Discussion Papers 35/2019, Deutsche Bundesbank.
    11. Bijlsma Ineke & van den Brakel Jan & van der Velden Rolf & Allen Jim, 2020. "Estimating Literacy Levels at a Detailed Regional Level: an Application Using Dutch Data," Journal of Official Statistics, Sciendo, vol. 36(2), pages 251-274, June.
    12. Tomoki Fujii, 2013. "Geographic decomposition of inequality in health and wealth: evidence from Cambodia," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 11(3), pages 373-392, September.
    13. Brown, Caitlin & Ravallion, Martin & van de Walle, Dominique, 2018. "A poor means test? Econometric targeting in Africa," Journal of Development Economics, Elsevier, vol. 134(C), pages 109-124.
    14. Sims, Katharine R.E., 2010. "Conservation and development: Evidence from Thai protected areas," Journal of Environmental Economics and Management, Elsevier, vol. 60(2), pages 94-114, September.
    15. Nguyen Viet CUONG, 2008. "Is A Governmental Micro‐Credit Program For The Poor Really Pro‐Poor? Evidence From Vietnam," The Developing Economies, Institute of Developing Economies, vol. 46(2), pages 151-187, June.
    16. Ethan Ligon & Laura Schechter, 2003. "Measuring Vulnerability," Economic Journal, Royal Economic Society, vol. 113(486), pages 95-102, March.
    17. Molina Isabel, 2020. "Discussion of “Small area estimation: its evolution in five decades”, by Malay Ghosh," Statistics in Transition New Series, Polish Statistical Association, vol. 21(4), pages 40-44, August.
    18. van der Weide, Roy, 2014. "GLS estimation and empirical bayes prediction for linear mixed models with Heteroskedasticity and sampling weights : a background study for the POVMAP project," Policy Research Working Paper Series 7028, The World Bank.
    19. Sebastian Levine & Benjamin Roberts, 2013. "Robust Estimates of Changes in Poverty and Inequality in Post-Independence Namibia," South African Journal of Economics, Economic Society of South Africa, vol. 81(2), pages 167-191, June.
    20. Channing Arndt & Azhar M. Hussain & Vincenzo Salvucci & Finn Tarp & Lars Peter Østerdal, 2016. "Poverty Mapping Based on First‐Order Dominance with an Example from Mozambique," Journal of International Development, John Wiley & Sons, Ltd., vol. 28(1), pages 3-21, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssa:v:183:y:2020:i:1:p:121-148. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.