IDEAS home Printed from https://ideas.repec.org/p/ese/cempwp/cempa9-25.html

Machine learning regionalisation of input data for microsimulation models: An application of a hybrid GBM / IPF method to build a tax-benefit model for the Essex region in the UK

Author

Listed:
  • Richiardi, Matteo
  • Rejoice, Frimpong

Abstract

Development of microsimulation models often requires reweighting some input dataset to reflect the characteristics of a different population of interest. In this paper we explore a machine learning approach whereas a variant of decision trees (Gradient Boosted Machine) is used to replicate the joint distribution of target variables observed in a large commercially available but slightly biased dataset, with an additional raking step to remove the bias and ensure consistency of relevant marginal distributions with official statistics. The method is applied to build a regional variant of UKMOD, an open-source static tax-benefit model for the UK belonging to the EUROMOD family, with an application to the Greater Essex region in the UK.

Suggested Citation

  • Richiardi, Matteo & Rejoice, Frimpong, 2025. "Machine learning regionalisation of input data for microsimulation models: An application of a hybrid GBM / IPF method to build a tax-benefit model for the Essex region in the UK," Centre for Microsimulation and Policy Analysis Working Paper Series CEMPA9/25, Centre for Microsimulation and Policy Analysis at the Institute for Social and Economic Research.
  • Handle: RePEc:ese:cempwp:cempa9-25
    as

    Download full text from publisher

    File URL: https://www.iser.essex.ac.uk/wp-content/uploads/files/working-papers/cempa/cempa9-25.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Leite, Walter & Zhang, Huibin & collier, zachary & Chawla, Kamal & , l.kong@ufl.edu & Lee, Yongseok & Quan, Jia & Soyoye, Olushola, 2024. "Machine Learning for Propensity Score Estimation: A Systematic Review and Reporting Guidelines," OSF Preprints gmrk7, Center for Open Science.
    2. Richard K. Crump & V. Joseph Hotz & Guido W. Imbens & Oscar A. Mitnik, 2009. "Dealing with limited overlap in estimation of average treatment effects," Biometrika, Biometrika Trust, vol. 96(1), pages 187-199.
    3. King, Gary & Zeng, Langche, 2001. "Logistic Regression in Rare Events Data," Political Analysis, Cambridge University Press, vol. 9(2), pages 137-163, January.
    4. repec:osf:osfxxx:gmrk7_v1 is not listed on IDEAS
    5. Fortin, Nicole & Lemieux, Thomas & Firpo, Sergio, 2011. "Decomposition Methods in Economics," Handbook of Labor Economics, in: O. Ashenfelter & D. Card (ed.), Handbook of Labor Economics, edition 1, volume 4, chapter 1, pages 1-102, Elsevier.
    6. DiNardo, John & Fortin, Nicole M & Lemieux, Thomas, 1996. "Labor Market Institutions and the Distribution of Wages, 1973-1992: A Semiparametric Approach," Econometrica, Econometric Society, vol. 64(5), pages 1001-1044, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Victor Chernozhukov & Iván Fernández‐Val & Blaise Melly, 2013. "Inference on Counterfactual Distributions," Econometrica, Econometric Society, vol. 81(6), pages 2205-2268, November.
    2. Tymon Słoczyński, 2015. "The Oaxaca–Blinder Unexplained Component as a Treatment Effects Estimator," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 77(4), pages 588-604, August.
    3. Thomas Y. Mathä & Alessandro Porpiglia & Michael Ziegelmeyer, 2014. "Wealth differences across borders and the effect of real estate price dynamics: Evidence from two household surveys," BCL working papers 90, Central Bank of Luxembourg.
    4. Benjamín Villena-Roldán, 2024. "Unpacking the Persistence of Informality," Journal of Labor Research, Springer, vol. 45(2), pages 203-231, June.
    5. Naticchioni, Paolo & Ragusa, Giuseppe & Massari, Riccardo, 2014. "Unconditional and Conditional Wage Polarization in Europe," IZA Discussion Papers 8465, IZA Network @ LISER.
    6. Kobus, Martyna & Kapera, Marek & Maasoumi, Esfandiar, 2024. "Gap in many dimensions: Application to gender," Labour Economics, Elsevier, vol. 89(C).
    7. Hill, Matthew J. & Maestas, Nicole & Mullen, Kathleen J., 2016. "Employer accommodation and labor supply of disabled workers," Labour Economics, Elsevier, vol. 41(C), pages 291-303.
    8. Sonja C. Kassenboehmer & Mathias G. Sinning, 2014. "Distributional Changes in the Gender Wage Gap," ILR Review, Cornell University, ILR School, vol. 67(2), pages 335-361, April.
    9. Anna Rosso, 2016. "Skill Transferability and Immigrant-Native Wage Gaps," Development Working Papers 405, Centro Studi Luca d'Agliano, University of Milano, revised 21 Oct 2016.
    10. Fredrik Andersson & Elizabeth E. Davis & Matthew L. Freedman & Julia I. Lane & Brian P. Mccall & Kristin Sandusky, 2012. "Decomposing the Sources of Earnings Inequality: Assessing the Role of Reallocation," Industrial Relations: A Journal of Economy and Society, Wiley Blackwell, vol. 51(4), pages 779-810, October.
    11. Michel Lubrano & Abdoul Aziz Junior Ndoye, 2014. "Bayesian Unconditional Quantile Regression: An Analysis of Recent Expansions in Wage Structure and Earnings Inequality in the US 1992–2009," Scottish Journal of Political Economy, Scottish Economic Society, vol. 61(2), pages 129-153, May.
    12. Domenico Depalo & Raffaela Giordano & Evangelia Papapetrou, 2015. "Public–private wage differentials in euro-area countries: evidence from quantile decomposition analysis," Empirical Economics, Springer, vol. 49(3), pages 985-1015, November.
    13. Kaltenberg, Mary & Foster-McGregor, Neil, 2020. "The impact of automation on inequality across Europe," MERIT Working Papers 2020-009, United Nations University - Maastricht Economic and Social Research Institute on Innovation and Technology (MERIT).
    14. Thomas Grandner & Dieter Gstach, 2015. "Decomposing wage discrimination in Germany and Austria with counterfactual densities," Empirica, Springer;Austrian Institute for Economic Research;Austrian Economic Association, vol. 42(1), pages 49-76, February.
    15. Mathä, Thomas Y. & Porpiglia, Alessandro & Ziegelmeyer, Michael, 2017. "Household wealth in the euro area: The importance of intergenerational transfers, homeownership and house price dynamics," Journal of Housing Economics, Elsevier, vol. 35(C), pages 1-12.
    16. Ferreira,Francisco H. G. & Firpo,Sergio P. & Messina,Julian, 2017. "Ageing poorly? : accounting for the decline in earnings inequality in Brazil, 1995-2012," Policy Research Working Paper Series 8018, The World Bank.
    17. Atencio,Andrea & Posadas,Josefina, 2015. "Gender gap in pay in the Russian Federation : twenty years later, still a concern," Policy Research Working Paper Series 7407, The World Bank.
    18. Joanna Landmesser, 2016. "Decomposition of differences In income distributions Using quantile regression," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 17(2), pages 331-348, June.
    19. Christl, Michael & Köppl-Turyna, Monika & Gnan, Phillipp, 2017. "Wage Differences Between Immigrants and Natives in Austria: The Role of Literacy Skills," GLO Discussion Paper Series 145, Global Labor Organization (GLO).
    20. Ken Yamada & Daiji Kawaguchi, 2015. "The changing and unchanged nature of inequality and seniority in Japan," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 13(1), pages 129-153, March.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ese:cempwp:cempa9-25. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Jonathan Nears (email available below). General contact details of provider: https://edirc.repec.org/data/rcessuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.