IDEAS home Printed from https://ideas.repec.org/a/kap/poprpr/v42y2023i4d10.1007_s11113-023-09795-x.html
   My bibliography  Save this article

Boosted Regression Trees for Small-Area Population Forecasting

Author

Listed:
  • Jack Baker

    (Farmers Life)

  • David Swanson

    (University of California Riverside
    University of Washington)

  • Jeff Tayman

    (University of California San Diego)

Abstract

Small-area population forecasting, such as the forecasting of age/gender groupings at the level of US Census Tracts, is challenged by thorny issues including (1) small population sizes, (2) frequent and sometimes directionally opposing shifts in population dynamics between censuses, (3) data availability, and (4) the ongoing evolution of the US census geographies. It is, therefore, not surprising that evaluation studies suggest wide-ranging forecast errors. Estimates vary between lows between 10% and 20% and highs sometimes exceeding 100% within any given age/gender interval. Despite its successes, only recently have population forecasters begun to explore the possibilities presented by machine learning. Using 1990 and 2000 census data, we develop 10-year age/gender-structured 2010 population forecasts for 50,965 census tracts in the U.S. using a well-known machine learning technique: boosted regression trees. Using standard ex post facto measures of forecast error (MAPE, MALPE, and MAPE-R), we demonstrate that forecasts based on “out-of-the-box” boosted regression trees have greater accuracy and produce fewer and less extreme outliers than comparison forecasts produced by the Hamilton-Perry method (reported in Baker et al. in Population Res Policy Rev 40:1341–1354, 2021. https://doi.org/10.1007/s11113-020-09601-y ).

Suggested Citation

  • Jack Baker & David Swanson & Jeff Tayman, 2023. "Boosted Regression Trees for Small-Area Population Forecasting," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 42(4), pages 1-24, August.
  • Handle: RePEc:kap:poprpr:v:42:y:2023:i:4:d:10.1007_s11113-023-09795-x
    DOI: 10.1007/s11113-023-09795-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11113-023-09795-x
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11113-023-09795-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Philip Rees & Paul Norman & Dominic Brown, 2004. "A framework for progressively improving small area population estimates," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 167(1), pages 5-36, February.
    2. David Swanson & Jeff Tayman & Charles Barr, 2000. "A note on the measurement of accuracy for subnational demographic estimates," Demography, Springer;Population Association of America (PAA), vol. 37(2), pages 193-201, May.
    3. Jack Baker & David Swanson & Jeff Tayman, 2021. "The Accuracy of Hamilton–Perry Population Projections for Census Tracts in the United States," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 40(6), pages 1341-1354, December.
    4. Tiago M. Fragoso & Wesley Bertoli & Francisco Louzada, 2018. "Bayesian Model Averaging: A Systematic Review and Conceptual Classification," International Statistical Review, International Statistical Institute, vol. 86(1), pages 1-28, April.
    5. Robert Tibshirani & Guenther Walther & Trevor Hastie, 2001. "Estimating the number of clusters in a data set via the gap statistic," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(2), pages 411-423.
    6. Jeff Tayman & Stanley Smith & Stefan Rayer, 2011. "Evaluating Population Forecast Accuracy: A Regression Approach Using County Data," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 30(2), pages 235-262, April.
    7. Stefan Rayer & Stanley Smith, 2014. "Population Projections by Age for Florida and its Counties: Assessing Accuracy and the Impact of Adjustments," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 33(5), pages 747-770, October.
    8. Tom Wilson, 2016. "Evaluation of Alternative Cohort-Component Models for Local Area Population Forecasts," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 35(2), pages 241-261, April.
    9. Stanley Smith & Jeff Tayman, 2003. "An evaluation of population projections by age," Demography, Springer;Population Association of America (PAA), vol. 40(4), pages 741-757, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jack Baker & David Swanson & Jeff Tayman, 2021. "The Accuracy of Hamilton–Perry Population Projections for Census Tracts in the United States," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 40(6), pages 1341-1354, December.
    2. Jeff Tayman & David A. Swanson & Jack Baker, 2021. "Using Synthetic Adjustments and Controlling to Improve County Population Forecasts from the Hamilton–Perry Method," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 40(6), pages 1355-1383, December.
    3. Tom Wilson & Irina Grossman & Monica Alexander & Phil Rees & Jeromey Temple, 2022. "Methods for Small Area Population Forecasts: State-of-the-Art and Research Needs," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(3), pages 865-898, June.
    4. Tom Wilson, 2022. "Preparing local area population forecasts using a bi-regional cohort-component model without the need for local migration data," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 46(32), pages 919-956.
    5. Jeff Tayman & David A. Swanson, 2017. "Using modified cohort change and child-woman ratios in the Hamilton–Perry forecasting method," Journal of Population Research, Springer, vol. 34(3), pages 209-231, September.
    6. Philip Rees & Tom Wilson, 2023. "Accuracy of Local Authority Population Forecasts Produced by a New Minimal Data Model: A Case Study of England," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 42(6), pages 1-30, December.
    7. Ma, Lu & Srinivasan, Sivaramakrishnan, 2016. "An empirical assessment of factors affecting the accuracy of target-year synthetic populations," Transportation Research Part A: Policy and Practice, Elsevier, vol. 85(C), pages 247-264.
    8. Tom Wilson & Huw Brokensha & Francisco Rowe & Ludi Simpson, 2018. "Insights from the Evaluation of Past Local Area Population Forecasts," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 37(1), pages 137-155, February.
    9. Thiemo Fetzer & Samuel Marden, 2017. "Take What You Can: Property Rights, Contestability and Conflict," Economic Journal, Royal Economic Society, vol. 0(601), pages 757-783, May.
    10. Daniel Agness & Travis Baseler & Sylvain Chassang & Pascaline Dupas & Erik Snowberg, 2022. "Valuing the Time of the Self-Employed," Working Papers 2022-2, Princeton University. Economics Department..
    11. Roland Brown & Yingling Fan & Kirti Das & Julian Wolfson, 2021. "Iterated multisource exchangeability models for individualized inference with an application to mobile sensor data," Biometrics, The International Biometric Society, vol. 77(2), pages 401-412, June.
    12. Orietta Nicolis & Jean Paul Maidana & Fabian Contreras & Danilo Leal, 2024. "Analyzing the Impact of COVID-19 on Economic Sustainability: A Clustering Approach," Sustainability, MDPI, vol. 16(4), pages 1-30, February.
    13. Li, Pai-Ling & Chiou, Jeng-Min, 2011. "Identifying cluster number for subspace projected functional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2090-2103, June.
    14. Yaeji Lim & Hee-Seok Oh & Ying Kuen Cheung, 2019. "Multiscale Clustering for Functional Data," Journal of Classification, Springer;The Classification Society, vol. 36(2), pages 368-391, July.
    15. Forzani, Liliana & Gieco, Antonella & Tolmasky, Carlos, 2017. "Likelihood ratio test for partial sphericity in high and ultra-high dimensions," Journal of Multivariate Analysis, Elsevier, vol. 159(C), pages 18-38.
    16. Yujia Li & Xiangrui Zeng & Chien‐Wei Lin & George C. Tseng, 2022. "Simultaneous estimation of cluster number and feature sparsity in high‐dimensional cluster analysis," Biometrics, The International Biometric Society, vol. 78(2), pages 574-585, June.
    17. Vojtech Blazek & Michal Petruzela & Tomas Vantuch & Zdenek Slanina & Stanislav Mišák & Wojciech Walendziuk, 2020. "The Estimation of the Influence of Household Appliances on the Power Quality in a Microgrid System," Energies, MDPI, vol. 13(17), pages 1-21, August.
    18. Emanuel Kopp, 2018. "Determinants of U.S. Business Investment," IMF Working Papers 2018/139, International Monetary Fund.
    19. Caruso, Germán & Scartascini, Carlos & Tommasi, Mariano, 2015. "Are we all playing the same game? The economic effects of constitutions depend on the degree of institutionalization," European Journal of Political Economy, Elsevier, vol. 38(C), pages 212-228.
    20. Michael P. Cameron & William Cochrane, 2015. "Using Land-Use Modelling to Statistically Downscale Population Projections to Small Areas," Working Papers in Economics 15/12, University of Waikato.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:poprpr:v:42:y:2023:i:4:d:10.1007_s11113-023-09795-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.