IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0320298.html
   My bibliography  Save this article

Demographic forecast modelling using SSA-XGBoost for smart population management based on multi-sources data

Author

Listed:
  • Jin Wang
  • Shihan Ma
  • Qing Lv
  • Qiang Li

Abstract

Population prediction could provide effective data support for social and economic planning and decision-making, especially for the sub-national population forecasting accurately. In addition to realizing efficient smart population management, this research focuses primarily on the combination model for forecasting demographic data based on machine learning. As to the higher error of population forecasts due to high population density and mobility, a dynamic monitoring method based on mobile communication big data such as mobile phone signals is proposed, combined with more structurally stable traditional statistical data, it forms a multi-source dataset that possesses both accuracy and real-time characteristics. In the study, the Extreme Gradient Boosting tree (XGBoost) model is used to identify the base model to create a reliable predictive model for population dynamic monitoring. The sparrow search algorithm (SSA) is investigated to obtain more reasonable parameters of XGBoost to improve forecast accuracy. The combination model is verified based on the data of the 6th and 7th national population census and mobile phone signal data in Hebei Province, obtained the predicted data for mortality and migration, categorized by age and gender, for the following year. Subsequently, the research compared the performance of different metaheuristic algorithms and various gradient-boosting machine-learning models on the dataset. The SSA-XGBoost model demonstrates a better prediction performance in the demographic data forecast with better R2 0.9984 and a lower mean absolute error of 0.0002 and a mean squared error of 6.9184. The results of the comparative experiments and cross-validation show that the proposed predictive model can effectively forecast the demographic data for sub-national regions to realize smart population management.

Suggested Citation

  • Jin Wang & Shihan Ma & Qing Lv & Qiang Li, 2025. "Demographic forecast modelling using SSA-XGBoost for smart population management based on multi-sources data," PLOS ONE, Public Library of Science, vol. 20(6), pages 1-24, June.
  • Handle: RePEc:plo:pone00:0320298
    DOI: 10.1371/journal.pone.0320298
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0320298
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0320298&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0320298?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jeff Tayman, 2011. "Assessing Uncertainty in Small Area Forecasts: State of the Practice and Implementation Strategy," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 30(5), pages 781-800, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tom Wilson & Irina Grossman & Monica Alexander & Phil Rees & Jeromey Temple, 2022. "Methods for Small Area Population Forecasts: State-of-the-Art and Research Needs," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(3), pages 865-898, June.
    2. Michael P. Cameron & William Cochrane, 2015. "Using Land-Use Modelling to Statistically Downscale Population Projections to Small Areas," Working Papers in Economics 15/12, University of Waikato.
    3. Tom Wilson, 2022. "Preparing local area population forecasts using a bi-regional cohort-component model without the need for local migration data," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 46(32), pages 919-956.
    4. Wilson, Tom & Grossman, Irina & Temple, Jeromey, 2023. "Evaluation of the best M4 competition methods for small area population forecasting," International Journal of Forecasting, Elsevier, vol. 39(1), pages 110-122.
    5. Hana Sevcikova & Patrick Gerland & Adrian E. Raftery, 2018. "Probabilistic projection of subnational total fertility rates," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 38(60), pages 1843-1884.
    6. Tom Wilson, 2016. "Evaluation of Alternative Cohort-Component Models for Local Area Population Forecasts," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 35(2), pages 241-261, April.
    7. Philip Rees & Tom Wilson, 2023. "Accuracy of Local Authority Population Forecasts Produced by a New Minimal Data Model: A Case Study of England," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 42(6), pages 1-30, December.
    8. Richard S. Grip & Meghan L. Grip, 2020. "Using Multiple Methods to Provide Prediction Bands of K-12 Enrollment Projections," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 39(1), pages 1-22, February.
    9. Tom Wilson & Huw Brokensha & Francisco Rowe & Ludi Simpson, 2018. "Insights from the Evaluation of Past Local Area Population Forecasts," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 37(1), pages 137-155, February.
    10. Tom Wilson & Fiona Shalley, 2019. "Subnational population forecasts: Do users want to know about uncertainty?," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 41(13), pages 367-392.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0320298. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.