IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2508.12514.html
   My bibliography  Save this paper

Reconstructing Subnational Labor Indicators in Colombia: An Integrated Machine and Deep Learning Approach

Author

Listed:
  • Jaime Vera-Jaramillo

Abstract

This study proposes a unified multi-stage framework to reconstruct consistent monthly and annual labor indicators for all 33 Colombian departments from 1993 to 2025. The approach integrates temporal disaggregation, time-series splicing and interpolation, statistical learning, and institutional covariates to estimate seven key variables: employment, unemployment, labor force participation (PEA), inactivity, working-age population (PET), total population, and informality rate, including in regions without direct survey coverage. The framework enforces labor accounting identities, scales results to demographic projections, and aligns all estimates with national benchmarks to ensure internal coherence. Validation against official departmental GEIH aggregates and city-level informality data for the 23 metropolitan areas yields in-sample Mean Absolute Percentage Errors (MAPEs) below 2.3% across indicators, confirming strong predictive performance. To our knowledge, this is the first dataset to provide spatially exhaustive and temporally consistent monthly labor measures for Colombia. By incorporating both quantitative and qualitative dimensions of employment, the panel enhances the empirical foundation for analysing long-term labor market dynamics, identifying regional disparities, and designing targeted policy interventions.

Suggested Citation

  • Jaime Vera-Jaramillo, 2025. "Reconstructing Subnational Labor Indicators in Colombia: An Integrated Machine and Deep Learning Approach," Papers 2508.12514, arXiv.org, revised Aug 2025.
  • Handle: RePEc:arx:papers:2508.12514
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2508.12514
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. D’Amuri, Francesco & Marcucci, Juri, 2017. "The predictive power of Google searches in forecasting US unemployment," International Journal of Forecasting, Elsevier, vol. 33(4), pages 801-816.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Benchimol, Jonathan & Bounader, Lahcen & Dotta, Mario, 2025. "Estimating Behavioral Inattention," Journal of Economic Behavior & Organization, Elsevier, vol. 236(C).
    2. Bentzen, Jeanet Sinding, 2021. "In crisis, we pray: Religiosity and the COVID-19 pandemic," Journal of Economic Behavior & Organization, Elsevier, vol. 192(C), pages 541-583.
    3. Matteo Iacopini & Carlo R.M.A. Santagiustina, 2021. "Filtering the intensity of public concern from social media count data with jumps," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1283-1302, October.
    4. Ferrara, Laurent & Sheng, Xuguang Simon, 2022. "Guest editorial: Economic forecasting in times of COVID-19," International Journal of Forecasting, Elsevier, vol. 38(2), pages 527-528.
    5. Monge, Manuel & Claudio-Quiroga, Gloria & Poza, Carlos, 2024. "Chinese economic behavior in times of covid-19. A new leading economic indicator based on Google trends," International Economics, Elsevier, vol. 177(C).
    6. Laurent Ferrara & Anna Simoni, 2023. "When are Google Data Useful to Nowcast GDP? An Approach via Preselection and Shrinkage," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(4), pages 1188-1202, October.
    7. Cebrián, Eduardo & Domenech, Josep, 2024. "Addressing Google Trends inconsistencies," Technological Forecasting and Social Change, Elsevier, vol. 202(C).
    8. Neto, David, 2021. "Are Google searches making the Bitcoin market run amok? A tail event analysis," The North American Journal of Economics and Finance, Elsevier, vol. 57(C).
    9. Borgioli, Stefano & Gallo, Giampiero M. & Ongari, Chiara, 2024. "Financial returns, sentiment and market volatility. A dynamic assessment," Working Paper Series 2999, European Central Bank.
    10. Perroni, Carlo & Scharf, Kimberley & Talavera, Oleksandr & Vi, Linh, 2022. "Does online salience predict charitable giving? Evidence from SMS text donations," Journal of Economic Behavior & Organization, Elsevier, vol. 197(C), pages 134-149.
    11. Necmettin Alpay Koçak, 2020. "The Role of Ecb Speeches in Nowcasting German Gdp," European Financial and Accounting Journal, Prague University of Economics and Business, vol. 2020(2), pages 05-20.
    12. Dean Fantazzini, 2020. "Short-term forecasting of the COVID-19 pandemic using Google Trends data: Evidence from 158 countries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 33-54.
    13. Christian Conrad & Anessa Custovic & Eric Ghysels, 2018. "Long- and Short-Term Cryptocurrency Volatility Components: A GARCH-MIDAS Analysis," JRFM, MDPI, vol. 11(2), pages 1-12, May.
    14. Götz, Thomas B. & Knetsch, Thomas A., 2019. "Google data in bridge equation models for German GDP," International Journal of Forecasting, Elsevier, vol. 35(1), pages 45-66.
    15. Puhr, Harald & Müllner, Jakob, 2024. "Vox populi, vox dei: A concept and measure for grassroots socio-political risk using Google Trends," Journal of International Management, Elsevier, vol. 30(2).
    16. Massimiliano Marcellino & Dalibor Stevanovic, 2022. "The demand and supply of information about inflation," Working Papers 22-06, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management, revised Nov 2022.
    17. González-Fernández, Marcos & González-Velasco, Carmen, 2020. "An alternative approach to predicting bank credit risk in Europe with Google data," Finance Research Letters, Elsevier, vol. 35(C).
    18. Mikhaylov, Dmitry, 2023. "Macroeconomic Forecasting with the Use of News Data," Working Papers w20220250, Russian Presidential Academy of National Economy and Public Administration.
    19. Francesco Cusano & Giuseppe Marinelli & Stefano Piermattei, 2022. "Learning from revisions: an algorithm to detect errors in banks’ balance sheet statistical reporting," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(6), pages 4025-4059, December.
    20. Abay,Kibrom A. & Hirfrfot,Kibrom Tafere & Woldemichael,Andinet, 2020. "Winners and Losers from COVID-19 : Global Evidence from Google Search," Policy Research Working Paper Series 9268, The World Bank.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2508.12514. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.