IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2506.01874.html

Life Sequence Transformer: Generative Modelling for Counterfactual Simulation

Author

Listed:
  • Alberto Cabezas
  • Carlotta Montorsi

Abstract

Social sciences rely on counterfactual analysis using surveys and administrative data, generally depending on strong assumptions or the existence of suitable control groups, to evaluate policy interventions and estimate causal effects. We propose a novel approach that leverages the Transformer architecture to simulate counterfactual life trajectories from large-scale administrative records. Our contributions are twofold: a novel encoding method that transforms longitudinal administrative data into sequences, and a generative model tailored to life sequences with overlapping events across life domains. We test our method using data from the Istituto Nazionale di Previdenza Sociale (INPS), showing that it enables the realistic and coherent generation of life trajectories. This framework offers a scalable alternative to classical counterfactual identification strategies, such as difference-in-differences and synthetic controls, particularly in contexts where these methods are infeasible or their assumptions unverifiable. We validate the model's utility by comparing generated life trajectories against established findings from causal studies, demonstrating its potential to enrich labour market research and policy evaluation through individual-level simulations.

Suggested Citation

  • Alberto Cabezas & Carlotta Montorsi, 2025. "Life Sequence Transformer: Generative Modelling for Counterfactual Simulation," Papers 2506.01874, arXiv.org.
  • Handle: RePEc:arx:papers:2506.01874

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2506.01874
    File Function: Latest version
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jinglong Zhao, 2024. "Experimental Design For Causal Inference Through An Optimization Lens," Papers 2408.09607, arXiv.org, revised Aug 2024.
    2. Sokolov, Boris, 2025. "Causal Estimands for Policy Evaluation and Beyond," SocArXiv 4vtpk_v1, Center for Open Science.
    3. Viviana Celli, 2022. "Causal mediation analysis in economics: Objectives, assumptions, models," Journal of Economic Surveys, Wiley Blackwell, vol. 36(1), pages 214-234, February.
    4. Dong, Feng & Li, Yangfan & Li, Kun & Zhu, Jiao & Zheng, Lu, 2022. "Can smart city construction improve urban ecological total factor energy efficiency in China? Fresh evidence from generalized synthetic control method," Energy, Elsevier, vol. 241(C).
    5. KAMKOUM, Arnaud Cedric, 2023. "The Federal Reserve’s Response to the Global Financial Crisis and its Effects: An Interrupted Time-Series Analysis of the Impact of its Quantitative Easing Programs," Thesis Commons d7pvg, Center for Open Science.
    6. James J. Heckman, 1991. "Randomization and Social Policy Evaluation Revisited," NBER Technical Working Papers 0107, National Bureau of Economic Research, Inc.
    7. Mustapha Douch & Terence Huw Edwards, 2022. "The bilateral trade effects of announcement shocks: Brexit as a natural field experiment," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(2), pages 305-329, March.
    8. Neumark David, 2019. "The Econometrics and Economics of the Employment Effects of Minimum Wages: Getting from Known Unknowns to Known Knowns," German Economic Review, De Gruyter, vol. 20(3), pages 293-329, August.
    9. Dang, Le Phuong Xuan & Hoang, Viet-Ngu & Nghiem, Son Hong & Wilson, Clevo, 2023. "Social networks with organisational resource, generalised trust and informal loans: Evidence from rural Vietnam," Economic Analysis and Policy, Elsevier, vol. 77(C), pages 388-402.
    10. Aydemir, Abdurrahman B. & Kırdar, Murat G., 2017. "Quasi-experimental impact estimates of immigrant labor supply shocks: The role of treatment and comparison group matching and relative skill composition," European Economic Review, Elsevier, vol. 98(C), pages 282-315.
    11. Aomar Ibourk & Zakaria Elouaourti & Mohammed-Ali Bougzime, 2025. "Impact evaluation of the ‘IDMAJ’ wage subsidy program on employment quality in Morocco," International Journal of Economic Policy Studies, Springer, vol. 19(1), pages 135-158, February.
    12. Nathan Canen & Kristopher Ramsay, 2024. "Quantifying theory in politics: Identification, interpretation, and the role of structural methods," Journal of Theoretical Politics, vol. 36(4), pages 301-327, October.
    13. Damian Clarke & Daniel Pailañir & Susan Athey & Guido Imbens, 2023. "Synthetic Difference In Differences Estimation," Papers 2301.11859, arXiv.org, revised Feb 2023.
    14. Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
    15. Almer, Christian & Winkler, Ralph, 2017. "Analyzing the effectiveness of international environmental policies: The case of the Kyoto Protocol," Journal of Environmental Economics and Management, Elsevier, vol. 82(C), pages 125-151.
    16. Guido W. Imbens, 2022. "Causality in Econometrics: Choice vs Chance," Econometrica, Econometric Society, vol. 90(6), pages 2541-2566, November.
    17. Poulos, Jason & Albanese, Andrea & Mercatanti, Andrea & Li, Fan, 2021. "Retrospective Causal Inference via Matrix Completion, with an Evaluation of the Effect of European Integration on Cross-Border Employment," IZA Discussion Papers 14472, Institute of Labor Economics (IZA).
    18. Dale Belman & Paul Wolfson & Kritkorn Nawakitphaitoon, 2015. "Who Is Affected by the Minimum Wage?," Industrial Relations: A Journal of Economy and Society, Wiley Blackwell, vol. 54(4), pages 582-621, October.
    19. Nadler, Carl & Allegretto, Sylvia & Godoey, Anna & Reich, Michael, 2019. "Are Local Minimum Wages Too High? Working Paper #102-19," Institute for Research on Labor and Employment, Working Paper Series qt7xt8716f, Institute of Industrial Relations, UC Berkeley.
    20. Sandeep Devanatha Pillai & Brent Goldfarb & David Kirsch, 2024. "Lovely and likely: Using historical methods to improve inference to the best explanation in strategy," Strategic Management Journal, Wiley Blackwell, vol. 45(8), pages 1539-1566, August.

