IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2202.08370.html

CAREER: A Foundation Model for Labor Sequence Data

Author

Listed:
  • Keyon Vafa
  • Emil Palikot
  • Tianyu Du
  • Ayush Kanodia
  • Susan Athey
  • David M. Blei

Abstract

Labor economists regularly analyze employment data by fitting predictive models to small, carefully constructed longitudinal survey datasets. Although machine learning methods offer promise for such problems, these survey datasets are too small to take advantage of them. In recent years large datasets of online resumes have also become available, providing data about the career trajectories of millions of individuals. However, standard econometric models cannot take advantage of their scale or incorporate them into the analysis of survey data. To this end we develop CAREER, a foundation model for job sequences. CAREER is first fit to large, passively-collected resume data and then fine-tuned to smaller, better-curated datasets for economic inferences. We fit CAREER to a dataset of 24 million job sequences from resumes, and adjust it on small longitudinal survey datasets. We find that CAREER forms accurate predictions of job sequences, outperforming econometric baselines on three widely-used economics datasets. We further find that CAREER can be used to form good predictions of other downstream variables. For example, incorporating CAREER into a wage model provides better predictions than the econometric models currently in use.

Suggested Citation

  • Keyon Vafa & Emil Palikot & Tianyu Du & Ayush Kanodia & Susan Athey & David M. Blei, 2022. "CAREER: A Foundation Model for Labor Sequence Data," Papers 2202.08370, arXiv.org, revised Feb 2024.
  • Handle: RePEc:arx:papers:2202.08370
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2202.08370
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Bernd Fitzenberger & Aderonke Osikominu & Robert Völter, 2008. "Get Training or Wait? Long-Run Employment Effects of Training Programs for the Unemployed in West Germany," Annals of Economics and Statistics, GENES, issue 91-92, pages 321-355.
    2. McCall, Brian P, 1990. "Occupational Matching: A Test of Sorts," Journal of Political Economy, University of Chicago Press, vol. 98(1), pages 45-69, February.
    3. Robert W. Fairlie & William A. Sundstrom, 1999. "The Emergence, Persistence, and Recent Widening of the Racial Unemployment Gap," ILR Review, Cornell University, ILR School, vol. 52(2), pages 252-270, January.
    4. Guido Matias Cortes, 2016. "Where Have the Middle-Wage Workers Gone? A Study of Polarization Using Panel Data," Journal of Labor Economics, University of Chicago Press, vol. 34(1), pages 63-105.
    5. David Hummels & Rasmus J?rgensen & Jakob Munch & Chong Xiang, 2014. "The Wage Effects of Offshoring: Evidence from Danish Matched Worker-Firm Data," American Economic Review, American Economic Association, vol. 104(6), pages 1597-1629, June.
    6. Fatih Guvenen & Burhan Kuruscu & Satoshi Tanaka & David Wiczer, 2020. "Multidimensional Skill Mismatch," American Economic Journal: Macroeconomics, American Economic Association, vol. 12(1), pages 210-244, January.
    7. Ruhose, Jens & Thomsen, Stephan L. & Weilage, Insa, 2019. "The benefits of adult learning: Work-related training, social capital, and earnings," Economics of Education Review, Elsevier, vol. 72(C), pages 166-186.
    8. repec:adr:anecst:y:2008:i:91-92:p:15 is not listed on IDEAS
    9. David H. Autor & David Dorn, 2013. "The Growth of Low-Skill Service Jobs and the Polarization of the US Labor Market," American Economic Review, American Economic Association, vol. 103(5), pages 1553-1597, August.
    10. Martin Henning & Rikard H Eriksson, 2021. "Labour market polarisation as a localised process: evidence from Sweden," Cambridge Journal of Regions, Economy and Society, Cambridge Political Economy Society, vol. 14(1), pages 69-91.
    11. Fane Groes & Philipp Kircher & Iourii Manovskii, 2015. "The U-Shapes of Occupational Mobility," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 82(2), pages 659-692.
    12. Carolyn Heinrich & Peter Mueser & Kenneth Troske & Kyung-Seong Jeon & Daver Kahvecioglu, 2013. "Do Public Employment and Training Programs Work?," IZA Journal of Labor Economics, Springer;Forschungsinstitut zur Zukunft der Arbeit GmbH (IZA), vol. 2(1), pages 1-23, December.
    13. Poterba, James M & Summers, Lawrence H, 1986. "Reporting Errors and Labor Market Dynamics," Econometrica, Econometric Society, vol. 54(6), pages 1319-1338, November.
    14. Schmidt, Peter & Strauss, Robert P, 1975. "The Prediction of Occupation Using Multiple Logit Models," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 16(2), pages 471-486, June.
    15. Robert E. Hall, 1972. "Turnover in the Labor Force," Brookings Papers on Economic Activity, Economic Studies Program, The Brookings Institution, vol. 3(3), pages 709-764.
    16. Neal, Derek, 1999. "The Complexity of Job Mobility among Young Men," Journal of Labor Economics, University of Chicago Press, vol. 17(2), pages 237-261, April.
    17. Randall S. Brown & Marilyn Moon & Barbara S. Zoloth, 1980. "Incorporating Occupational Attainment in Studies of Male-Female Earnings Differentials," Journal of Human Resources, University of Wisconsin Press, vol. 15(1), pages 3-28.
    18. Gueorgui Kambourov & Iourii Manovskii, 2008. "Rising Occupational And Industry Mobility In The United States: 1968-97," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 49(1), pages 41-79, February.
    19. Francine D. Blau & Lawrence M. Kahn, 2017. "The Gender Wage Gap: Extent, Trends, and Explanations," Journal of Economic Literature, American Economic Association, vol. 55(3), pages 789-865, September.
    20. Jana Stefanová Lauerová & Katherine Terrell, 2007. "What Drives Gender Differences in Unemployment?," Comparative Economic Studies, Palgrave Macmillan;Association for Comparative Economic Studies, vol. 49(1), pages 128-155, March.
    21. Fredrik Andersson & Harry J. Holzer & Julia I. Lane & David Rosenblum & Jeffrey Smith, 2024. "Does Federally Funded Job Training Work? Nonexperimental Estimates of WIA Training Impacts Using Longitudinal Data on Workers and Firms," Journal of Human Resources, University of Wisconsin Press, vol. 59(4), pages 1244-1283.
    22. Boskin, Michael J, 1974. "A Conditional Logit Model of Occupational Choice," Journal of Political Economy, University of Chicago Press, vol. 82(2), pages 389-398, Part I, M.
    23. Keane, Michael P & Wolpin, Kenneth I, 1997. "The Career Decisions of Young Men," Journal of Political Economy, University of Chicago Press, vol. 105(3), pages 473-522, June.
    24. Sharon Traiberman, 2019. "Occupations and Import Competition: Evidence from Denmark," American Economic Review, American Economic Association, vol. 109(12), pages 4260-4301, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alberto Cabezas & Carlotta Montorsi, 2025. "Life Sequence Transformer: Generative Modelling for Counterfactual Simulation," Papers 2506.01874, arXiv.org.
    2. Du, Tianyu & Kanodia, Ayush & Brunborg, Herman & Vafa, Keyon & Athey, Susan, 2024. "Labor-LLM: Language-Based Occupational Representations with Large Language Models," Research Papers 4188, Stanford University, Graduate School of Business.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christian vom Lehn & Cache Ellsworth & Zachary Kroff, 2022. "Reconciling Occupational Mobility in the Current Population Survey," Journal of Labor Economics, University of Chicago Press, vol. 40(4), pages 1005-1051.
    2. Nicolas A. Roys & Christopher R. Taber, 2019. "Skill Prices, Occupations, and Changes in the Wage Structure for Low Skilled Men," NBER Working Papers 26453, National Bureau of Economic Research, Inc.
    3. Genz, Sabrina & Schnabel, Claus, 2021. "Digging into the digital divide: Workers' exposure to digitalization and its consequences for individual employment," Discussion Papers 118, Friedrich-Alexander University Erlangen-Nuremberg, Chair of Labour and Regional Economics.
    4. Stijepic Damir, 2020. "Job Mobility and Sorting: Theory and Evidence," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 240(1), pages 19-49, February.
    5. Ronald Bachmann & Peggy Bechara & Christina Vonnahme, 2020. "Occupational Mobility in Europe: Extent, Determinants and Consequences," De Economist, Springer, vol. 168(1), pages 79-108, March.
    6. Wang, Xiupeng, 2020. "Labor market polarization in Britain and Germany: A cross-national comparison using longitudinal household data," Labour Economics, Elsevier, vol. 65(C).
    7. Ek, Simon, 2025. "Worker specialization and the consequences of occupational decline," Working Paper Series 2025:7, IFAU - Institute for Evaluation of Labour Market and Education Policy.
    8. Carl Sanders, 2012. "Skill Uncertainty, Skill Accumulation, and Occupational Choice," 2012 Meeting Papers 633, Society for Economic Dynamics.
    9. Fitzenberger, Bernd & Licklederer, Stefanie & Zwiener, Hanna, 2015. "Mobility across firms and occupations among graduates from apprenticeship," Labour Economics, Elsevier, vol. 34(C), pages 138-151.
    10. Stijepic Damir, 2020. "Job Mobility and Sorting: Theory and Evidence," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 240(1), pages 19-49, February.
    11. Francesco Roncone, 2025. "Work-Hour Instability, Occupational Mobility and Gender," Working Papers wp1201, Dipartimento Scienze Economiche, Universita' di Bologna.
    12. Papageorgiou, Theodore, 2018. "Large firms and within firm occupational reallocation," Journal of Economic Theory, Elsevier, vol. 174(C), pages 184-223.
    13. Guido Matias Cortes & Giovanni Gallipoli, 2018. "The Costs of Occupational Mobility: An Aggregate Analysis," Journal of the European Economic Association, European Economic Association, vol. 16(2), pages 275-315.
    14. Baird, Matthew D. & Engberg, John & Gutierrez, Italo A., 2022. "RCT evidence on differential impact of US job training programmes by pre-training employment status," Labour Economics, Elsevier, vol. 75(C).
    15. Demiralp, Berna, 2011. "Occupational self-selection in a labor market with moral hazard," European Economic Review, Elsevier, vol. 55(4), pages 497-519, May.
    16. Pedros Silos & Eric Smith, 2015. "Human Capital Portfolios," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 18(3), pages 635-652, July.
    17. Maximiliano Dvorkin, 2021. "International trade and labor reallocation: misclassification errors, mobility, and switching costs," Working Papers 2021-014, Federal Reserve Bank of St. Louis, revised Jun 2024.
    18. Theodore Papageorgiou, 2022. "Occupational Matching and Cities," American Economic Journal: Macroeconomics, American Economic Association, vol. 14(3), pages 82-132, July.
    19. Michael J. Böhm, 2020. "The price of polarization: Estimating task prices under routine‐biased technical change," Quantitative Economics, Econometric Society, vol. 11(2), pages 761-799, May.
    20. Ma, Jie, 2020. "High skilled immigration and the market for skilled labor: The role of occupational choice," Labour Economics, Elsevier, vol. 63(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2202.08370. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.