IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2604.01363.html

Crashing Waves vs. Rising Tides: Preliminary Findings on AI Automation from Thousands of Worker Evaluations of Labor Market Tasks

Author

Listed:
  • Matthias Mertens
  • Adam Kuzee
  • Brittany S. Harris
  • Harry Lyu
  • Wensu Li
  • Jonathan Rosenfeld
  • Meiri Anto
  • Martin Fleming
  • Neil Thompson

Abstract

We propose that AI automation is a continuum between: (i) crashing waves where AI capabilities surge abruptly over small sets of tasks, and (ii) rising tides where the increase in AI capabilities is more continuous and broad-based. We test for these effects in preliminary evidence from an ongoing evaluation of AI capabilities across over 3,000 broad-based tasks derived from the U.S. Department of Labor O*NET categorization that are text-based and thus LLM-addressable. Based on more than 17,000 evaluations by workers from these jobs, we find little evidence of crashing waves (in contrast to recent work by METR), but substantial evidence that rising tides are the primary form of AI automation. AI performance is high and improving rapidly across a wide range of tasks. We estimate that, in 2024-Q2, AI models successfully complete tasks that take humans approximately 3-4 hours with about a 50% success rate, increasing to about 65% by 2025-Q3. If recent trends in AI capability growth persist, this pace of AI improvement implies that LLMs will be able to complete most text-related tasks with success rates of, on average, 80%-95% by 2029 at a minimally sufficient quality level. Achieving near-perfect success rates at this quality level or comparable success rates at superior quality would require several additional years. These AI capability improvements would impact the economy and labor market as organizations adopt AI, which could have a substantially longer timeline.

Suggested Citation

  • Matthias Mertens & Adam Kuzee & Brittany S. Harris & Harry Lyu & Wensu Li & Jonathan Rosenfeld & Meiri Anto & Martin Fleming & Neil Thompson, 2026. "Crashing Waves vs. Rising Tides: Preliminary Findings on AI Automation from Thousands of Worker Evaluations of Labor Market Tasks," Papers 2604.01363, arXiv.org.
  • Handle: RePEc:arx:papers:2604.01363
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2604.01363
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Kristina McElheran & J. Frank Li & Erik Brynjolfsson & Zachary Kroff & Emin Dinlersoz & Lucia Foster & Nikolas Zolas, 2024. "AI adoption in America: Who, what, and where," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 33(2), pages 375-415, March.
    2. Kiefer, Nicholas M, 1988. "Economic Duration Data and Hazard Functions," Journal of Economic Literature, American Economic Association, vol. 26(2), pages 646-679, June.
    3. Jenkins, Stephen P, 1995. "Easy Estimation Methods for Discrete-Time Duration Models," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 57(1), pages 129-138, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Burton, Michael P. & Rigby, Dan & Young, Trevor, 2003. "Modelling the adoption of organic horticultural technology in the UK using Duration Analysis," Australian Journal of Agricultural and Resource Economics, Australian Agricultural and Resource Economics Society, vol. 47(01), pages 1-26, March.
    2. Castro, Vítor, 2010. "The duration of economic expansions and recessions: More than duration dependence," Journal of Macroeconomics, Elsevier, vol. 32(1), pages 347-365, March.
    3. Agnello, Luca & Castro, Vitor & Sousa, Ricardo M., 2013. "What determines the duration of a fiscal consolidation program?," Journal of International Money and Finance, Elsevier, vol. 37(C), pages 113-134.
    4. Paul Brenton & Christian Saborowski & Erik von Uexkull, 2014. "What Explains the Low Survival Rate of Developing Country Export Flows?," World Scientific Book Chapters, in: INTERNATIONAL TRADE, DISTRIBUTION AND DEVELOPMENT Empirical Studies of Trade Policies, chapter 17, pages 347-372, World Scientific Publishing Co. Pte. Ltd..
    5. Pişkin, Erhan, 2017. "Türkiye İhracatının Ölüm-Kalım Meselesi [The Matter of Survival for Turkish Exports]," MPRA Paper 81459, University Library of Munich, Germany.
    6. Felipe Morandé Lavín & Mauricio Tejada, 2008. "Price Stickiness in Emerging Economies: Empirical Evidence for Four Latin-American Countries," Working Papers wp286, University of Chile, Department of Economics.
    7. William Liu, 2023. "A Theory Guide to Using Control Functions to Instrument Hazard Models," Papers 2312.03165, arXiv.org.
    8. Dragana Djurdjevic, 2005. "Unemployment and Under-Employment: The Case of Switzerland," Swiss Journal of Economics and Statistics (SJES), Swiss Society of Economics and Statistics (SSES), vol. 141(I), pages 23-70, March.
    9. Ana M. Fernandes & Caroline Paunov, 2015. "The Risks of Innovation: Are Innovating Firms Less Likely to Die?," The Review of Economics and Statistics, MIT Press, vol. 97(3), pages 638-653, July.
    10. Agnello, Luca & Castro, Vítor & Sousa, Ricardo M., 2021. "On the duration of sovereign ratings cycle phases," Journal of Economic Behavior & Organization, Elsevier, vol. 182(C), pages 512-526.
    11. Kaiser, Ulrich & Szczesny, Andrea, 2000. "Einfache ökonometrische Verfahren für die Kreditrisikomessung," CoFE Discussion Papers 00/28, University of Konstanz, Center of Finance and Econometrics (CoFE).
    12. María José Moral Rincón, 1998. "La retirada de automóviles en España: Un aplicación de modelos de duración," Investigaciones Economicas, Fundación SEPI, vol. 22(2), pages 225-258, May.
    13. Smythe Donald J., 2008. "Transaction Costs, Neighborhood Effects, and the Diffusion of the Uniform Sales Act, 1906-47," Review of Law & Economics, De Gruyter, vol. 4(1), pages 341-372, October.
    14. Bachmann, Ronald & Baumgarten, Daniel & Stiebale, Joel, 2011. "Cross-border Investment, Heterogeneous Workers, and Employment Security – Evidence from Germany," Ruhr Economic Papers 268, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
    15. Stefan Buehler & Christian Kaiser & Franz Jaeger, 2012. "The geographic determinants of bankruptcy: evidence from Switzerland," Small Business Economics, Springer, vol. 39(1), pages 231-251, July.
    16. Luca Agnello & Vitor Castro & Ricardo Sousa, 2019. "The Benevolence of Time, Sound Macroeconomic Environment and Governance Quality on the Duration of Sovereign Ratings Phases," Working Papers 34, European Stability Mechanism.
    17. Agnello, Luca & Castro, Vítor & Hammoudeh, Shawkat & Sousa, Ricardo M., 2020. "Global factors, uncertainty, weather conditions and energy prices: On the drivers of the duration of commodity price cycle phases," Energy Economics, Elsevier, vol. 90(C).
    18. Zhang, Dongyang & Zheng, Wenping & Ning, Lutao, 2018. "Does innovation facilitate firm survival? Evidence from Chinese high-tech firms," Economic Modelling, Elsevier, vol. 75(C), pages 458-468.
    19. Khataza, Robertson R.B. & Doole, Graeme J. & Kragt, Marit E. & Hailu, Atakelty, 2018. "Information acquisition, learning and the adoption of conservation agriculture in Malawi: A discrete-time duration analysis," Technological Forecasting and Social Change, Elsevier, vol. 132(C), pages 299-307.
    20. Steffen Jenner & Lotte Ovaere & Stephan Schindele, 2013. "The Impact of Private Interest Contributions on RPS Adoption," Economics and Politics, Wiley Blackwell, vol. 25(3), pages 411-423, November.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2604.01363. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.