IDEAS home Printed from https://ideas.repec.org/a/kap/jtecht/v47y2022i2d10.1007_s10961-021-09900-2.html
   My bibliography  Save this article

Identifying artificial intelligence (AI) invention: a novel AI patent dataset

Author

Listed:
  • Alexander V. Giczy

    (U.S. Patent and Trademark Office
    Addx Corporation)

  • Nicholas A. Pairolero

    (U.S. Patent and Trademark Office)

  • Andrew A. Toole

    (U.S. Patent and Trademark Office
    Leibniz Centre for European Economic Research (ZEW))

Abstract

Artificial intelligence (AI) is an area of increasing scholarly and policy interest. To help researchers, policymakers, and the public, this paper describes a novel dataset identifying AI in over 13.2 million patents and pre-grant publications (PGPubs). The dataset, called the Artificial Intelligence Patent Dataset (AIPD), was constructed using machine learning models for each of eight AI component technologies covering areas such as natural language processing, AI hardware, and machine learning. The AIPD contains two data files, one identifying the patents and PGPubs predicted to contain AI and a second file containing the patent documents used to train the machine learning classification models. We also present several evaluation metrics based on manual review by patent examiners with focused expertise in AI, and show that our machine learning approach achieves state-of-the-art performance across existing alternatives in the literature. We believe releasing this dataset will strengthen policy formulation, encourage additional empirical work, and provide researchers with a common base for building empirical knowledge on the determinants and impacts of AI invention.

Suggested Citation

  • Alexander V. Giczy & Nicholas A. Pairolero & Andrew A. Toole, 2022. "Identifying artificial intelligence (AI) invention: a novel AI patent dataset," The Journal of Technology Transfer, Springer, vol. 47(2), pages 476-505, April.
  • Handle: RePEc:kap:jtecht:v:47:y:2022:i:2:d:10.1007_s10961-021-09900-2
    DOI: 10.1007/s10961-021-09900-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10961-021-09900-2
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10961-021-09900-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Crafts, Nicholas & Mills, Terence C., 2004. "Was 19th century British growth steam-powered?: the climacteric revisited," Explorations in Economic History, Elsevier, vol. 41(2), pages 156-171, April.
    2. Ashish Arora & Sharon Belenzon & Andrea Patacconi & Jungkyu Suh, 2020. "The Changing Structure of American Innovation: Some Cautionary Remarks for Economic Growth," Innovation Policy and the Economy, University of Chicago Press, vol. 20(1), pages 39-93.
    3. Nathan Rosenberg & Manuel Trajtenberg, 2009. "A General-Purpose Technology at Work: The Corliss Steam Engine in the Late-Nineteenth-Century United States," World Scientific Book Chapters, in: Nathan Rosenberg (ed.), Studies On Science And The Innovation Process Selected Works of Nathan Rosenberg, chapter 6, pages 97-135, World Scientific Publishing Co. Pte. Ltd..
    4. Nicholas Crafts, 2004. "Steam as a general purpose technology: A growth accounting perspective," Economic Journal, Royal Economic Society, vol. 114(495), pages 338-351, April.
    5. Atack, Jeremy & Bateman, Fred & Margo, Robert A., 2008. "Steam power, establishment size, and labor productivity growth in nineteenth century American manufacturing," Explorations in Economic History, Elsevier, vol. 45(2), pages 185-198, April.
    6. Ajay Agrawal & Joshua Gans & Avi Goldfarb, 2019. "The Economics of Artificial Intelligence: An Agenda," NBER Books, National Bureau of Economic Research, Inc, number agra-1, March.
    7. Manav Raj & Robert Seamans, 2018. "Artificial Intelligence, Labor, Productivity, and the Need for Firm-Level Data," NBER Chapters, in: The Economics of Artificial Intelligence: An Agenda, pages 553-565, National Bureau of Economic Research, Inc.
    8. Mario Benassi & Elena Grinza & Francesco Rentocchini, 2019. "The Rush for Patents in the Fourth Industrial Revolution: An Exploration of Patenting Activity at the European Patent Office," SPRU Working Paper Series 2019-12, SPRU - Science Policy Research Unit, University of Sussex Business School.
    9. Susanto Basu & John G. Fernald, 2008. "Information and communications technology as a general purpose technology: evidence from U.S. industry data," Economic Review, Federal Reserve Bank of San Francisco, pages 1-15.
    10. Edward Felten & Manav Raj & Robert Seamans, 2021. "Occupational, industry, and geographic exposure to artificial intelligence: A novel dataset and its potential uses," Strategic Management Journal, Wiley Blackwell, vol. 42(12), pages 2195-2217, December.
    11. Jovanovic, Boyan & Rousseau, Peter L., 2005. "General Purpose Technologies," Handbook of Economic Growth, in: Philippe Aghion & Steven Durlauf (ed.), Handbook of Economic Growth, edition 1, volume 1, chapter 18, pages 1181-1224, Elsevier.
    12. Stefano Baruffaldi & Brigitte van Beuzekom & Hélène Dernis & Dietmar Harhoff & Nandan Rao & David Rosenfeld & Mariagrazia Squicciarini, 2020. "Identifying and measuring developments in artificial intelligence: Making the impossible possible," OECD Science, Technology and Industry Working Papers 2020/05, OECD Publishing.
    13. Stuart J.H. Graham & Alan C. Marco & Richard Miller, 2018. "The USPTO Patent Examination Research Dataset: A window on patent processing," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 27(3), pages 554-578, September.
    14. Jason Furman & Robert Seamans, 2019. "AI and the Economy," Innovation Policy and the Economy, University of Chicago Press, vol. 19(1), pages 161-191.
    15. Ashish Arora & Sharon Belenzon & Andrea Patacconi, 2018. "The decline of science in corporate R&D," Strategic Management Journal, Wiley Blackwell, vol. 39(1), pages 3-32, January.
    16. Fujii, Hidemichi & Managi, Shunsuke, 2018. "Trends and priority shifts in artificial intelligence technology invention: A global patent analysis," Economic Analysis and Policy, Elsevier, vol. 58(C), pages 60-69.
    17. Kim, Sukkoo, 2005. "Industrialization and urbanization: Did the steam engine contribute to the growth of cities in the United States?," Explorations in Economic History, Elsevier, vol. 42(4), pages 586-598, October.
    18. Sukkoo Kim, 2005. "Industrialization and Urbanization: Did the Steam Engine Contribute to the Growth of Cities in the United States?," NBER Working Papers 11206, National Bureau of Economic Research, Inc.
    19. Agrawal, Ajay & Gans, Joshua & Goldfarb, Avi (ed.), 2019. "The Economics of Artificial Intelligence," National Bureau of Economic Research Books, University of Chicago Press, number 9780226613338, December.
    20. Daniel F. Spulber, 2015. "How Patents Provide The Foundation Of The Market For Inventions," Journal of Competition Law and Economics, Oxford University Press, vol. 11(2), pages 271-316.
    21. Robert Seamans & Manav Raj, 2018. "AI, Labor, Productivity and the Need for Firm-Level Data," NBER Working Papers 24239, National Bureau of Economic Research, Inc.
    22. Giacomo Damioli & Vincent Van Roy & Daniel Vertesy, 2021. "The impact of artificial intelligence on labor productivity," Eurasian Business Review, Springer;Eurasia Business and Economics Society, vol. 11(1), pages 1-25, March.
    23. Michael Webb & Nick Short & Nicholas Bloom & Josh Lerner, 2018. "Some Facts of High-Tech Patenting," NBER Working Papers 24793, National Bureau of Economic Research, Inc.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chunyi Shan & Jun Wang & Yongming Zhu, 2023. "The Evolution of Artificial Intelligence in the Digital Economy: An Application of the Potential Dirichlet Allocation Model," Sustainability, MDPI, vol. 15(2), pages 1-12, January.
    2. Farhat Chowdhury & Albert N. Link & Martijn Hasselt, 2022. "Public support for research in artificial intelligence: a descriptive study of U.S. Department of Defense SBIR Projects," The Journal of Technology Transfer, Springer, vol. 47(3), pages 762-774, June.
    3. Haessler, Philipp & Giones, Ferran & Brem, Alexander, 2023. "The who and how of commercializing emerging technologies: A technology-focused review," Technovation, Elsevier, vol. 121(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Venturini, Francesco, 2022. "Intelligent technologies and productivity spillovers: Evidence from the Fourth Industrial Revolution," Journal of Economic Behavior & Organization, Elsevier, vol. 194(C), pages 220-243.
    2. Shih-tse Lo & Dhanoos Sutthiphisal, 2008. "Crossover Inventions And Knowledge Diffusion Of General Purpose Technologies? Evidence From The Electrical Technology," NBER Working Papers 14043, National Bureau of Economic Research, Inc.
    3. Mario Benassi & Elena Grinza & Francesco Rentocchini, 2020. "The rush for patents in the Fourth Industrial Revolution," Economia e Politica Industriale: Journal of Industrial and Business Economics, Springer;Associazione Amici di Economia e Politica Industriale, vol. 47(4), pages 559-588, December.
    4. Czarnitzki, Dirk & Fernández, Gastón P. & Rammer, Christian, 2023. "Artificial intelligence and firm-level productivity," Journal of Economic Behavior & Organization, Elsevier, vol. 211(C), pages 188-205.
    5. Alex W. Chernoff, 2021. "Firm heterogeneity, technology adoption and the spatial distribution of population: Theory and measurement," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 54(2), pages 475-521, May.
    6. Kemeny, Tom & Petralia, Sergio & Storper, Michael, 2022. "Disruptive innovation and spatial inequality," LSE Research Online Documents on Economics 115953, London School of Economics and Political Science, LSE Library.
    7. Damioli, G. & Van Roy, V. & Vertesy, D. & Vivarelli, M., 2021. "May AI revolution be labour-friendly? Some micro evidence from the supply side," GLO Discussion Paper Series 823, Global Labor Organization (GLO).
    8. Atack, Jeremy & Margo, Robert A. & Rhode, Paul W., 2022. "Industrialization and urbanization in nineteenth century America," Regional Science and Urban Economics, Elsevier, vol. 94(C).
    9. repec:gdk:wpaper:67 is not listed on IDEAS
    10. Damioli, Giacomo & Van Roy, Vincent & Vertesy, Daniel & Vivarelli, Marco, 2021. "Will the AI revolution be labour-friendly? Some micro evidence from the supply side," MERIT Working Papers 2021-016, United Nations University - Maastricht Economic and Social Research Institute on Innovation and Technology (MERIT).
    11. Rammer, Christian & Fernández, Gastón P. & Czarnitzki, Dirk, 2022. "Artificial intelligence and industrial innovation: Evidence from German firm-level data," Research Policy, Elsevier, vol. 51(7).
    12. Sergio Petralia, 2020. "GPTs and Growth: Evidence on the Technological Adoption of Electrical & Electronic Technologies in the 1920s," Papers in Evolutionary Economic Geography (PEEG) 2033, Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography, revised Aug 2020.
    13. Moshe A. Barach & Aseem Kaul & Ming D. Leung & Sibo Lu, 2019. "Strategic Redundancy in the Use of Big Data: Evidence from a Two-Sided Labor Market," Strategy Science, INFORMS, vol. 4(4), pages 298-322, December.
    14. Giacomo Damioli & Vincent Van Roy & Daniel Vertesy & Marco Vivarelli, 2021. "Detecting the labour-friendly nature of AI product innovation," DISCE - Quaderni del Dipartimento di Politica Economica dipe0017, Università Cattolica del Sacro Cuore, Dipartimenti e Istituti di Scienze Economiche (DISCE).
    15. Christian Rammer & Gastón P Fernández & Dirk Czarnitzki, 2021. "Artificial Intelligence and Industrial Innovation: Evidence from Firm-Level Data," Working Papers of Department of Economics, Leuven 674605, KU Leuven, Faculty of Economics and Business (FEB), Department of Economics, Leuven.
    16. Harald Edquist & Magnus Henrekson, 2006. "Technological Breakthroughs and Productivity Growth," Research in Economic History, in: Research in Economic History, pages 1-53, Emerald Group Publishing Limited.
    17. Giacomo Damioli & Vincent Van Roy & Daniel Vertesy, 2021. "The impact of artificial intelligence on labor productivity," Eurasian Business Review, Springer;Eurasia Business and Economics Society, vol. 11(1), pages 1-25, March.
    18. Burton A. Abrams & Jing Li & James G. Mulligan, 2012. "Capital Intensity and U.S. Country Population Growth during the Late Nineteenth Century," Working Papers 12-02, University of Delaware, Department of Economics.
    19. van Ark, Bart & de Vries, Klaas & Erumban, Abdul, 2021. "How To Not Miss A Productivity Revival Once Again," National Institute Economic Review, National Institute of Economic and Social Research, vol. 255, pages 9-24, February.
    20. Yang, Chih-Hai, 2022. "How Artificial Intelligence Technology Affects Productivity and Employment: Firm-level Evidence from Taiwan," Research Policy, Elsevier, vol. 51(6).
    21. Duch-Brown, Néstor & Gomez-Herrera, Estrella & Mueller-Langer, Frank & Tolan, Songül, 2022. "Market power and artificial intelligence work on online labour markets," Research Policy, Elsevier, vol. 51(3).

    More about this item

    Keywords

    Patent; Patent landscape; Artificial intelligence; AI; Machine learning; Patent dataset;
    All these keywords.

    JEL classification:

    • O31 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Innovation and Invention: Processes and Incentives
    • O34 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Intellectual Property and Intellectual Capital
    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics
    • L86 - Industrial Organization - - Industry Studies: Services - - - Information and Internet Services; Computer Software

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:jtecht:v:47:y:2022:i:2:d:10.1007_s10961-021-09900-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.