IDEAS home Printed from https://ideas.repec.org/a/bla/jemstr/v31y2022i2p369-392.html
   My bibliography  Save this article

Reliance on science by inventors: Hybrid extraction of in‐text patent‐to‐article citations

Author

Listed:
  • Matt Marx
  • Aaron Fuegi

Abstract

We curate and characterize a complete set of citations from patents to scientific articles, including 16.8 million from the full text of USPTO and EPO patents. Combining hand‐tuned heuristics and the GROBID machine‐learning package, we achieve much higher performance than machine learning alone. Recall is evaluated with a set of 5939 randomly sampled, cross‐verified “known good” citations, which the authors have never seen. At 99.4% precision, we achieve recall rates of 78% for the full test set and 88% for references specified without mistakes. We compare these “in‐text” citations with those on the front page of patents. In‐text citations are more diverse temporally, geographically, and topically; moreover, they are less self‐referential and less likely to be copied from one patent to the next. In‐text citations have dropped from two‐thirds of all patent‐to‐article citations half a century ago to about one‐third today. In replicating two articles that use only front‐page citations, we show that failing to capture in‐text citations leads to understating the role of academic science in commercial invention. All patent‐to‐article citations, the known‐good test set, and the source code are available at http://relianceonscience.org.

Suggested Citation

  • Matt Marx & Aaron Fuegi, 2022. "Reliance on science by inventors: Hybrid extraction of in‐text patent‐to‐article citations," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 31(2), pages 369-392, April.
  • Handle: RePEc:bla:jemstr:v:31:y:2022:i:2:p:369-392
    DOI: 10.1111/jems.12455
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/jems.12455
    Download Restriction: no

    File URL: https://libkey.io/10.1111/jems.12455?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jasjit Singh & Matt Marx, 2013. "Geographic Constraints on Knowledge Spillovers: Political Borders vs. Spatial Proximity," Management Science, INFORMS, vol. 59(9), pages 2056-2078, September.
    2. Ashish Arora & Sharon Belenzon & Lia Sheer, 2017. "Back to Basics: Why do Firms Invest in Research?," NBER Working Papers 23187, National Bureau of Economic Research, Inc.
    3. Adam B. Jaffe & Manuel Trajtenberg & Rebecca Henderson, 1993. "Geographic Localization of Knowledge Spillovers as Evidenced by Patent Citations," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 108(3), pages 577-598.
    4. Sharon Belenzon & Mark Schankerman, 2013. "Spreading the Word: Geography, Policy, and Knowledge Spillovers," The Review of Economics and Statistics, MIT Press, vol. 95(3), pages 884-903, July.
    5. Bronwyn H. Hall & Adam B. Jaffe & Manuel Trajtenberg, 2001. "The NBER Patent Citation Data File: Lessons, Insights and Methodological Tools," NBER Working Papers 8498, National Bureau of Economic Research, Inc.
    6. Peter Thompson & Melanie Fox-Kean, 2005. "Patent Citations and the Geography of Knowledge Spillovers: A Reassessment: Reply," American Economic Review, American Economic Association, vol. 95(1), pages 465-466, March.
    7. Peter Thompson & Melanie Fox-Kean, 2005. "Patent Citations and the Geography of Knowledge Spillovers: A Reassessment," American Economic Review, American Economic Association, vol. 95(1), pages 450-460, March.
    8. Peter Thompson, 2006. "Patent Citations and the Geography of Knowledge Spillovers: Evidence from Inventor- and Examiner-added Citations," The Review of Economics and Statistics, MIT Press, vol. 88(2), pages 383-388, May.
    9. Michael Roach & Wesley M. Cohen, 2013. "Lens or Prism? Patent Citations as a Measure of Knowledge Flows from Public Research," Management Science, INFORMS, vol. 59(2), pages 504-525, October.
    10. Lee Fleming & Olav Sorenson, 2004. "Science as a map in technological search," Strategic Management Journal, Wiley Blackwell, vol. 25(8‐9), pages 909-928, August.
    11. Felix Poege & Dietmar Harhoff & Fabian Gaessler & Stefano Baruffaldi, 2019. "Science Quality and the Value of Inventions," Papers 1903.05020, arXiv.org, revised Apr 2019.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Cesare Righi & Davide Cannito & Theodor Vladasel, 2023. "Continuing Patent Applications at the USPTO," Working Papers 1382, Barcelona School of Economics.
    2. Righi, Cesare & Cannito, Davide & Vladasel, Theodor, 2023. "Continuing patent applications at the USPTO," Research Policy, Elsevier, vol. 52(4).
    3. Cesare Righi & Davide Cannito & Theodor Vladasel, 2023. "Continuing patent applications at the USPTO," Economics Working Papers 1855, Department of Economics and Business, Universitat Pompeu Fabra.
    4. de Rassenfosse, Gaétan & Palangkaraya, Alfons, 2023. "Do patent pledges accelerate innovation?," Research Policy, Elsevier, vol. 52(5).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Carlino, Gerald & Kerr, William R., 2015. "Agglomeration and Innovation," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 349-404, Elsevier.
    2. Keith Head & Yao Amber Li & Asier Minondo, 2019. "Geography, Ties, and Knowledge Flows: Evidence from Citations in Mathematics," The Review of Economics and Statistics, MIT Press, vol. 101(4), pages 713-727, October.
    3. Nivedita Mukherji & Jonathan Silberman, 2021. "Knowledge flows between universities and industry: the impact of distance, technological compatibility, and the ability to diffuse knowledge," The Journal of Technology Transfer, Springer, vol. 46(1), pages 223-257, February.
    4. William R. Kerr & Frederic Robert-Nicoud, 2020. "Tech Clusters," Journal of Economic Perspectives, American Economic Association, vol. 34(3), pages 50-76, Summer.
    5. Li, Yao Amber, 2014. "Borders and distance in knowledge spillovers: Dying over time or dying with age?—Evidence from patent citations," European Economic Review, Elsevier, vol. 71(C), pages 152-172.
    6. Adam B. Jaffe & Gaétan de Rassenfosse, 2017. "Patent citation data in social science research: Overview and best practices," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(6), pages 1360-1374, June.
    7. Po‐Hsuan Hsu & Hai‐Ping Hui & Hsiao‐Hui Lee & Kevin Tseng, 2022. "Supply chain technology spillover, customer concentration, and product invention," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 31(2), pages 393-417, April.
    8. Fernández, Ana María & Ferrándiz, Esther & Medina, Jennifer, 2022. "The diffusion of energy technologies. Evidence from renewable, fossil, and nuclear energy patents," Technological Forecasting and Social Change, Elsevier, vol. 178(C).
    9. Christian Catalini, 2018. "Microgeography and the Direction of Inventive Activity," Management Science, INFORMS, vol. 64(9), pages 4348-4364, September.
    10. Forman, Chris & van Zeebroeck, Nicolas, 2019. "Digital technology adoption and knowledge flows within firms: Can the Internet overcome geographic and technological distance?," Research Policy, Elsevier, vol. 48(8), pages 1-1.
    11. Watzinger, Martin & Schnitzer, Monika, 2019. "Standing on the Shoulders of Science," Rationality and Competition Discussion Paper Series 215, CRC TRR 190 Rationality and Competition.
    12. William R. Kerr, 2010. "Breakthrough Inventions and Migrating Clusters of Innovation," NBER Chapters, in: Cities and Entrepreneurship, National Bureau of Economic Research, Inc.
    13. William R. Kerr & Scott Duke Kominers, 2015. "Agglomerative Forces and Cluster Shapes," The Review of Economics and Statistics, MIT Press, vol. 97(4), pages 877-899, October.
    14. Stefano Breschi & Francesco Lissoni & Gianluca Tarasconi, 2014. "Inventor Data for Research on Migration and Innovation: A Survey and a Pilot," WIPO Economic Research Working Papers 17, World Intellectual Property Organization - Economics and Statistics Division.
    15. Matt Marx & Aaron Fuegi, 2020. "Reliance on science: Worldwide front‐page patent citations to scientific articles," Strategic Management Journal, Wiley Blackwell, vol. 41(9), pages 1572-1594, September.
    16. Edward L. Glaeser & William R. Kerr, 2009. "Local Industrial Conditions and Entrepreneurship: How Much of the Spatial Distribution Can We Explain?," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 18(3), pages 623-663, September.
    17. Heide Fier & Andreas Pyka, 2014. "Against the one-way-street: analyzing knowledge transfer from industry to science," The Journal of Technology Transfer, Springer, vol. 39(2), pages 219-246, April.
    18. Yasusada Murata & Ryo Nakajima & Ryosuke Okamoto & Ryuichi Tamura, 2014. "Localized Knowledge Spillovers and Patent Citations: A Distance-Based Approach," The Review of Economics and Statistics, MIT Press, vol. 96(5), pages 967-985, December.
    19. Joel Blit, 2017. "Learning remotely: R&D satellites, intra‐firm linkages, and knowledge sourcing," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 26(4), pages 757-781, December.
    20. Gonzalez-Uribe, Juanita, 2020. "Exchanges of innovation resources inside venture capital portfolios," LSE Research Online Documents on Economics 100924, London School of Economics and Political Science, LSE Library.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jemstr:v:31:y:2022:i:2:p:369-392. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.kellogg.northwestern.edu/research/journals/JEMS/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.