IDEAS home Printed from https://ideas.repec.org/p/uto/labeco/202502.html
   My bibliography  Save this paper

Addressing the identification of Critical Raw Material Patents Using Pretrained and Large Language Models

Author

Abstract

In modern technologies, critical raw materials (CRMs) have gained attention due to supply chain risks, environmental concerns, and their essential role in industries such as renewable energy, electric vehicles, and advanced electronics. However, identifying and classifying CRM-related patents, and thus technologies, remains challenging due to the lack of specific classification systems. Traditional approaches, such as keyword- based searches and Cooperative Patent Classification (CPC) and International Patent Classification (IPC) codes, suffer from inaccuracies due to evolving terminology, ambiguous context, as well as the inability in recognizing alternative material usage. This study proposes a novel methodology leveraging advanced natural language processing (NLP) tools to overcome these limitations. Our approach addresses two key objectives: (1) distinguishing between substitutable and non-substitutable CRMs in patent abstracts through the GPT-3.5-turbo-16k model and (2) identifying CRM- related patents via a fine-tuned BERT for Patents model. Our findings reveal distinct geographical, technological, and temporal patterns in CRM- related innovation, emphasizing the significance of NLP techniques in overcoming traditional classification challenges. This research offers policymakers and industry stakeholders valuable insights into CRM innovation trends, supporting strategic decision-making for sustainable resource management.

Suggested Citation

  • Manera, Maria & Fusillo, Fabrizio & Orsatti, Gianluca & Quatraro, Francesco, 2025. "Addressing the identification of Critical Raw Material Patents Using Pretrained and Large Language Models," Department of Economics and Statistics Cognetti de Martiis LEI & BRICK - Laboratory of Economics of Innovation "Franco Momigliano", Bureau of Research in Innovation, Complexity and Knowledge, Collegio 202502, University of Turin.
  • Handle: RePEc:uto:labeco:202502
    as

    Download full text from publisher

    File URL: https://www.est.unito.it/do/home.pl/Download?doc=/allegati/wp2025dip/wp_05_2025.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Eisenreich, Anja & Just, Julian & Gimenez-Jimenez, Daniela & Füller, Johann, 2024. "Revolution or inflated expectations? Exploring the impact of generative AI on ideation in a practical sustainability context," Technovation, Elsevier, vol. 138(C).
    2. Hache, Emmanuel & Seck, Gondia Sokhna & Simoen, Marine & Bonnet, Clément & Carcanague, Samuel, 2019. "Critical raw materials and transportation sector electrification: A detailed bottom-up analysis in world transport," Applied Energy, Elsevier, vol. 240(C), pages 6-25.
    3. Metzger, Philipp & Mendonça, Sandro & Silva, José A. & Damásio, Bruno, 2023. "Battery innovation and the Circular Economy: What are patents revealing?," Renewable Energy, Elsevier, vol. 209(C), pages 516-532.
    4. Francesco de Cunzo & Davide Consoli & Francois Perruchas & Angelica Sbardella, 2023. "Mapping Critical Raw Materials in Green Technologies," Papers in Evolutionary Economic Geography (PEEG) 2322, Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography, revised Dec 2023.
    5. Youngjae Choi & Sanghyun Park & Sungjoo Lee, 2021. "Identifying emerging technologies to envision a future innovation ecosystem: A machine learning approach to patent data," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5431-5476, July.
    6. Basberg, Bjorn L., 1987. "Patents and the measurement of technological change: A survey of the literature," Research Policy, Elsevier, vol. 16(2-4), pages 131-141, August.
    7. Chiarello, Filippo & Giordano, Vito & Spada, Irene & Barandoni, Simone & Fantoni, Gualtiero, 2024. "Future applications of generative large language models: A data-driven case study on ChatGPT," Technovation, Elsevier, vol. 133(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Archibugi, Daniele & Mariella, Vitantonio & Vezzani, Antonio, 2025. "What next? Nations in the technological race through the 2030," Technological Forecasting and Social Change, Elsevier, vol. 212(C).
    2. Fontana, Roberto & Nuvolari, Alessandro & Shimizu, Hiroshi & Vezzulli, Andrea, 2013. "Reassessing patent propensity: Evidence from a dataset of R&D awards, 1977–2004," Research Policy, Elsevier, vol. 42(10), pages 1780-1792.
    3. Bedford, Anna & Ma, Le & Ma, Nelson & Vojvoda, Kristina, 2022. "Australian innovation: Patent database construction and first evidence," Pacific-Basin Finance Journal, Elsevier, vol. 73(C).
    4. Ming Liu & Sumner LaCroix, 2011. "The Impact of Stronger Property Rights in Pharmaceuticals on Innovation in Developed and Developing Countries," Working Papers 201116, University of Hawaii at Manoa, Department of Economics.
    5. Hache, Emmanuel & Simoën, Marine & Seck, Gondia Sokhna & Bonnet, Clément & Jabberi, Aymen & Carcanague, Samuel, 2020. "The impact of future power generation on cement demand: An international and regional assessment based on climate scenarios," International Economics, Elsevier, vol. 163(C), pages 114-133.
    6. Robert M. Salomon & J. Myles Shaver, 2005. "Learning by Exporting: New Insights from Examining Firm Innovation," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 14(2), pages 431-460, June.
    7. Rebecca Henderson & Adam B. Jaffe & Manuel Trajtenberg, 1998. "Universities As A Source Of Commercial Technology: A Detailed Analysis Of University Patenting, 1965-1988," The Review of Economics and Statistics, MIT Press, vol. 80(1), pages 119-127, February.
    8. Waters, James, 2014. "Introduction of innovations during the 2007-8 financial crisis: US companies compared with universities," MPRA Paper 59016, University Library of Munich, Germany.
    9. Zander, Ivo, 1997. "Technological diversification in the multinational corporation--historical evolution and future prospects," Research Policy, Elsevier, vol. 26(2), pages 209-227, May.
    10. Inchae Park & Yujin Jeong & Byungun Yoon, 2017. "Analyzing the value of technology based on the differences of patent citations between applicants and examiners," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 665-691, May.
    11. Fareri, Silvia & Apreda, Riccardo & Mulas, Valentina & Alonso, Ruben, 2023. "The worker profiler: Assessing the digital skill gaps for enhancing energy efficiency in manufacturing," Technological Forecasting and Social Change, Elsevier, vol. 196(C).
    12. Magerman, Tom & Looy, Bart Van & Debackere, Koenraad, 2015. "Does involvement in patenting jeopardize one’s academic footprint? An analysis of patent-paper pairs in biotechnology," Research Policy, Elsevier, vol. 44(9), pages 1702-1713.
    13. Sanghoon Lee & Wonjoon Kim, 2017. "The knowledge network dynamics in a mobile ecosystem: a patent citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 717-742, May.
    14. Valle, Sandra & García, Francisco & Avella, Lucía, 2015. "Offshoring Intermediate Manufacturing: Boost or Hindrance to Firm Innovation?," Journal of International Management, Elsevier, vol. 21(2), pages 117-134.
    15. Michele Cincera, 2005. "Firms' productivity growth and R&D spillovers: An analysis of alternative technological proximity measures," Economics of Innovation and New Technology, Taylor & Francis Journals, vol. 14(8), pages 657-682.
    16. Karine Pellier, 2007. "Convergence, Patenting Activity and Geographic Spillovers: A Spatial Econometric Analysis for European Regions," Working Papers 07-14, LAMETA, Universtiy of Montpellier, revised Dec 2007.
    17. Katja Rost, 2006. "Der Einfluss von Erfindernetzwerken auf die Relevanz von Patenten," Schmalenbach Journal of Business Research, Springer, vol. 58(3), pages 363-389, May.
    18. Claude DIEBOLT & Karine PELLIER, 2018. "Patents in the Long Run: Theory, History and Statistics," Working Papers of BETA 2018-20, Bureau d'Economie Théorique et Appliquée, UDS, Strasbourg.
    19. Schmoch, Ulrich, 2007. "Double-boom cycles and the comeback of science-push and market-pull," Research Policy, Elsevier, vol. 36(7), pages 1000-1015, September.
    20. Rémi Barré & Françoise Laville, 1994. "La bibliométrie des brevets : une mesure de l'activité technologique," Économie et Statistique, Programme National Persée, vol. 275(1), pages 71-81.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:uto:labeco:202502. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Laura Ballestra or Cinzia Carlevaris (email available below). General contact details of provider: https://edirc.repec.org/data/leifrit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.