“Singling out individual inventors from patent data”
Abstract
An increasing number of studies in recent years have sought to identify individual inventors from patent data. A variety of heuristics have been proposed for using the names and other information disclosed in patent documents to establish “who is who” in patents. This paper contributes to this literature by describing a methodology for identifying inventors using patent applications filed with the European Patent Office (hereafter, EPO). As in much of this literature, we follow a three-step procedure: (1) the parsing stage, aimed at reducing the noise in the inventor’s name and other fields of the patent; (2) the matching stage, where name matching algorithms are used to group similar names; and (3) the filtering stage, where additional information and various scoring schemes are used to filter the resulting groups of similarly named inventors. The paper presents the results obtained by applying these algorithms to the set of European inventors who applied to the EPO over a long period of time.
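The three stages named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not specify the algorithms, so the similarity measure (the `difflib` ratio), the 0.85 threshold, the record fields (`name`, `city`, `applicant`, `ipc`), the point weights, and all function names are assumptions chosen for illustration only.

```python
import re
import unicodedata
from difflib import SequenceMatcher
from itertools import combinations


def parse_name(raw):
    """Stage 1 (parsing): strip accents, punctuation, and case noise."""
    text = unicodedata.normalize("NFKD", raw)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^A-Za-z ]+", " ", text).upper()
    return " ".join(text.split())


def names_match(a, b, threshold=0.85):
    """Stage 2 (matching): flag two parsed names as similar.

    The threshold is a hypothetical value, not one taken from the paper.
    """
    return SequenceMatcher(None, a, b).ratio() >= threshold


def filter_score(r1, r2):
    """Stage 3 (filtering): add points for each shared attribute.

    The fields and weights are hypothetical.
    """
    score = 0
    for field, points in (("city", 2), ("applicant", 2), ("ipc", 1)):
        if r1[field] and r1[field] == r2[field]:
            score += points
    return score


def same_inventor_pairs(records, min_score=2):
    """Return index pairs judged to refer to the same inventor."""
    parsed = [parse_name(r["name"]) for r in records]
    pairs = []
    for i, j in combinations(range(len(records)), 2):
        if not names_match(parsed[i], parsed[j]):
            continue
        if filter_score(records[i], records[j]) >= min_score:
            pairs.append((i, j))
    return pairs


# Fictional records, for illustration only.
records = [
    {"name": "García, José", "city": "Barcelona", "applicant": "ACME SA", "ipc": "G06F"},
    {"name": "GARCIA JOSE", "city": "Barcelona", "applicant": "OTHER SL", "ipc": "G06F"},
    {"name": "Garcia, Jose", "city": "Madrid", "applicant": "THIRD SA", "ipc": "A61K"},
    {"name": "Müller, Hans", "city": "Barcelona", "applicant": "ACME SA", "ipc": "G06F"},
]
print(same_inventor_pairs(records))
```

In this sketch the first three records share a parsed name, but only the first two also share enough additional information (city and technology class) to pass the filter. Comparing every pair of records is quadratic, so a real data set would likely need some form of blocking before the matching stage.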
Bibliographic Info
Paper provided by University of Barcelona, Research Institute of Applied Economics in its series IREA Working Papers with number 201105.
Length: 39 pages
Date of creation: May 2011
Date of revision: May 2011
Handle: RePEc:ira:wpaper:201105
Contact details of provider:
Postal: Tinent Coronel Valenzuela, Num 1-11 08034 Barcelona
Web page: http://www.ub.edu/irea/
Related research
Keywords: “Names game”; patent data; unique inventors; name matching algorithms.
JEL classification: C8; J61; O31; O33; R0.
- C8 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs
- J61 - Labor and Demographic Economics - - Mobility, Unemployment, and Vacancies - - - Geographic Labor Mobility; Immigrant Workers
- O31 - Economic Development, Technological Change, and Growth - - Technological Change; Research and Development; Intellectual Property Rights - - - Innovation and Invention: Processes and Incentives
- O33 - Economic Development, Technological Change, and Growth - - Technological Change; Research and Development; Intellectual Property Rights - - - Technological Change: Choices and Consequences; Diffusion Processes
- R0 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General
Other versions of this item:
- Ernest Miguélez & Ismael Gómez-Miguélez, 2011. "Singling out individual inventors from patent data," Working Papers XREAP2011-03, Xarxa de Referència en Economia Aplicada (XREAP), revised May 2011.
This paper has been announced in the following NEP Reports:
- NEP-ALL-2011-07-13 (All new papers)
- NEP-INO-2011-07-13 (Innovation)
- NEP-IPR-2011-07-13 (Intellectual Property Rights)

