How To Kill Inventors: Testing The Massacrator© Algorithm For Inventor Disambiguation

  • Michele PEZZONI (University of Milano-Bicocca - KiTES-Università Bocconi - Observatoire des Sciences et des Techniques)
  • Francesco LISSONI (GREThA, CNRS, UMR 5113 - KiTES)
  • Gianluca TARASCONI (KiTES, Università Bocconi)

Inventor disambiguation is an increasingly important issue for users of patent data. We propose and test a number of refinements to the Massacrator© algorithm, originally proposed by Lissoni et al. (2006) and now applied to APE-INV, a free access database funded by the European Science Foundation. Following Raffo and Lhuillery (2009) we describe disambiguation as a 3-step process: cleaning and parsing, matching, and filtering. By means of sensitivity analysis, based on Monte Carlo simulations, we show how various filtering criteria can be manipulated in order to obtain optimal combinations of precision and recall (type I and type II errors). We also show how these different combinations generate different results for applications to studies on inventors' productivity, mobility, and networking. The filtering criteria based upon information on inventors' addresses are sensitive to data quality, while those based upon information on co-inventorship networks are always effective. Details on data access and data quality improvement via feedback collection are also discussed.
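
The three steps named in the abstract (cleaning and parsing, matching, filtering) and the precision/recall trade-off can be illustrated with a minimal Python sketch. This is not the Massacrator© algorithm: the records, the hand-labelled pairs, the function names, and the two filtering rules (same city, shared co-inventor) are all hypothetical and chosen only to show how switching filtering criteria on or off changes precision and recall.

```python
from itertools import combinations

# Toy records (all invented): (record_id, raw_name, city, co_inventors).
RECORDS = [
    (1, "ROSSI, Mario", "Milano", {"Bianchi"}),
    (2, "Rossi Mario ", "Milano", {"Bianchi", "Verdi"}),
    (3, "Mario Rossi", "Torino", {"Verdi"}),
    (4, "Rossi, Maria", "Roma", {"Conti"}),
    (5, "Mario Rossi", "Napoli", {"Esposito"}),  # assumed homonym, different person
]
# Assumed ground truth: record pairs that refer to the same person.
TRUE_PAIRS = {(1, 2), (1, 3), (2, 3)}


def clean_and_parse(raw_name):
    """Step 1: drop commas, lowercase, and return name tokens in sorted order."""
    tokens = raw_name.replace(",", " ").lower().split()
    return tuple(sorted(tokens))


def match(records):
    """Step 2: candidate pairs are records whose cleaned names are identical."""
    pairs = set()
    for a, b in combinations(records, 2):
        if clean_and_parse(a[1]) == clean_and_parse(b[1]):
            pairs.add((a[0], b[0]))
    return pairs


def filter_pairs(pairs, records, use_city=True, use_coinventors=True):
    """Step 3: keep a candidate pair if at least one enabled criterion supports it."""
    by_id = {r[0]: r for r in records}
    kept = set()
    for i, j in pairs:
        _, _, city_i, co_i = by_id[i]
        _, _, city_j, co_j = by_id[j]
        same_city = use_city and city_i == city_j
        shared_co = use_coinventors and bool(co_i & co_j)
        if same_city or shared_co:
            kept.add((i, j))
    return kept


def precision_recall(predicted, truth):
    """Precision = share of predicted pairs that are true; recall = share of true pairs found."""
    tp = len(predicted & truth)
    precision = tp / len(predicted) if predicted else 1.0
    recall = tp / len(truth) if truth else 1.0
    return precision, recall


if __name__ == "__main__":
    candidates = match(RECORDS)
    print("matching only:", precision_recall(candidates, TRUE_PAIRS))
    for city, co in [(True, False), (False, True), (True, True)]:
        kept = filter_pairs(candidates, RECORDS, use_city=city, use_coinventors=co)
        print(f"city={city}, co-inventors={co}:", precision_recall(kept, TRUE_PAIRS))
```

In this toy data, matching on names alone finds every true pair but also links the homonym, while filtering removes the false links at the cost of missing some true ones. Which combination is preferable depends on the application, which is the kind of choice the sensitivity analysis in the paper is meant to inform.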

File URL: http://cahiersdugretha.u-bordeaux4.fr/2012/2012-29.pdf
Download Restriction: no

Paper provided by Groupe de Recherche en Economie Théorique et Appliquée in its series Cahiers du GREThA with number 2012-29.

Date of creation: 2012
Handle: RePEc:grt:wpegrt:2012-29
Contact details of provider: Postal: Avenue Léon Duguit, 33608 Pessac Cedex
Phone: +33 (0)5.56.84.25.75
Fax: +33 (0)5.56.84.86.47
Web page: http://gretha.u-bordeaux4.fr/

References listed on IDEAS
  1. Matt Marx & Deborah Strumsky & Lee Fleming, 2009. "Mobility, Skills, and the Michigan Non-Compete Experiment," Management Science, INFORMS, vol. 55(6), pages 875-889, June.
  2. Raffo, Julio & Lhuillery, Stéphane, 2009. "How to play the 'Names Game': Patent retrieval comparing different heuristics," Research Policy, Elsevier, vol. 38(10), pages 1617-1627, December.
  3. Grid Thoma & Salvatore Torrisi & Alfonso Gambardella & Dominique Guellec & Bronwyn H. Hall & Dietmar Harhoff, 2010. "Harmonizing and Combining Large Datasets - An Application to Firm-Level Patent and Accounting Data," NBER Working Papers 15851, National Bureau of Economic Research, Inc.
  4. Jinyoung Kim & Sangjoon John Lee & Gerald Marschke, 2010. "The Influence of University Research on Industrial Innovation," Discussion Paper Series 1006, Institute of Economic Research, Korea University.
  5. Grid Thoma & Salvatore Torrisi, 2007. "Creating Powerful Indicators for Innovation Studies with Approximate Matching Algorithms. A test based on PATSTAT and Amadeus databases," KITeS Working Papers 211, KITeS, Centre for Knowledge, Internationalization and Technology Studies, Universita' Bocconi, Milano, Italy, revised Dec 2007.
  6. repec:cmi:wpaper:cemi-workingpaper-2009-006 is not listed on IDEAS
  7. Stefano Breschi & Francesco Lissoni, 2009. "Mobility of skilled workers and co-invention networks: an anatomy of localized knowledge flows," Journal of Economic Geography, Oxford University Press, vol. 9(4), pages 439-468, July.
  8. Lorenzo Cassi & Nicolas Carayol, 2009. "Who's Who in Patents. A Bayesian approach," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-00631750, HAL.
  9. Stefano Breschi & Francesco Lissoni & Fabio Montobbio, 2006. "University patenting and scientific productivity. A quantitative study of Italian academic inventors," KITeS Working Papers 189, KITeS, Centre for Knowledge, Internationalization and Technology Studies, Universita' Bocconi, Milano, Italy, revised Nov 2006.
