Patent data have been widely used in research on technological innovation to characterize firms' locations as well as the proximities among firms in knowledge space. Researchers could measure proximity among firms with a variety of measures based on patent class data, including Euclidean distance, correlation, and angle between firms' patent class distributions. Alternatively, one could measure proximity using overlap in cited patents. We point out that measures of proximity based on small numbers of patents are imprecisely measured random variables. Measures computed on samples with few patents generate both biased and imprecise measures of proximity. We explore the effects of larger sample sizes and coarser patent class breakdowns in mitigating these problems. Where possible, we suggest that researchers increase their sample sizes by aggregating years or using all of the listed patent classes on a patent, rather than just the first.
Download Info
To download:
If you experience problems downloading a file, check if you have the
proper application to
view it first. Information about this may be contained
in the File-Format links below. In case of further problems read
the IDEAS help
page. Note that these files are not on the IDEAS
site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
Publisher Info
Paper provided by National Bureau of Economic Research, Inc in its series NBER Working Papers with number
13322.
Length: Date of creation: Aug 2007 Date of revision: Handle: RePEc:nbr:nberwo:13322
Note: IO PR Contact details of provider: Postal: National Bureau of Economic Research, 1050 Massachusetts Avenue Cambridge, MA 02138, U.S.A. Phone: 617-868-3900 Email: Web page: http://www.nber.org More information through EDIRC
For technical questions regarding this item, or to correct its listing, contact: ().