Record-Linkage from a Technical Point of View
AbstractTRecord linkage is used for preparing sampling frames, deduplication of lists and combining information on the same object from two different databases. If the identifiers of the same objects in two different databases have error free unique common identifiers like personal identification numbers (PID), record linkage is a simple file merge operation. If the identifiers contains errors, record linkage is a challenging task. In many applications, the files have widely different numbers of observations, for example a few thousand records of a sample survey and a few million records of an administrative database of social security numbers. Available software, privacy issues and future research topics are discussed.
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoPaper provided by German Council for Social and Economic Data (RatSWD) in its series Working Paper Series of the German Council for Social and Economic Data with number 124.
Date of creation: 2009
Date of revision:
Record-Linkage; Data-mining; Privacy preserving protocols;
You can help add them by filling out this form.
reading list or among the top items on IDEAS.Access and download statisticsgeneral information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (RatSWD) The email address of this maintainer does not seem to be valid anymore. Please ask RatSWD to update the entry or send us the correct address.
If references are entirely missing, you can add them using this form.