Linking Household Survey and Administrative Record Data: What Should the Matching Variables Be?
Abstract
Linkages of household survey responses with administrative data may be based on unique individual identifiers or on survey respondent characteristics. The benefits gained from using unique identifiers need to be assessed in the light of potential problems such as non-response and measurement error. We report on a study that linked survey responses to UK government agency records on benefits and tax credits in five different ways. One matched on a respondent-supplied National Insurance Number and the other four used different combinations of sex, name, address, and date of birth. As many linkages were made using matches on sex, date of birth, and post-code, or on sex, date of birth, first name and family name, as were made using matches on self-reported National Insurance Number, and the former were also relatively accurate when assessed in terms of false positive and false negative rates. The five independent matching exercises also shed light on the potential returns from hierarchical and pooled matching.
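The matching approaches named in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the toy records, the field names, the `link`, `hierarchical`, and `pooled` helpers, and the three criteria shown are all hypothetical. In particular, "hierarchical" is assumed here to mean applying criteria in a fixed order and keeping the first link found for each respondent, and "pooled" is assumed to mean taking the union of links found under any criterion.

```python
# Hypothetical toy records; field names and values are illustrative only.
survey = [
    {"sid": 1, "nino": "AB123456C", "sex": "F", "dob": "1970-01-02",
     "postcode": "CO4 3SQ", "first": "ann", "family": "lee"},
    {"sid": 2, "nino": None, "sex": "M", "dob": "1965-05-06",
     "postcode": "CO1 1AA", "first": "bob", "family": "ray"},
    {"sid": 3, "nino": "ZZ999999Z", "sex": "F", "dob": "1980-03-04",
     "postcode": "CO2 2BB", "first": "cat", "family": "fox"},
]
admin = [
    {"aid": "a1", "nino": "AB123456C", "sex": "F", "dob": "1970-01-02",
     "postcode": "CO4 3SQ", "first": "ann", "family": "lee"},
    {"aid": "a2", "nino": "CD222222D", "sex": "M", "dob": "1965-05-06",
     "postcode": "CO1 1AA", "first": "bob", "family": "ray"},
    {"aid": "a3", "nino": "EF333333E", "sex": "F", "dob": "1980-03-04",
     "postcode": "CO9 9ZZ", "first": "cat", "family": "fox"},
]

# Three of the kinds of criteria the abstract mentions (assumed field sets).
CRITERIA = {
    "nino": ("nino",),
    "sex_dob_postcode": ("sex", "dob", "postcode"),
    "sex_dob_name": ("sex", "dob", "first", "family"),
}


def link(survey_recs, admin_recs, fields):
    """Exact match on all fields; records with a missing key are skipped
    and only unambiguous (one-to-one) hits are kept."""
    index = {}
    for rec in admin_recs:
        key = tuple(rec[f] for f in fields)
        if None not in key:
            index.setdefault(key, []).append(rec["aid"])
    links = {}
    for rec in survey_recs:
        key = tuple(rec[f] for f in fields)
        if None in key:
            continue  # e.g. respondent did not supply a NINO
        hits = index.get(key, [])
        if len(hits) == 1:
            links[rec["sid"]] = hits[0]
    return links


def hierarchical(survey_recs, admin_recs, ordered_names):
    """Assumed reading: try criteria in order, keep the first link found."""
    links = {}
    for name in ordered_names:
        found = link(survey_recs, admin_recs, CRITERIA[name])
        for sid, aid in found.items():
            links.setdefault(sid, aid)
    return links


def pooled(survey_recs, admin_recs):
    """Assumed reading: union of the links found under any criterion."""
    links = {}
    for fields in CRITERIA.values():
        for sid, aid in link(survey_recs, admin_recs, fields).items():
            links.setdefault(sid, set()).add(aid)
    return links


by_nino = link(survey, admin, CRITERIA["nino"])
by_postcode = link(survey, admin, CRITERIA["sex_dob_postcode"])
by_name = link(survey, admin, CRITERIA["sex_dob_name"])
in_order = hierarchical(survey, admin, ["nino", "sex_dob_postcode", "sex_dob_name"])
union = pooled(survey, admin)
```

In this toy data the identifier-based match links only one respondent, because one NINO is missing and another is misreported, while the characteristic-based criteria recover the others. That is the kind of trade-off between non-response, measurement error, and match coverage that the abstract describes. A pooled link set with more than one admin ID for a respondent would flag a conflict between criteria.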
Bibliographic Info
Paper provided by DIW Berlin, German Institute for Economic Research, in its series Discussion Papers of DIW Berlin, number 489.
Length: II, 22 p.
Date of creation: 2005
Keywords: Record linkage; Matching; National Insurance number; Measurement error
Find related papers by JEL classification:
- C22 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models