Linking Household Survey and Administrative Record Data: What Should the Matching Variables Be?
Linkages of household survey responses with administrative data may be based on unique individual identifiers or on survey respondent characteristics. The benefits gained from using unique identifiers need to be assessed in the light of potential problems such as non-response and measurement error. We report on a study that linked survey responses to UK government agency records on benefits and tax credits in five different ways. One matched on a respondent-supplied National Insurance Number and the other four used different combinations of sex, name, address, and date of birth. Matching on sex, date of birth, and postcode, or on sex, date of birth, first name, and family name, produced as many linkages as matching on self-reported National Insurance Number, and the former were also relatively accurate when assessed in terms of false positive and false negative rates. The five independent matching exercises also shed light on the potential returns from hierarchical and pooled matching.
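As a rough illustration of the kind of comparison the abstract describes, the sketch below links a toy survey file to a toy administrative file under several matching criteria, scores each against a known reference, and then applies the criteria hierarchically. It is not the study's procedure: the records, the field names, the rule of accepting only exact matches with a single candidate, and the definitions of the two error rates are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical toy records; every field name and value is invented.
SURVEY = [
    {"sid": 1, "nino": "AA000001A", "sex": "F", "dob": "1970-01-02",
     "postcode": "ZZ1 1AA", "first": "ann", "family": "lee"},
    {"sid": 2, "nino": None, "sex": "M", "dob": "1965-05-17",       # NINO not reported
     "postcode": "ZZ2 2BB", "first": "bob", "family": "ray"},
    {"sid": 3, "nino": "CC000009C", "sex": "F", "dob": "1980-09-30",  # NINO misreported
     "postcode": "ZZ3 3CC", "first": "cat", "family": "fox"},
]
ADMIN = [
    {"aid": "a1", "nino": "AA000001A", "sex": "F", "dob": "1970-01-02",
     "postcode": "ZZ1 1AA", "first": "ann", "family": "lee"},
    {"aid": "a2", "nino": "BB000002B", "sex": "M", "dob": "1965-05-17",
     "postcode": "ZZ2 2BB", "first": "bob", "family": "ray"},
    {"aid": "a3", "nino": "CC000003C", "sex": "F", "dob": "1980-09-30",
     "postcode": "ZZ9 9ZZ", "first": "cat", "family": "fox"},      # address differs
]
# Assumed reference linkage, used only to score the toy example.
TRUTH = {1: "a1", 2: "a2", 3: "a3"}

# Hypothetical matching criteria, loosely following the abstract.
CRITERIA = {
    "nino": ("nino",),
    "sex_dob_postcode": ("sex", "dob", "postcode"),
    "sex_dob_names": ("sex", "dob", "first", "family"),
}


def link(survey, admin, keys):
    """Exact match on `keys`; keep a link only if exactly one admin record matches."""
    index = defaultdict(list)
    for rec in admin:
        if all(rec.get(k) is not None for k in keys):
            index[tuple(rec[k] for k in keys)].append(rec["aid"])
    links = {}
    for rec in survey:
        if any(rec.get(k) is None for k in keys):
            continue  # item non-response on a matching variable
        candidates = index.get(tuple(rec[k] for k in keys), [])
        if len(candidates) == 1:
            links[rec["sid"]] = candidates[0]
    return links


def error_rates(links, truth):
    """Assumed definitions: wrong links / links made, missed true links / true links."""
    false_pos = sum(1 for sid, aid in links.items() if truth.get(sid) != aid)
    false_neg = sum(1 for sid, aid in truth.items() if links.get(sid) != aid)
    fp_rate = false_pos / len(links) if links else 0.0
    fn_rate = false_neg / len(truth) if truth else 0.0
    return fp_rate, fn_rate


def hierarchical_link(survey, admin, criteria):
    """Apply criteria in order; later criteria only see still-unlinked respondents."""
    links = {}
    for keys in criteria:
        remaining = [r for r in survey if r["sid"] not in links]
        links.update(link(remaining, admin, keys))
    return links


if __name__ == "__main__":
    for name, keys in CRITERIA.items():
        links = link(SURVEY, ADMIN, keys)
        print(name, len(links), error_rates(links, TRUTH))
    print("hierarchical", hierarchical_link(SURVEY, ADMIN, list(CRITERIA.values())))
```

In this toy setting the identifier-based match loses respondents to non-response and misreporting, while the characteristic-based matches depend on how stable each variable is; real data would also need name and address standardisation and a rule for ties.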
|Length:||II, 22 p.|
|Date of creation:||2005|
|Contact details of provider:||Web page: http://www.diw.de/en|
Handle: RePEc:diw:diwwpp:dp489