Felicity Clemens (London School of Hygiene and Tropical Medicine)
Abstract
Cleaning and verifying datasets of many different types often raises the same problems. This presentation gives a brief overview of three useful processes and their associated Stata commands: (1) finding, counting and removing duplicated data and other multiple entries; (2) summing individual-level entries to give an overall score per individual, including when to treat missing data as 0; (3) a recap of merging data and uses of the merge command. The presentation outlines the difficulties frequently encountered in these three situations and shows how they can be addressed using the common Stata commands count, rsum, sum and merge/append respectively.
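The presentation's own code is not reproduced in this abstract. As a rough illustration of the three tasks it lists, the following do-file sketch uses a small made-up dataset (the variables id, q1 to q3 and age are hypothetical). It assumes a reasonably recent Stata release: egen's rowtotal() is used in place of the older rsum() function named above, and merge uses the 1:1 syntax, both of which may differ from what was shown in the talk.

```stata
* Hypothetical data: one row per person, with three item scores.
clear
input id q1 q2 q3
1 2 3 1
2 1 . 2
2 1 . 2
3 . . .
end

* 1. Duplicates: count rows whose id occurs more than once,
*    then drop observations that are exact copies on every variable.
bysort id: generate n_rows = _N
count if n_rows > 1
duplicates report
duplicates drop

* 2. Row totals: rowtotal() treats missing values as 0, so a person with
*    no answers at all scores 0 unless the missing option is specified.
egen score   = rowtotal(q1 q2 q3)
egen score_m = rowtotal(q1 q2 q3), missing
egen n_items = rownonmiss(q1 q2 q3)

* 3. Merge: combine with a second (hypothetical) file on id and
*    inspect _merge to see which ids matched.
tempfile scores
save `scores'
clear
input id age
1 34
2 51
4 29
end
merge 1:1 id using `scores'
tabulate _merge
```

Comparing score with score_m and n_items is one way to decide, case by case, whether a total of 0 reflects genuine zero responses or simply missing data.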