This file is part of IDEAS, which uses RePEc data



Towards Self-Contained Data: Attaching Validation Routines to Variables

Author Info
William Rising (Bellarmine University)
Abstract

One of Stata's great strengths is its data management. When building or sharing data sets, some of the most time-consuming activities are validating the data and writing documentation for them. Much of this effort could be avoided if data sets were self-contained, i.e., if they could validate themselves. The purpose of this talk is to show how this can be done within Stata. It demonstrates a package of commands for attaching validation rules to the variables themselves, via characteristics, along with commands for running error checks and marking suspicious observations in the data set. The validation system is flexible enough that simple checks continue to work even if variable names change or the data are reshaped, and rich enough that validation may depend on other variables in the data set. Because the validation is at the variable level, the self-validation also works when variables are combined with data from other data sets. With these tools, Stata's data sets will become truly self-contained.
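
The idea of a rule that travels with its variable can be illustrated with a minimal sketch. It is written in Python rather than Stata and is not the package presented in the talk; the `Variable` class, its `rule` attribute, and the `check_dataset` helper are hypothetical names chosen for illustration, and the data are made up. The rule is stored on the variable (loosely analogous to a Stata characteristic) and refers only to the variable's own values, so the check still works after a rename.

```python
class Variable:
    """A column that carries its own validation rule (hypothetical model)."""

    def __init__(self, name, values, rule=None):
        self.name = name
        self.values = list(values)
        # The rule is stored with the variable, not in a separate script.
        self.rule = rule

    def renamed(self, new_name):
        # The rule never mentions the name, so it survives a rename unchanged.
        return Variable(new_name, self.values, self.rule)

    def suspicious_rows(self):
        # Row indices whose values fail the attached rule; none if no rule.
        if self.rule is None:
            return []
        return [i for i, v in enumerate(self.values) if not self.rule(v)]


def check_dataset(variables):
    """Run every attached rule and report the failing rows per variable."""
    return {var.name: var.suspicious_rows() for var in variables}


# Made-up example data with rules attached at the variable level.
age = Variable("age", [34, -2, 51, 140], rule=lambda v: 0 <= v <= 120)
income = Variable("income", [52000, 48000, -1, 61000], rule=lambda v: v >= 0)
notes = Variable("notes", ["a", "b", "c", "d"])  # no rule attached

report = check_dataset([age.renamed("age_years"), income, notes])
print(report)  # {'age_years': [1, 3], 'income': [2], 'notes': []}
```

Because each rule lives on the variable, moving `age_years` into another collection of variables would carry its check along with it. This sketch covers only single-variable rules; checks that depend on other variables, which the abstract also mentions, are not modeled here.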

Download Info

File URL: http://repec.org/nasug2006/ckvarTalk.beamer.pdf
File Format: application/pdf
Download Restriction: no
File URL: http://repec.org/nasug2006/CheckvarChar_v1.0.0.zip
File Format: application/zip
Download Restriction: no

Publisher Info
Paper provided by Stata Users Group in its series North American Stata Users' Group Meetings 2006 with number 10.

Date of creation: 23 Jul 2006
Handle: RePEc:boc:asug06:10

Contact details of provider:
Postal: Administration Building, 140 Commonwealth Avenue, Chestnut Hill MA 02467
Phone: 617-552-3670
Fax: 617-552-2308
Web page: http://www.stata.com/meeting/5nasug

For technical questions regarding this item, or to correct its listing, contact: Christopher F Baum.


This page was last updated on 2009-12-13.

