This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Collaborative Data Management for Longitudinal Studies

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Stephen Brehm () (University of Chicago)
L. Philip Schumm () (Department of Health Studies, University of Chicago)
Abstract

Efficient data cleaning and management is critical to the success of any large research project. This is particularly true in the case of longitudinal studies and/or those in which the data management tasks are shared among many individuals. Faced with several such projects, we developed a flexible, easy-to-use system for cleaning and managing research datasets. The system is modular, making it easy for different individuals to work on different parts of the process. This modularity also permits substantial code reuse over multiple waves of a longitudinal study. A central focus of the system is the idea of data testing; users write tests for specific variables that may then be rerun when a new wave of data becomes available or when changes to the data have been made. Although the basic ideas could be implemented in any statistical package or programming language, Stata is particularly well-suited to the task. In addition, we have written an ado-file to automate the process of building a data set and another to generate basic tests automatically from an existing dataset. Although the system was designed for use by large, collaborative projects, individuals can also benefit from using it for personal research projects.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://repec.org/nasug2005/CollaborativeDataManagementforLongitudinalStudies.ppt
File Format: application/x-mspowerpoint
File Function: presentation slides
Download Restriction: no

Publisher Info
Paper provided by Stata Users Group in its series North American Stata Users' Group Meetings 2005 with number 17.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length:
Date of creation: 12 Jul 2005
Date of revision:
Handle: RePEc:boc:asug05:17

Contact details of provider:
Postal: Administration Building, 140 Commonwealth Avenue, Chestnut Hill MA 02467
Phone: 617-552-3670
Fax: 617-552-2308
Email:
Web page: http://www.stata.com/meeting/4nasug
More information through EDIRC

For technical questions regarding this item, or to correct its listing, contact: (Christopher F Baum).

Related research
Keywords:

Statistics
Access and download statistics

Did you know? Authors registered on the RePEc Author Service receive monthly emails with details about downloads and abstract views of their works.

This page was last updated on 2009-12-2.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.