This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Speaking Stata: Distinct observations

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Nicholas J. Cox () (Durham University, UK)
Gary M. Longton () (Fred Hutchinson Cancer Research Center, Seattle)

Additional information is available for the following registered author(s):

Abstract

Distinct observations are those different with respect to one or more variables, considered either individually or jointly. Distinctness is thus a key aspect of the similarity or difference of observations. It is sometimes confounded with uniqueness. Counting the number of distinct observations may be required at any point from initial data cleaning or checking to subsequent statistical analysis. We review how far existing commands in official Stata offer solutions to this issue, and we show how to answer questions about distinct observations from first principles by using the by prefix and the egen command. The new distinct command is offered as a convenience tool.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://www.stata-journal.com/article.html?article=dm0042
File Format:
File Function: link to article purchase
Download Restriction: no
File URL: http://www.stata-journal.com/software/sj8-4/dm0042/
File Format: text/html
File Function:
Download Restriction: no

Publisher Info
Article provided by StataCorp LP in its journal Stata Journal.

Volume (Year): 8 (2008)
Issue (Month): 4 (December)
Pages: 557-568
Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Handle: RePEc:tsj:stataj:v:8:y:2008:i:4:p:557-568

Contact details of provider:
Web page: http://www.stata-journal.com/

Order Information:
Web: http://www.stata-journal.com/subscription.html

For technical questions regarding this item, or to correct its listing, contact: (Christopher F. Baum).

Related research
Keywords: distinct; by; egen; distinctness; uniqueness; data management;

Statistics
Access and download statistics

Did you know? IDEAS also indexes book chapters.

This page was last updated on 2009-10-27.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.