Speaking Stata: Distinct observations
Distinct observations are those diﬀerent with respect to one or more variables, considered either individually or jointly. Distinctness is thus a key aspect of the similarity or diﬀerence of observations. It is sometimes confounded with uniqueness. Counting the number of distinct observations may be required at any point from initial data cleaning or checking to subsequent statistical analysis. We review how far existing commands in oﬃcial Stata oﬀer solutions to this issue, and we show how to answer questions about distinct observations from ﬁrst principles by using the by preﬁx and the egen command. The new distinct command is oﬀered as a convenience tool.
Volume (Year): 8 (2008)
Issue (Month): 4 (December)
|Contact details of provider:|| Web page: http://www.stata-journal.com/|
|Order Information:||Web: http://www.stata-journal.com/subscription.html|
When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:8:y:2008:i:4:p:557-568. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Christopher F. Baum)or (Lisa Gilmore)
If references are entirely missing, you can add them using this form.