A shortcut through long loops: An illustration of two alternatives to looping over observations
It is well known that looping over observations can be slow and should be avoided. The objective of this article is to discuss two alternative solutions to looping over observations that can be used to overcome a particular data-management problem of merging datasets in which unique key identifiers changed over time. The ﬁrst alternative, mapch, which is introduced in this article, uses a combination of appending, indexing, and merging to solve the problem, while the second alternative uses repeated merging. Both solutions are much quicker than looping over observations. However, depending on the nature of the problem, one solution may work better than the other. It is argued that the use of such dataset-type manipulations may be suitable to overcome other data-management problems. More generally speaking, the issue that is addressed—searching for an alternative to looping over observations—may be common and illustrates the importance of balancing the costs of developing an efficient solution with the benefits accruing from that solution. Copyright 2008 by StataCorp LP.
Volume (Year): 8 (2008)
Issue (Month): 4 (December)
|Contact details of provider:|| Web page: http://www.stata-journal.com/|
|Order Information:||Web: http://www.stata-journal.com/subscription.html|
When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:8:y:2008:i:4:p:540-553. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Christopher F. Baum)or (Lisa Gilmore)
If references are entirely missing, you can add them using this form.