Translation from narrative text to standard codes variables with Stata
In this article, we describe screening, a new Stata command for data management that can be used to examine the content of complex narrative-text variables to identify one or more user-defined keywords. The command is useful when dealing with string data contaminated with abbreviations, typos, or mistakes. A rich set of options allows a direct translation from the original narrative string to a user-defined standard coding scheme. Moreover, screening is flexible enough to facilitate the merging of information from different sources and to extract or reorganize the content of string variables.
Volume (Year): 10 (2010)
Issue (Month): 3 (September)
|Note:||to access software from within Stata, net describe http://www.stata-journal.com/software/sj10-3/dm0050/|
|Contact details of provider:|| Web page: http://www.stata-journal.com/|
|Order Information:||Web: http://www.stata-journal.com/subscription.html|
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Rafal Raciborski, 2008. "kountry: A Stata utility for merging cross-country data from multiple sources," Stata Journal, StataCorp LP, vol. 8(3), pages 390-400, September.
When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:10:y:2010:i:3:p:458-481. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Christopher F. Baum)or (Lisa Gilmore)
If references are entirely missing, you can add them using this form.