IDEAS home Printed from https://ideas.repec.org/a/tsj/stataj/v8y2008i1p49-67.html
   My bibliography  Save this article

A new framework for managing and analyzing multiply imputed data in Stata

Author

Listed:
  • John B. Carlin

    (Clinical Epidemiology & Biostatistics Unit, Murdoch Children’s Research Institute & University of Melbourne)

  • John C. Galati

    (Clinical Epidemiology & Biostatistics Unit, Murdoch Children’s Research Institute & University of Melbourne)

  • Patrick Royston

    (Cancer Group, MRC Clinical Trials Unit)

Abstract

A new set of tools is described for performing analyses of an ensemble of datasets that includes multiple copies of the original data with imputations of missing values, as required for the method of multiple imputation. The tools replace those originally developed by the authors. They are based on a simple data management paradigm in which the imputed datasets are all stored along with the original data in a single dataset with a vertically stacked format, as proposed by Royston in his ice and micombine commands. Stacking into a single dataset simplifies the management of the imputed datasets compared with storing them individually. Analysis and manipulation of the stacked datasets is performed with a new prefix command, mim, which can accommodate data imputed by any method as long as a few simple rules are followed in creating the imputed data. mim can validly fit most of the regression models available in Stata to multiply imputed datasets, giving parameter estimates and confidence intervals computed according to Rubin’s results for multiple imputation inference. Particular attention is paid to limiting the available postestimation commands to those that are known to be valid within the multiple imputation context. However, the user has flexibility to override these defaults. Features of these new tools are illustrated using two previously published examples. Copyright 2008 by StataCorp LP.

Suggested Citation

  • John B. Carlin & John C. Galati & Patrick Royston, 2008. "A new framework for managing and analyzing multiply imputed data in Stata," Stata Journal, StataCorp LP, vol. 8(1), pages 49-67, February.
  • Handle: RePEc:tsj:stataj:v:8:y:2008:i:1:p:49-67
    as

    Download full text from publisher

    File URL: http://www.stata-journal.com/article.html?article=st0139
    Download Restriction: no

    File URL: http://www.stata-journal.com/software/sj8-1/st0139/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. John B. Carlin & Ning Li & Philip Greenwood & Carolyn Coffey, 2003. "Tools for analyzing multiple imputed datasets," Stata Journal, StataCorp LP, vol. 3(3), pages 226-244, September.
    2. Patrick Royston, 2004. "Multiple imputation of missing values," Stata Journal, StataCorp LP, vol. 4(3), pages 227-241, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Patrick Royston & John B. Carlin & Ian R. White, 2009. "Multiple imputation of missing values: New features for mim," Stata Journal, StataCorp LP, vol. 9(2), pages 252-264, June.
    2. Florence A. Becot & Shoshanah M. Inwood, 2022. "Medical economic vulnerability: a next step in expanding the farm resilience scholarship," Agriculture and Human Values, Springer;The Agriculture, Food, & Human Values Society (AFHVS), vol. 39(3), pages 1097-1116, September.
    3. Patrick Royston, 2007. "Multiple imputation of missing values: further update of ice, with an emphasis on interval censoring," Stata Journal, StataCorp LP, vol. 7(4), pages 445-464, December.
    4. Song, Lijun, 2015. "Does who you know in the positional hierarchy protect or hurt? Social capital, comparative reference group, and depression in two societies," Social Science & Medicine, Elsevier, vol. 136, pages 117-127.
    5. Kranjac, Ashley Wendell, 2015. "The moderating effect of self-efficacy on normal-weight, overweight, and obese children's math achievement: A longitudinal analysis," Social Science & Medicine, Elsevier, vol. 128(C), pages 168-177.
    6. Stephen Vaisey, 2010. "What People Want: Rethinking Poverty, Culture, and Educational Attainment," The ANNALS of the American Academy of Political and Social Science, , vol. 629(1), pages 75-101, May.
    7. Keese, Matthias, 2012. "Who feels constrained by high debt burdens? Subjective vs. objective measures of household debt," Journal of Economic Psychology, Elsevier, vol. 33(1), pages 125-141.
    8. Enno Siemsen & Aleda V. Roth & Sridhar Balasubramanian & Gopesh Anand, 2009. "The Influence of Psychological Safety and Confidence in Knowledge on Employee Knowledge Sharing," Manufacturing & Service Operations Management, INFORMS, vol. 11(3), pages 429-447, April.
    9. Helen Dakin & Nancy Devlin & Yan Feng & Nigel Rice & Phill O'Neill & David Parkin, 2015. "The Influence of Cost‐Effectiveness and Other Factors on Nice Decisions," Health Economics, John Wiley & Sons, Ltd., vol. 24(10), pages 1256-1271, October.
    10. Wesley Eddings & Yulia Marchenko, 2012. "Diagnostics for multiple imputation in Stata," Stata Journal, StataCorp LP, vol. 12(3), pages 353-367, September.
    11. H. Joseph Newton & Nicholas J. Cox, 2016. "The Stata Journal Editors' Prize 2016: Patrick Royston," Stata Journal, StataCorp LP, vol. 16(4), pages 815-825, December.
    12. Olívia Lucena de Medeiros & Jorge Otávio Maia Barreto & Matthew Harris & Letícia Xander Russo & Everton Nunes da Silva, 2020. "Delivering maternal and childcare at primary healthcare level: The role of PMAQ as a pay for performance strategy in Brazil," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-18, October.
    13. Austin Nichols, 2008. "Erratum and discussion of propensity-score reweighting," Stata Journal, StataCorp LP, vol. 8(4), pages 532-539, December.
    14. Yulia V. Marchenko & Jerome P. Reiter, 2009. "Improved degrees of freedom for multivariate significance tests obtained from multiply imputed, small-sample data," Stata Journal, StataCorp LP, vol. 9(3), pages 388-397, September.
    15. Masters, Samuel H. & Burstein, Roy & Amofah, George & Abaogye, Patrick & Kumar, Santosh & Hanlon, Michael, 2013. "Travel time to maternity care and its effect on utilization in rural Ghana: A multilevel analysis," Social Science & Medicine, Elsevier, vol. 93(C), pages 147-154.
    16. Terri Friedline & Stacia West, 2016. "Financial Education is not Enough: Millennials May Need Financial Capability to Demonstrate Healthier Financial Behaviors," Journal of Family and Economic Issues, Springer, vol. 37(4), pages 649-671, December.
    17. Petrit Gashi & Iraj Hashi & Geoff Pugh, 2014. "Export behaviour of SMEs in transition countries," Small Business Economics, Springer, vol. 42(2), pages 407-435, February.
    18. Patrick Royston, 2009. "Multiple imputation of missing values: Further update of ice, with an emphasis on categorical variables," Stata Journal, StataCorp LP, vol. 9(3), pages 466-477, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. John Bound & Michael F. Lovenheim & Sarah Turner, 2010. "Why Have College Completion Rates Declined? An Analysis of Changing Student Preparation and Collegiate Resources," American Economic Journal: Applied Economics, American Economic Association, vol. 2(3), pages 129-157, July.
    2. Jue Yang & Shunsuke Managi & Masayuki Sato, 2015. "The effect of institutional quality on national wealth: an examination using multiple imputation method," Environmental Economics and Policy Studies, Springer;Society for Environmental Economics and Policy Studies - SEEPS, vol. 17(3), pages 431-453, July.
    3. Wesley Eddings & Yulia Marchenko, 2012. "Diagnostics for multiple imputation in Stata," Stata Journal, StataCorp LP, vol. 12(3), pages 353-367, September.
    4. Patrick Royston, 2005. "Multiple imputation of missing values: update," Stata Journal, StataCorp LP, vol. 5(2), pages 188-201, June.
    5. Gabriele Beissel Durrant, 2009. "Imputation Methods for Handling Item-Nonresponse in the Social Sciences: A Methodological Review," Working Papers id:2007, eSocialSciences.
    6. Harald Schoen, 2008. "Identity, Instrumental Self-Interest and Institutional Evaluations," European Union Politics, , vol. 9(1), pages 5-29, March.
    7. Julia S. Goldberg, 2011. "Identity Salience and Involvement among Resident and Nonresident Fathers," Working Papers 1323, Princeton University, School of Public and International Affairs, Center for Research on Child Wellbeing..
    8. Hilde Coffe & Catherine Bolzendahl, 2011. "Gender Gaps in Political Participation Across Sub-Saharan African Nations," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 102(2), pages 245-264, June.
    9. Ronald Mincy & Jennifer Hill & Marilyn Sinkewicz, 2009. "Marriage: Cause or mere indicator of future earnings growth?," Journal of Policy Analysis and Management, John Wiley & Sons, Ltd., vol. 28(3), pages 417-439.
    10. von Hinke Kessler Scholder S, 2009. "Genetic Markers as Instrumental Variables: An Application to Child Fat Mass and Academic Achievement," Health, Econometrics and Data Group (HEDG) Working Papers 09/25, HEDG, c/o Department of Economics, University of York.
    11. Natalia Krüger & Axel McCallum & Víctor Volman, 2020. "Segregación escolar por nivel socioeconómico: disparidades entre las provincias argentinas," Asociación Argentina de Economía Política: Working Papers 4362, Asociación Argentina de Economía Política.
    12. Anna K Stuck & Marie Méan & Andreas Limacher & Marc Righini & Kurt Jaeger & Hans-Jürg Beer & Joseph Osterwalder & Beat Frauchiger & Christian M Matter & Nils Kucher & Michael Egloff & Markus Aschwande, 2014. "The Adherence to Initial Processes of Care in Elderly Patients with Acute Venous Thromboembolism," PLOS ONE, Public Library of Science, vol. 9(7), pages 1-7, July.
    13. Ethan T. Hunt & Bridget Armstrong & Brie M. Turner-McGrievy & Michael W. Beets & Robert G. Weaver, 2021. "Differences by School Location in Summer and School Monthly Weight Change: Findings from a Nationally Representative Sample," IJERPH, MDPI, vol. 18(21), pages 1-12, November.
    14. repec:pri:crcwel:wp12-10-ff is not listed on IDEAS
    15. Shoham, David A. & Vupputuri, Suma & Kaufman, Jay S. & Kshirsagar, Abhijit V. & Diez Roux, Ana V. & Coresh, Josef & Heiss, Gerardo, 2008. "Kidney disease and the cumulative burden of life course socioeconomic conditions: The Atherosclerosis Risk in Communities (ARIC) Study," Social Science & Medicine, Elsevier, vol. 67(8), pages 1311-1320, October.
    16. Elizabeth Washbrook & Paul Gregg & Carol Propper, 2014. "A decomposition analysis of the relationship between parental income and multiple child outcomes," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 177(4), pages 757-782, October.
    17. Jennifer March Augustine, 2016. "Exploring New Life Course Patterns of Mother’s Continuing Secondary and College Education," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 35(6), pages 727-755, December.
    18. Robin S. H�gn�s & Marcia J. Carlson, 2009. "Intergenerational Relationships and Union Stability in Fragile Families," Working Papers 1174, Princeton University, School of Public and International Affairs, Center for Research on Child Wellbeing..
    19. Zsuzsa Blasko & Artur Pokropek & Joanna Sikora, 2018. "Science career plans of adolescents: patterns, trends and gender divides," JRC Research Reports JRC109135, Joint Research Centre.
    20. Sonia Bhalotra & Samantha Rawlings, 2013. "Gradients of the Intergenerational Transmission of Health in Developing Countries," The Review of Economics and Statistics, MIT Press, vol. 95(2), pages 660-672, May.
    21. Martine Audibert & Yong He & Jacky Mathonnat, 2013. "Two-Period Comparison of Healthcare Demand with Income Growth and Population Aging in Rural China: Implications for Adjustment of the Healthcare Supply and Development," Working Papers halshs-00846088, HAL.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:8:y:2008:i:1:p:49-67. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum or Lisa Gilmore (email available below). General contact details of provider: http://www.stata-journal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.