IDEAS home Printed from https://ideas.repec.org/p/ldr/wpaper/13.html
   My bibliography  Save this paper

Sequential Regression Multiple Imputation for Incomplete Multivariate Data using Markov Chain Monte Carlo

Author

Listed:
  • Miguel Lacerda
  • Cally Ardington

    (SALDRU, School of Economics, University of Cape TownAuthor-Email:)

  • Murray Leibbrandt

    (SALDRU, School of Economics, University of Cape Town)

Abstract

This paper discusses the theoretical background to handling missing data in a multivariate context. Earlier methods for dealing with item non-response are reviewed, followed by an examination of some of the more modern methods and, in particular, multiple imputation. One such technique, known as sequential regression multivariate imputation, which employs a Markov chain Monte Carlo algorithm is described and implemented. It is demonstrated that distributional convergence is rapid and only a few imputations are necessary in order to produce accurate point estimates and preserve multivariate relationships, whilst adequately accounting for the uncertainty introduced by the imputation procedure. It is further shown that lower fractions of missing data and the inclusion of relevant covariates in the imputation model are desirable in terms of bias reduction.

Suggested Citation

  • Miguel Lacerda & Cally Ardington & Murray Leibbrandt, 2007. "Sequential Regression Multiple Imputation for Incomplete Multivariate Data using Markov Chain Monte Carlo," SALDRU Working Papers 13, Southern Africa Labour and Development Research Unit, University of Cape Town.
  • Handle: RePEc:ldr:wpaper:13
    as

    Download full text from publisher

    File URL: https://opensaldru.uct.ac.za/bitstream/handle/11090/41/2008_13.pdf?sequence=1
    File Function: Full text
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Patrick Royston, 2004. "Multiple imputation of missing values," Stata Journal, StataCorp LP, vol. 4(3), pages 227-241, September.
    2. Javaras, Kristin N. & Van Dyk, David A., 2003. "Multiple Imputation for Incomplete Data With Semicontinuous Variables," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 703-715, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Nic Baigrie & Katherine Eyal, 2014. "An Evaluation of the Determinants and Implications of Panel Attrition in the National Income Dynamics Survey (2008-2010)," South African Journal of Economics, Economic Society of South Africa, vol. 82(1), pages 39-65, March.
    2. Derek Yu, 2013. "Some factors influencing the comparability and reliability of poverty estimates across household surveys," Working Papers 03/2013, Stellenbosch University, Department of Economics.
    3. David McLennan & Michael Noble & Gemma Wright & Helen Barnes & Faith Masekesa, 2021. "Exploring the quality of income data in two African household surveys for the purpose of tax-benefit microsimulation modelling: Imputing employment income in Tanzania and Zambia," WIDER Working Paper Series wp-2021-134, World Institute for Development Economic Research (UNU-WIDER).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Harald Schoen, 2008. "Identity, Instrumental Self-Interest and Institutional Evaluations," European Union Politics, , vol. 9(1), pages 5-29, March.
    2. Julia S. Goldberg, 2011. "Identity Salience and Involvement among Resident and Nonresident Fathers," Working Papers 1323, Princeton University, School of Public and International Affairs, Center for Research on Child Wellbeing..
    3. Hilde Coffe & Catherine Bolzendahl, 2011. "Gender Gaps in Political Participation Across Sub-Saharan African Nations," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 102(2), pages 245-264, June.
    4. Ronald Mincy & Jennifer Hill & Marilyn Sinkewicz, 2009. "Marriage: Cause or mere indicator of future earnings growth?," Journal of Policy Analysis and Management, John Wiley & Sons, Ltd., vol. 28(3), pages 417-439.
    5. John Bound & Michael F. Lovenheim & Sarah Turner, 2010. "Why Have College Completion Rates Declined? An Analysis of Changing Student Preparation and Collegiate Resources," American Economic Journal: Applied Economics, American Economic Association, vol. 2(3), pages 129-157, July.
    6. von Hinke Kessler Scholder S, 2009. "Genetic Markers as Instrumental Variables: An Application to Child Fat Mass and Academic Achievement," Health, Econometrics and Data Group (HEDG) Working Papers 09/25, HEDG, c/o Department of Economics, University of York.
    7. Natalia Krüger & Axel McCallum & Víctor Volman, 2020. "Segregación escolar por nivel socioeconómico: disparidades entre las provincias argentinas," Asociación Argentina de Economía Política: Working Papers 4362, Asociación Argentina de Economía Política.
    8. Anna K Stuck & Marie Méan & Andreas Limacher & Marc Righini & Kurt Jaeger & Hans-Jürg Beer & Joseph Osterwalder & Beat Frauchiger & Christian M Matter & Nils Kucher & Michael Egloff & Markus Aschwande, 2014. "The Adherence to Initial Processes of Care in Elderly Patients with Acute Venous Thromboembolism," PLOS ONE, Public Library of Science, vol. 9(7), pages 1-7, July.
    9. Ethan T. Hunt & Bridget Armstrong & Brie M. Turner-McGrievy & Michael W. Beets & Robert G. Weaver, 2021. "Differences by School Location in Summer and School Monthly Weight Change: Findings from a Nationally Representative Sample," IJERPH, MDPI, vol. 18(21), pages 1-12, November.
    10. repec:pri:crcwel:wp12-10-ff is not listed on IDEAS
    11. Shoham, David A. & Vupputuri, Suma & Kaufman, Jay S. & Kshirsagar, Abhijit V. & Diez Roux, Ana V. & Coresh, Josef & Heiss, Gerardo, 2008. "Kidney disease and the cumulative burden of life course socioeconomic conditions: The Atherosclerosis Risk in Communities (ARIC) Study," Social Science & Medicine, Elsevier, vol. 67(8), pages 1311-1320, October.
    12. Elizabeth Washbrook & Paul Gregg & Carol Propper, 2014. "A decomposition analysis of the relationship between parental income and multiple child outcomes," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 177(4), pages 757-782, October.
    13. Jennifer March Augustine, 2016. "Exploring New Life Course Patterns of Mother’s Continuing Secondary and College Education," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 35(6), pages 727-755, December.
    14. Robin S. H�gn�s & Marcia J. Carlson, 2009. "Intergenerational Relationships and Union Stability in Fragile Families," Working Papers 1174, Princeton University, School of Public and International Affairs, Center for Research on Child Wellbeing..
    15. Gerko Vink & Laurence E. Frank & Jeroen Pannekoek & Stef Buuren, 2014. "Predictive mean matching imputation of semicontinuous variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 61-90, February.
    16. Zsuzsa Blasko & Artur Pokropek & Joanna Sikora, 2018. "Science career plans of adolescents: patterns, trends and gender divides," JRC Research Reports JRC109135, Joint Research Centre.
    17. Sonia Bhalotra & Samantha Rawlings, 2013. "Gradients of the Intergenerational Transmission of Health in Developing Countries," The Review of Economics and Statistics, MIT Press, vol. 95(2), pages 660-672, May.
    18. Martine Audibert & Yong He & Jacky Mathonnat, 2013. "Two-Period Comparison of Healthcare Demand with Income Growth and Population Aging in Rural China: Implications for Adjustment of the Healthcare Supply and Development," Working Papers halshs-00846088, HAL.
    19. Stoddard, Pamela & Handley, Margaret A. & Vargas Bustamante, Arturo & Schillinger, Dean, 2011. "The influence of indigenous status and community indigenous composition on obesity and diabetes among Mexican adults," Social Science & Medicine, Elsevier, vol. 73(11), pages 1635-1643.
    20. Manuel S. González Canché, 2017. "Financial Benefits of Rapid Student Loan Repayment: An Analytic Framework Employing Two Decades of Data," The ANNALS of the American Academy of Political and Social Science, , vol. 671(1), pages 154-182, May.
    21. Sandner, Malte & Jungmann, Tanja, 2017. "Gender-specific effects of early childhood intervention: Evidence from a randomized controlled trial," Labour Economics, Elsevier, vol. 45(C), pages 59-78.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ldr:wpaper:13. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Alison Siljeur (email available below). General contact details of provider: https://edirc.repec.org/data/sauctza.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.