IDEAS home Printed from https://ideas.repec.org/a/spr/psycho/v83y2018i4d10.1007_s11336-018-9616-y.html
   My bibliography  Save this article

Multiple Imputation for Bounded Variables

Author

Listed:
  • Marco Geraci

    (University of South Carolina)

  • Alexander McLain

    (University of South Carolina)

Abstract

Missing data are a common issue in statistical analyses. Multiple imputation is a technique that has been applied in countless research studies and has a strong theoretical basis. Most of the statistical literature on multiple imputation has focused on unbounded continuous variables, with mostly ad hoc remedies for variables with bounded support. These approaches can be unsatisfactory when applied to bounded variables as they can produce misleading inferences. In this paper, we propose a flexible quantile-based imputation model suitable for distributions defined over singly or doubly bounded intervals. Proper support of the imputed values is ensured by applying a family of transformations with singly or doubly bounded range. Simulation studies demonstrate that our method is able to deal with skewness, bimodality, and heteroscedasticity and has superior properties as compared to competing approaches, such as log-normal imputation and predictive mean matching. We demonstrate the application of the proposed imputation procedure by analysing data on mathematical development scores in children from the Millennium Cohort Study, UK. We also show a specific advantage of our methods using a small psychiatric dataset. Our methods are relevant in a number of fields, including education and psychology.

Suggested Citation

  • Marco Geraci & Alexander McLain, 2018. "Multiple Imputation for Bounded Variables," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 919-940, December.
  • Handle: RePEc:spr:psycho:v:83:y:2018:i:4:d:10.1007_s11336-018-9616-y
    DOI: 10.1007/s11336-018-9616-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11336-018-9616-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11336-018-9616-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. van Buuren, Stef & Groothuis-Oudshoorn, Karin, 2011. "mice: Multivariate Imputation by Chained Equations in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i03).
    2. Paul T. von Hippel, 2013. "Should a Normal Imputation Model be Modified to Impute Skewed Variables?," Sociological Methods & Research, , vol. 42(1), pages 105-138, February.
    3. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 287-296, July.
    4. Stephen Machin & Sandra McNally, 2005. "Gender and Student Achievement in English Schools," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 21(3), pages 357-372, Autumn.
    5. Mu, Yunming & He, Xuming, 2007. "Power Transformation Toward a Linear Regression Quantile," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 269-279, March.
    6. Koenker, Roger W & Bassett, Gilbert, Jr, 1978. "Regression Quantiles," Econometrica, Econometric Society, vol. 46(1), pages 33-50, January.
    7. Royston, Patrick & White, Ian R., 2011. "Multiple Imputation by Chained Equations (MICE): Implementation in Stata," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i04).
    8. He, Yulei & Raghunathan, Trivellore E., 2006. "Tukey's gh Distribution for Multiple Imputation," The American Statistician, American Statistical Association, vol. 60, pages 251-256, August.
    9. Yulei He & Trivellore E. Raghunathan, 2012. "Multiple imputation using multivariate gh transformations," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(10), pages 2177-2198, June.
    10. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys: Reply," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 300-301, July.
    11. Bernd Fitzenberger & Ralf A. Wilke & Xuan Zhang, 2010. "Implementing Box-Cox Quantile Regression," Econometric Reviews, Taylor & Francis Journals, vol. 29(2), pages 158-181, April.
    12. Buchinsky, Moshe, 1995. "Quantile regression, Box-Cox transformation model, and the U.S. wage structure, 1963-1987," Journal of Econometrics, Elsevier, vol. 65(1), pages 109-154, January.
    13. Hakan Demirtas & Donald Hedeker, 2008. "Imputing continuous data under some non‐Gaussian distributions," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 62(2), pages 193-205, May.
    14. Hakim-Moulay Dehbi & Mario Cortina-Borja & Marco Geraci, 2016. "Aranda-Ordaz quantile regression for student performance assessment," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(1), pages 58-71, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Urko Aguirre-Larracoechea & Cruz E. Borges, 2021. "Imputation for Repeated Bounded Outcome Data: Statistical and Machine-Learning Approaches," Mathematics, MDPI, vol. 9(17), pages 1-27, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gerko Vink & Laurence E. Frank & Jeroen Pannekoek & Stef Buuren, 2014. "Predictive mean matching imputation of semicontinuous variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 61-90, February.
    2. Youngjoo Cho & Debashis Ghosh, 2021. "Quantile-Based Subgroup Identification for Randomized Clinical Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(1), pages 90-128, April.
    3. Ahfock, Daniel & Pyne, Saumyadipta & McLachlan, Geoffrey J., 2022. "Statistical file-matching of non-Gaussian data: A game theoretic approach," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    4. Hyung-Gun Kim & Kwong-Chin Hung & Sung Park, 2015. "Determinants of Housing Prices in Hong Kong: A Box-Cox Quantile Regression Approach," The Journal of Real Estate Finance and Economics, Springer, vol. 50(2), pages 270-287, February.
    5. Maria Marino & Alessio Farcomeni, 2015. "Linear quantile regression models for longitudinal experiments: an overview," METRON, Springer;Sapienza Università di Roma, vol. 73(2), pages 229-247, August.
    6. Ralf Münnich & Siegfried Gabler & Christian Bruch & Jan Pablo Burgard & Tobias Enderle & Jan-Philipp Kolb & Thomas Zimmermann, 2015. "Tabellenauswertungen im Zensus unter Berücksichtigung fehlender Werte," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 9(3), pages 269-304, December.
    7. Michael D. Teter & Johannes O. Royset & Alexandra M. Newman, 2019. "Modeling uncertainty of expert elicitation for use in risk-based optimization," Annals of Operations Research, Springer, vol. 280(1), pages 189-210, September.
    8. Saeideh Kamgar & Florian Meinfelder & Ralf Münnich & Hamidreza Navvabpour, 2020. "Estimation within the new integrated system of household surveys in Germany," Statistical Papers, Springer, vol. 61(5), pages 2091-2117, October.
    9. Jayeeta Bhattacharya, 2020. "Quantile regression with generated dependent variable and covariates," Papers 2012.13614, arXiv.org.
    10. Jana Emmenegger & Ralf Münnich & Jannik Schaller, 2022. "Evaluating Data Fusion Methods to Improve Income Modelling," Research Papers in Economics 2022-03, University of Trier, Department of Economics.
    11. Jensen, Are & Clausen, Tommy H., 2017. "Origins and emergence of exploration and exploitation capabilities in new technology-based firms," Technological Forecasting and Social Change, Elsevier, vol. 120(C), pages 163-175.
    12. Ann-Marie Küchler & Dana Schultchen & Tim Dretzler & Morten Moshagen & David D. Ebert & Harald Baumeister, 2023. "A Three-Armed Randomized Controlled Trial to Evaluate the Effectiveness, Acceptance, and Negative Effects of StudiCare Mindfulness, an Internet- and Mobile-Based Intervention for College Students with," IJERPH, MDPI, vol. 20(4), pages 1-23, February.
    13. Gerko Vink & Stef van Buuren, 2013. "Multiple Imputation of Squared Terms," Sociological Methods & Research, , vol. 42(4), pages 598-607, November.
    14. Renate S M Buisman & Katharina Pittner & Marieke S Tollenaar & Jolanda Lindenberg & Lisa J M van den Berg & Laura H C G Compier-de Block & Joost R van Ginkel & Lenneke R A Alink & Marian J Bakermans-K, 2020. "Intergenerational transmission of child maltreatment using a multi-informant multi-generation family design," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-23, March.
    15. Adel Bosch & Steven F. Koch, 2021. "Individual and Household Debt: Does Imputation Choice Matter?," Working Papers 202141, University of Pretoria, Department of Economics.
    16. Lamu, Admassu N. & Olsen, Jan Abel, 2016. "The relative importance of health, income and social relations for subjective well-being: An integrative analysis," Social Science & Medicine, Elsevier, vol. 152(C), pages 176-185.
    17. Kristian Kleinke & Jost Reinecke, 2013. "Multiple imputation of incomplete zero-inflated count data," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 67(3), pages 311-336, August.
    18. Rabea Aschenbruck & Gero Szepannek & Adalbert F. X. Wilhelm, 2023. "Imputation Strategies for Clustering Mixed-Type Data with Missing Values," Journal of Classification, Springer;The Classification Society, vol. 40(1), pages 2-24, April.
    19. Williams, Randi M. & Zhang, Jing & Woodard, Nathaniel & Slade, Jimmie & Santos, Sherie Lou Zara & Knott, Cheryl L., 2020. "Development and validation of an instrument to assess institutionalization of health promotion in faith-based organizations," Evaluation and Program Planning, Elsevier, vol. 79(C).
    20. Mingyang Cai & Gerko Vink, 2022. "A note on imputing squares via polynomial combination approach," Computational Statistics, Springer, vol. 37(5), pages 2185-2201, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:psycho:v:83:y:2018:i:4:d:10.1007_s11336-018-9616-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.