
Imputation for Skewed Data: Multivariate Lomax Case

Author

Listed:
  • Zhixin Lun

    (Oakland University
    University of California)

  • Ravindra Khattree

    (Oakland University)

Abstract

Most multiple imputation methods for multivariate missing data have been developed for normally distributed data. However, these methods may not be suitable for nonnegative and/or highly skewed data. We propose an approach that uses the Expectation-Maximization (EM) method under the assumption that nonnegative skewed data follow a multivariate Lomax distribution. Extensive simulations show that the proposed method outperforms the regular normality-based EM and k-nearest-neighbor (kNN) imputation methods under the missing completely at random (MCAR) mechanism. An application to a real-world biomedical data set is then provided.
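
The simulation setting described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' Lomax-based EM procedure, which the abstract does not spell out. It only shows skewed nonnegative data, an MCAR mask, and a plain kNN baseline. The sketch assumes the common gamma-mixture construction of the multivariate Lomax distribution (independent exponentials sharing a latent gamma rate factor). The function names, parameter values, missing rate, and k are all hypothetical choices made for illustration.

```python
import random


def sample_multivariate_lomax(n, thetas, a, rng):
    """Draw n rows from a multivariate Lomax distribution (assumed construction).

    Given a shared latent G ~ Gamma(shape=a, scale=1), the coordinates are
    independent exponentials with rates theta_j * G.
    """
    rows = []
    for _ in range(n):
        g = rng.gammavariate(a, 1.0)
        rows.append([rng.expovariate(theta * g) for theta in thetas])
    return rows


def apply_mcar(rows, rate, rng):
    """Mask each cell independently with probability `rate` (MCAR)."""
    return [[None if rng.random() < rate else x for x in row] for row in rows]


def knn_impute(rows, k):
    """Fill each missing cell with the mean of its k nearest donors.

    Distance is the mean squared difference over jointly observed columns.
    If no donor shares an observed column, the column mean is used instead.
    Assumes every column has at least one observed value.
    """
    p = len(rows[0])
    filled = [row[:] for row in rows]
    for i, row in enumerate(rows):
        for j in range(p):
            if row[j] is not None:
                continue
            candidates = []
            for m, other in enumerate(rows):
                if m == i or other[j] is None:
                    continue
                shared = [(row[c] - other[c]) ** 2 for c in range(p)
                          if row[c] is not None and other[c] is not None]
                if shared:
                    candidates.append((sum(shared) / len(shared), other[j]))
            if candidates:
                candidates.sort(key=lambda t: t[0])
                donors = [v for _, v in candidates[:k]]
            else:
                donors = [r[j] for r in rows if r[j] is not None]
            filled[i][j] = sum(donors) / len(donors)
    return filled


rng = random.Random(0)
data = sample_multivariate_lomax(200, [1.0, 0.5, 2.0], a=3.0, rng=rng)
masked = apply_mcar(data, 0.2, rng)
imputed = knn_impute(masked, k=5)

# Mean absolute error on the cells that were masked.
errors = [abs(imputed[i][j] - data[i][j])
          for i in range(len(data)) for j in range(3)
          if masked[i][j] is None]
print("masked cells:", len(errors))
print("kNN mean absolute error:", sum(errors) / len(errors))
```

Because kNN averages donor values, its imputations are pulled toward the bulk of the distribution. This is one reason a model-based method may do better on heavy-tailed data.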

Suggested Citation

  • Zhixin Lun & Ravindra Khattree, 2021. "Imputation for Skewed Data: Multivariate Lomax Case," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 86-113, May.
  • Handle: RePEc:spr:sankhb:v:83:y:2021:i:1:d:10.1007_s13571-021-00251-4
    DOI: 10.1007/s13571-021-00251-4

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13571-021-00251-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Paul T. von Hippel, 2013. "Should a Normal Imputation Model be Modified to Impute Skewed Variables?," Sociological Methods & Research, vol. 42(1), pages 105-138, February.
    2. Marco Geraci & Alexander McLain, 2018. "Multiple Imputation for Bounded Variables," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 919-940, December.
    3. Juana Sanchez & Sydney Noelle Kahmann, 2017. "R&D, Attrition and Multiple Imputation in BRDIS," Working Papers 17-13, Center for Economic Studies, U.S. Census Bureau.
    4. Maciej Beręsewicz & Dagmara Nikulin, 2018. "Informal employment in Poland: an empirical spatial analysis," Spatial Economic Analysis, Taylor & Francis Journals, vol. 13(3), pages 338-355, July.
    5. Schalk Burger & Searle Silverman & Gary van Vuuren, 2018. "Deriving Correlation Matrices for Missing Financial Time-Series Data," International Journal of Economics and Finance, Canadian Center of Science and Education, vol. 10(10), pages 105-105, October.
    6. Henry Webel & Lili Niu & Annelaura Bach Nielsen & Marie Locard-Paulet & Matthias Mann & Lars Juhl Jensen & Simon Rasmussen, 2024. "Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    7. Bram Janssens & Matthias Bogaert & Mathijs Maton, 2023. "Predicting the next Pogačar: a data analytical approach to detect young professional cycling talents," Annals of Operations Research, Springer, vol. 325(1), pages 557-588, June.
    8. Chhetri, Netra & Ghimire, Rajiv & Wagner, Melissa & Wang, Meng, 2020. "Global citizen deliberation: Case of world-wide views on climate and energy," Energy Policy, Elsevier, vol. 147(C).
    9. Ieva Burakauskaitė & Andrius Čiginas, 2023. "An Approach to Integrating a Non-Probability Sample in the Population Census," Mathematics, MDPI, vol. 11(8), pages 1-14, April.
    10. Carlos Miguel Lemos & Ross Joseph Gore & Ivan Puga-Gonzalez & F LeRon Shults, 2019. "Dimensionality and factorial invariance of religiosity among Christians and the religiously unaffiliated: A cross-cultural analysis based on the International Social Survey Programme," PLOS ONE, Public Library of Science, vol. 14(5), pages 1-36, May.
    11. Adel Bosch & Steven F. Koch, 2021. "Individual and Household Debt: Does Imputation Choice Matter?," Working Papers 202141, University of Pretoria, Department of Economics.
    12. Selcuk Bayraci, 2017. "Application of profit-based credit scoring models using R," Romanian Statistical Review, Romanian Statistical Review, vol. 65(4), pages 3-28, December.
    13. Matthias Templ, 2023. "Enhancing Precision in Large-Scale Data Analysis: An Innovative Robust Imputation Algorithm for Managing Outliers and Missing Values," Mathematics, MDPI, vol. 11(12), pages 1-22, June.
    14. D'Antoni, Jeremy M. & Khanal, Aditya R. & Mishra, Ashok K., 2014. "Examining Labor Substitution: Does Family Matter for U.S. Cash Grain Farmers?," Journal of Agricultural and Applied Economics, Southern Agricultural Economics Association, vol. 46(2), pages 1-12, May.
    15. Riccardo D’Alberto & Matteo Zavalloni & Meri Raggi & Davide Viaggi, 2018. "AES Impact Evaluation With Integrated Farm Data: Combining Statistical Matching and Propensity Score Matching," Sustainability, MDPI, vol. 10(11), pages 1-24, November.
    16. Nicholas Tierney & Dianne Cook, 2018. "Expanding tidy data principles to facilitate missing data exploration, visualization and assessment of imputations," Monash Econometrics and Business Statistics Working Papers 14/18, Monash University, Department of Econometrics and Business Statistics.
    17. P. B. Kenfac Dongmezo & P. N. Mwita & I. R. Kamga Tchwaket, 2017. "Imputation Based Treatment Effect Estimators," Journal of Statistical and Econometric Methods, SCIENPRESS Ltd, vol. 6(3), pages 1-2.
    18. Maria Lucia Parrella & Giuseppina Albano & Michele La Rocca & Cira Perna, 2019. "Reconstructing missing data sequences in multivariate time series: an application to environmental data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(2), pages 359-383, June.
    19. Yulei He & Trivellore E. Raghunathan, 2012. "Multiple imputation using multivariate gh transformations," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(10), pages 2177-2198, June.
    20. Burns, Christopher & Prager, Daniel & Ghosh, Sujit & Goodwin, Barry, 2015. "Imputing for Missing Data in the ARMS Household Section: A Multivariate Imputation Approach," 2015 AAEA & WAEA Joint Annual Meeting, July 26-28, San Francisco, California 205291, Agricultural and Applied Economics Association.
