
Estimation of a zero-inflated Poisson regression model with missing covariates via nonparametric multiple imputation methods

Authors

Listed:
  • Shen-Ming Lee

    (Feng Chia University
    Academia Sinica)

  • T. Martin Lukusa

    (Academia Sinica)

  • Chin-Shang Li

    (The State University of New York, University at Buffalo)

Abstract

Zero-inflated Poisson (ZIP) regression is widely applied to model the effects of covariates on a count outcome with excess zeros. In some applications, covariates in a ZIP regression model are only partially observed. Based on imputed data generated by the multiple imputation (MI) schemes developed by Wang and Chen (Ann Stat 37:490–517, 2009), two methods are proposed to estimate the parameters of a ZIP regression model with covariates missing at random. One, proposed by Rubin (in: Proceedings of the survey research methods section of the American Statistical Association, 1978), obtains a unified estimate as the average of the estimates from all imputed datasets. The other, proposed by Fay (J Am Stat Assoc 91:490–498, 1996), averages the estimating scores from all imputed datasets and solves the resulting imputed estimating equation. Moreover, it is shown that the two proposed estimation methods are asymptotically equivalent to the semiparametric inverse probability weighting method. A modified formula is proposed to estimate the variances of the MI estimators. An extensive simulation study is conducted to investigate the performance of the estimation methods. The practicality of the methodology is illustrated with a dataset from a motorcycle survey of traffic regulations.
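
As a rough illustration of the two combination rules described in the abstract, the Python sketch below contrasts averaging per-dataset estimates with solving a single averaged estimating equation. It is not the authors' implementation. Several things are assumptions made only for this example: a plain Poisson score with one slope parameter stands in for the ZIP estimating score, three hard-coded imputed datasets stand in for the nonparametric imputation step, the (x, y) values are made up, and all function names are hypothetical. The ZIP probability mass function is included only to show the model the abstract refers to.

```python
import math

def zip_pmf(y, p, lam):
    """P(Y = y) under a ZIP model: a structural zero with probability p, else Poisson(lam)."""
    pois = math.exp(-lam) * lam ** y / math.factorial(y)
    return p + (1 - p) * pois if y == 0 else (1 - p) * pois

def score(beta, data):
    """Toy estimating score for a Poisson model with log-mean beta * x.
    ASSUMPTION: a stand-in for the ZIP score, chosen because it is monotone in beta."""
    return sum(x * (y - math.exp(beta * x)) for x, y in data)

def solve(score_fn, lo=-10.0, hi=10.0, tol=1e-10):
    """Bisection root finder; assumes score_fn is decreasing and changes sign on [lo, hi]."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if score_fn(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def average_of_estimates(imputed):
    """First rule: solve each imputed dataset's equation, then average the estimates."""
    estimates = [solve(lambda b, d=d: score(b, d)) for d in imputed]
    return sum(estimates) / len(estimates)

def estimate_from_averaged_score(imputed):
    """Second rule: average the scores over imputed datasets, then solve once."""
    return solve(lambda b: sum(score(b, d) for d in imputed) / len(imputed))

# Made-up toy data as (x, y) pairs. The first four cases are fully observed;
# the last two have a missing x that is filled in differently in each imputation.
observed = [(1, 2), (0, 0), (1, 3), (0, 1)]
imputed = [
    observed + [(1, 4), (0, 0)],
    observed + [(0, 4), (1, 0)],
    observed + [(1, 4), (1, 0)],
]

beta_avg = average_of_estimates(imputed)
beta_score = estimate_from_averaged_score(imputed)
print(f"average of estimates:    {beta_avg:.4f}")
print(f"averaged-score estimate: {beta_score:.4f}")
```

On this toy input the two rules give different values, which is compatible with the abstract: the equivalence it states is asymptotic, so finite-sample differences are expected.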

Suggested Citation

  • Shen-Ming Lee & T. Martin Lukusa & Chin-Shang Li, 2020. "Estimation of a zero-inflated Poisson regression model with missing covariates via nonparametric multiple imputation methods," Computational Statistics, Springer, vol. 35(2), pages 725-754, June.
  • Handle: RePEc:spr:compst:v:35:y:2020:i:2:d:10.1007_s00180-019-00930-x
    DOI: 10.1007/s00180-019-00930-x

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-019-00930-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Citations



    Cited by:

    1. Truong-Nhat Le & Shen-Ming Lee & Phuoc-Loc Tran & Chin-Shang Li, 2023. "Randomized Response Techniques: A Systematic Review from the Pioneering Work of Warner (1965) to the Present," Mathematics, MDPI, vol. 11(7), pages 1-26, April.
    2. Shen-Ming Lee & Truong-Nhat Le & Phuoc-Loc Tran & Chin-Shang Li, 2023. "Estimation of logistic regression with covariates missing separately or simultaneously via multiple imputation methods," Computational Statistics, Springer, vol. 38(2), pages 899-934, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. T. Martin Lukusa & Shen-Ming Lee & Chin-Shang Li, 2016. "Semiparametric estimation of a zero-inflated Poisson regression model with missing covariates," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 79(4), pages 457-483, May.
    2. Abbas Moghimbeigi & Mohammed Reza Eshraghian & Kazem Mohammad & Brian Mcardle, 2008. "Multilevel zero-inflated negative binomial regression modeling for over-dispersed count data with extra zeros," Journal of Applied Statistics, Taylor & Francis Journals, vol. 35(10), pages 1193-1202.
    3. Shen-Ming Lee & Truong-Nhat Le & Phuoc-Loc Tran & Chin-Shang Li, 2023. "Estimation of logistic regression with covariates missing separately or simultaneously via multiple imputation methods," Computational Statistics, Springer, vol. 38(2), pages 899-934, June.
    4. Lukusa, Martin T. & Phoa, Frederick Kin Hing, 2020. "A note on the weighting-type estimations of the zero-inflated Poisson regression model with missing data in covariates," Statistics & Probability Letters, Elsevier, vol. 158(C).
    5. Buu-Chau Truong & Nguyen Van Thuan & Nguyen Huu Hau & Michael McAleer, 2019. "Applications of the Newton-Raphson Method in Decision Sciences and Education," Advances in Decision Sciences, Asia University, Taiwan, vol. 23(4), pages 52-80, December.
    6. Kim-Hung Pho & Tuan-Kiet Tran & Thi Diem-Chinh Ho & Wing-Keung Wong, 2019. "Optimal Solution Techniques in Decision Sciences A Review," Advances in Decision Sciences, Asia University, Taiwan, vol. 23(1), pages 114-161, March.
    7. Yixuan Zou & Jan Hannig & Derek S. Young, 2021. "Generalized fiducial inference on the mean of zero-inflated Poisson and Poisson hurdle models," Journal of Statistical Distributions and Applications, Springer, vol. 8(1), pages 1-15, December.
    8. K. F. Lam & Hongqi Xue & Yin Bun Cheung, 2006. "Semiparametric Analysis of Zero-Inflated Count Data," Biometrics, The International Biometric Society, vol. 62(4), pages 996-1003, December.
    9. Moghimbeigi, Abbas & Eshraghian, Mohammad Reza & Mohammad, Kazem & McArdle, Brian, 2009. "A score test for zero-inflation in multilevel count data," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1239-1248, February.
    10. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.
    11. J. M. C. Santos Silva & Silvana Tenreyro, 2022. "The Log of Gravity at 15," Portuguese Economic Journal, Springer;Instituto Superior de Economia e Gestao, vol. 21(3), pages 423-437, September.
    12. Chiara Bocci & Laura Grassini & Emilia Rocco, 2021. "A multiple inflated negative binomial hurdle regression model: analysis of the Italians’ tourism behaviour during the Great Recession," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(4), pages 1109-1133, October.
    13. David Todem & Wei‐Wen Hsu & KyungMann Kim, 2023. "Nonparametric scanning tests of homogeneity for hierarchical models with continuous covariates," Biometrics, The International Biometric Society, vol. 79(3), pages 2063-2075, September.
    14. Jiang, Yuan & House, Lisa A., 2017. "Comparison of the Performance of Count Data Models under Different Zero-Inflation Scenarios Using Simulation Studies," 2017 Annual Meeting, July 30-August 1, Chicago, Illinois 258342, Agricultural and Applied Economics Association.
    15. José M. R. Murteira & Mário A. G. Augusto, 2017. "Hurdle models of repayment behaviour in personal loan contracts," Empirical Economics, Springer, vol. 53(2), pages 641-667, September.
    16. Tousifur Rahman & Partha Jyoti Hazarika & M. Masoom Ali & Manash Pratim Barman, 2022. "Three-Inflated Poisson Distribution and its Application in Suicide Cases of India During Covid-19 Pandemic," Annals of Data Science, Springer, vol. 9(5), pages 1103-1127, October.
    17. Rainer Winkelmann, 2015. "Counting on count data models," IZA World of Labor, Institute of Labor Economics (IZA), pages 148-148, May.
    18. Joan Costa-Font & Sergi Jiménez-Martín & Cristina Vilaplana, 2016. "Does long-term care subsidisation reduce unnecessary hospitalisations?," Economics Working Papers 1535, Department of Economics and Business, Universitat Pompeu Fabra.
    19. Ana María Martínez-Rodríguez & Antonio Conde-Sánchez & María José Olmo-Jiménez, 2019. "A new approach to truncated regression for count data," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 103(4), pages 503-526, December.
    20. Moritz Berger & Gerhard Tutz, 2021. "Transition models for count data: a flexible alternative to fixed distribution models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(4), pages 1259-1283, October.

