IDEAS home Printed from https://ideas.repec.org/a/eee/insuma/v86y2019icp8-18.html
   My bibliography  Save this article

Subgroup analysis of zero-inflated Poisson regression model with applications to insurance data

Author

Listed:
  • Chen, Kun
  • Huang, Rui
  • Chan, Ngai Hang
  • Yau, Chun Yip

Abstract

Customized personal rate offering is of growing importance in the insurance industry. To achieve this, an important step is to identify subgroups of insureds from the corresponding heterogeneous claim frequency data. In this paper, a penalized Poisson regression approach for subgroup analysis in claim frequency data is proposed. Subjects are assumed to follow a zero-inflated Poisson regression model with group-specific intercepts, which capture group characteristics of claim frequency. A penalized likelihood function is derived and optimized to identify the group-specific intercepts and effects of individual covariates. To handle the challenges arising from the optimization of the penalized likelihood function, an alternating direction method of multipliers algorithm is developed and its convergence is established. Simulation studies and real applications are provided for illustrations.

Suggested Citation

  • Chen, Kun & Huang, Rui & Chan, Ngai Hang & Yau, Chun Yip, 2019. "Subgroup analysis of zero-inflated Poisson regression model with applications to insurance data," Insurance: Mathematics and Economics, Elsevier, vol. 86(C), pages 8-18.
  • Handle: RePEc:eee:insuma:v:86:y:2019:i:c:p:8-18
    DOI: 10.1016/j.insmatheco.2019.01.009
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167668718300593
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.insmatheco.2019.01.009?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. P. Tseng, 2001. "Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization," Journal of Optimization Theory and Applications, Springer, vol. 109(3), pages 475-494, June.
    2. Guelman, Leo & Guillén, Montserrat & Pérez-Marín, Ana M., 2014. "A survey of personalized treatment models for pricing strategies in insurance," Insurance: Mathematics and Economics, Elsevier, vol. 58(C), pages 68-76.
    3. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    4. Hansheng Wang & Runze Li & Chih-Ling Tsai, 2007. "Tuning parameter selectors for the smoothly clipped absolute deviation method," Biometrika, Biometrika Trust, vol. 94(3), pages 553-568.
    5. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    6. Partrat, Christian, 1994. "Compound model for two dependent kinds of claim," Insurance: Mathematics and Economics, Elsevier, vol. 15(2-3), pages 219-231, December.
    7. Yanlin Tang & Liya Xiang & Zhongyi Zhu, 2014. "Risk Factor Selection in Rate Making: EM Adaptive LASSO for Zero‐Inflated Poisson Regression Models," Risk Analysis, John Wiley & Sons, vol. 34(6), pages 1112-1127, June.
    8. Yip, Karen C.H. & Yau, Kelvin K.W., 2005. "On modeling claim frequency data in general insurance with extra zeros," Insurance: Mathematics and Economics, Elsevier, vol. 36(2), pages 153-163, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Marjan Qazvini, 2019. "On the Validation of Claims with Excess Zeros in Liability Insurance: A Comparative Study," Risks, MDPI, vol. 7(3), pages 1-17, June.
    2. Minwoo Kim & Himchan Jeong & Dipak Dey, 2022. "Approximation of Zero-Inflated Poisson Credibility Premium via Variational Bayes Approach," Risks, MDPI, vol. 10(3), pages 1-11, March.
    3. Bladt, Martin & Yslas, Jorge, 2023. "Robust claim frequency modeling through phase-type mixture-of-experts regression," Insurance: Mathematics and Economics, Elsevier, vol. 111(C), pages 1-22.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael R. Wierzbicki & Li-Bing Guo & Qing-Tao Du & Wensheng Guo, 2014. "Sparse Semiparametric Nonlinear Model With Application to Chromatographic Fingerprints," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(508), pages 1339-1349, December.
    2. Peng, Heng & Lu, Ying, 2012. "Model selection in linear mixed effect models," Journal of Multivariate Analysis, Elsevier, vol. 109(C), pages 109-129.
    3. Shuang Zhang & Xingdong Feng, 2022. "Distributed identification of heterogeneous treatment effects," Computational Statistics, Springer, vol. 37(1), pages 57-89, March.
    4. Jun Zhu & Hsin‐Cheng Huang & Perla E. Reyes, 2010. "On selection of spatial linear models for lattice data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(3), pages 389-402, June.
    5. Tang, Linjun & Zhou, Zhangong & Wu, Changchun, 2012. "Weighted composite quantile estimation and variable selection method for censored regression model," Statistics & Probability Letters, Elsevier, vol. 82(3), pages 653-663.
    6. Gaorong Li & Liugen Xue & Heng Lian, 2012. "SCAD-penalised generalised additive models with non-polynomial dimensionality," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(3), pages 681-697.
    7. Xia Chen & Liyue Mao, 2020. "Penalized empirical likelihood for partially linear errors-in-variables models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(4), pages 597-623, December.
    8. Tizheng Li & Xiaojuan Kang, 2022. "Variable selection of higher-order partially linear spatial autoregressive model with a diverging number of parameters," Statistical Papers, Springer, vol. 63(1), pages 243-285, February.
    9. Yanlin Tang & Liya Xiang & Zhongyi Zhu, 2014. "Risk Factor Selection in Rate Making: EM Adaptive LASSO for Zero‐Inflated Poisson Regression Models," Risk Analysis, John Wiley & Sons, vol. 34(6), pages 1112-1127, June.
    10. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    11. Fei Jin & Lung-fei Lee, 2018. "Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices," Econometrics, MDPI, vol. 6(1), pages 1-24, February.
    12. Xinyang Wang & Dehui Wang & Kai Yang, 2021. "Integer-valued time series model order shrinkage and selection via penalized quasi-likelihood approach," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(5), pages 713-750, July.
    13. Zhixuan Fu & Shuangge Ma & Haiqun Lin & Chirag R. Parikh & Bingqing Zhou, 2017. "Penalized Variable Selection for Multi-center Competing Risks Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(2), pages 379-405, December.
    14. Guang Cheng & Hao Zhang & Zuofeng Shang, 2015. "Sparse and efficient estimation for partial spline models with increasing dimension," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(1), pages 93-127, February.
    15. Zhangong Zhou & Rong Jiang & Weimin Qian, 2013. "LAD variable selection for linear models with randomly censored data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(2), pages 287-300, February.
    16. Joel L. Horowitz & Lars Nesheim, 2018. "Using penalized likelihood to select parameters in a random coefficients multinomial logit model," CeMMAP working papers CWP29/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    17. Sakyajit Bhattacharya & Paul McNicholas, 2014. "A LASSO-penalized BIC for mixture model selection," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(1), pages 45-61, March.
    18. Ramon I. Garcia & Joseph G. Ibrahim & Hongtu Zhu, 2010. "Variable Selection in the Cox Regression Model with Covariates Missing at Random," Biometrics, The International Biometric Society, vol. 66(1), pages 97-104, March.
    19. Dengke Xu & Zhongzhan Zhang & Liucang Wu, 2014. "Variable selection in high-dimensional double generalized linear models," Statistical Papers, Springer, vol. 55(2), pages 327-347, May.
    20. Ngai Hang Chan & Linhao Gao & Wilfredo Palma, 2022. "Simultaneous variable selection and structural identification for time‐varying coefficient models," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(4), pages 511-531, July.

    More about this item

    Keywords

    ADMM algorithm; Car insurance data; Concave pairwise fusion penalty; Heterogeneity; Subgroup analysis;
    All these keywords.

    JEL classification:

    • C2 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables
    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:insuma:v:86:y:2019:i:c:p:8-18. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/505554 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.