IDEAS home Printed from https://ideas.repec.org/a/bla/jrinsu/v88y2021i3p625-663.html
   My bibliography  Save this article

Robust estimates of insurance misrepresentation through kernel quantile regression mixtures

Author

Listed:
  • Hong Li
  • Qifan Song
  • Jianxi Su

Abstract

This paper pertains to a class of nonparametric methods for studying the misrepresentation issue in insurance applications. For this purpose, mixture models based on quantile regression in reproducing kernel Hilbert spaces are employed. Compared with the existing parametric approaches, the proposed framework features a more flexible statistics structure which could alleviate the risk of model misspecification, and is in the meantime more robust to outliers in the data. The proposed framework can not only estimate the prevalence of misrepresentation in the data, but also help identify the most suspicious individuals for the validation purpose. Through embedding state‐of‐the‐art machine learning techniques, we present a novel statistics procedure to efficiently estimate the proposed misrepresentation model in the presence of massive data. The proposed methodology is applied to study the Medical Expenditure Panel Survey data, and a significant degree of misrepresentation activity is found on the self‐reported insurance status.

Suggested Citation

  • Hong Li & Qifan Song & Jianxi Su, 2021. "Robust estimates of insurance misrepresentation through kernel quantile regression mixtures," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(3), pages 625-663, September.
  • Handle: RePEc:bla:jrinsu:v:88:y:2021:i:3:p:625-663
    DOI: 10.1111/jori.12358
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/jori.12358
    Download Restriction: no

    File URL: https://libkey.io/10.1111/jori.12358?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Akakpo, Rexford M. & Xia, Michelle & Polansky, Alan M., 2019. "Frequentist Inference In Insurance Ratemaking Models Adjusting For Misrepresentation," ASTIN Bulletin, Cambridge University Press, vol. 49(1), pages 117-146, January.
    2. Paul Embrechts & Marius Hofert, 2013. "A note on generalized inverses," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 77(3), pages 423-432, June.
    3. Andrea Gabrielli & Mario V. Wüthrich, 2018. "An Individual Claims History Simulation Machine," Risks, MDPI, vol. 6(2), pages 1-32, March.
    4. Fabio Baione & Davide Biancalana, 2019. "An Individual Risk Model for Premium Calculation Based on Quantile: A Comparison between Generalized Linear Models and Quantile Regression," North American Actuarial Journal, Taylor & Francis Journals, vol. 23(4), pages 573-590, October.
    5. Bhattacharya, Jay & Bundorf, M. Kate, 2009. "The incidence of the healthcare costs of obesity," Journal of Health Economics, Elsevier, vol. 28(3), pages 649-658, May.
    6. Ana M. Pérez-Marín & Montserrat Guillen & Manuela Alcañiz & Lluís Bermúdez, 2019. "Quantile Regression with Telematics Information to Assess the Risk of Driving above the Posted Speed Limit," Risks, MDPI, vol. 7(3), pages 1-11, July.
    7. S. M. Schennach & Yingyao Hu, 2013. "Nonparametric Identification and Semiparametric Estimation of Classical Measurement Error Models Without Side Information," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(501), pages 177-186, March.
    8. Susanne M. Schennach, 2016. "Recent Advances in the Measurement Error Literature," Annual Review of Economics, Annual Reviews, vol. 8(1), pages 341-377, October.
    9. Howard D. Bondell & Brian J. Reich & Huixia Wang, 2010. "Noncrossing quantile regression curve estimation," Biometrika, Biometrika Trust, vol. 97(4), pages 825-838.
    10. Michelle Xia, 2018. "Bayesian Adjustment for Insurance Misrepresentation in Heavy-Tailed Loss Regression," Risks, MDPI, vol. 6(3), pages 1-16, August.
    11. John Cawley & Chad Meyerhoefer & Adam Biener & Mette Hammer & Neil Wintfeld, 2015. "Savings in Medical Expenditures Associated with Reductions in Body Mass Index Among US Adults with Obesity, by Diabetes Status," PharmacoEconomics, Springer, vol. 33(7), pages 707-722, July.
    12. Wu, Qiang & Yao, Weixin, 2016. "Mixtures of quantile regressions," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 162-176.
    13. Aprajit Mahajan, 2006. "Identification and Estimation of Regression Models with Misclassification," Econometrica, Econometric Society, vol. 74(3), pages 631-665, May.
    14. Li, Youjuan & Liu, Yufeng & Zhu, Ji, 2007. "Quantile Regression in Reproducing Kernel Hilbert Spaces," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 255-268, March.
    15. Helmut Küchenhoff & Samuel M. Mwalili & Emmanuel Lesaffre, 2006. "A General Method for Dealing with Misclassification in Regression: The Misclassification SIMEX," Biometrics, The International Biometric Society, vol. 62(1), pages 85-96, March.
    16. Yuan, Ming, 2006. "GACV for quantile smoothing splines," Computational Statistics & Data Analysis, Elsevier, vol. 50(3), pages 813-829, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Michele Tumminello & Andrea Consiglio & Pietro Vassallo & Riccardo Cesari & Fabio Farabullini, 2023. "Insurance fraud detection: A statistically validated network approach," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 90(2), pages 381-419, June.
    2. Daniel Bauer & James Tyler Leverty & Joan Schmit & Justin Sydnor, 2021. "Symposium on insure‐tech, digitalization, and big‐data techniques in risk management and insurance," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(3), pages 525-528, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kato, Kengo & Sasaki, Yuya, 2019. "Uniform confidence bands for nonparametric errors-in-variables regression," Journal of Econometrics, Elsevier, vol. 213(2), pages 516-555.
    2. Mochen Yang & Edward McFowland III & Gordon Burtch & Gediminas Adomavicius, 2020. "Achieving Reliable Causal Inference with Data-Mined Variables: A Random Forest Approach to the Measurement Error Problem," Papers 2012.10790, arXiv.org.
    3. Hu, Yingyao, 2017. "The econometrics of unobservables: Applications of measurement error models in empirical industrial organization and labor economics," Journal of Econometrics, Elsevier, vol. 200(2), pages 154-168.
    4. Young Jun Lee & Daniel Wilhelm, 2020. "Testing for the presence of measurement error in Stata," Stata Journal, StataCorp LP, vol. 20(2), pages 382-404, June.
    5. Takahide Yanagi, 2019. "Inference on local average treatment effects for misclassified treatment," Econometric Reviews, Taylor & Francis Journals, vol. 38(8), pages 938-960, September.
    6. Susanne M. Schennach, 2012. "Measurement error in nonlinear models - a review," CeMMAP working papers 41/12, Institute for Fiscal Studies.
    7. Sungwan Bang & Soo-Heang Eo & Yong Mee Cho & Myoungshic Jhun & HyungJun Cho, 2016. "Non-crossing weighted kernel quantile regression with right censored data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 22(1), pages 100-121, January.
    8. Songfeng Zheng, 2014. "A generalized Newton algorithm for quantile regression models," Computational Statistics, Springer, vol. 29(6), pages 1403-1426, December.
    9. Hu, Yingyao & Schennach, Susanne & Shiu, Ji-Liang, 2022. "Identification of nonparametric monotonic regression models with continuous nonclassical measurement errors," Journal of Econometrics, Elsevier, vol. 226(2), pages 269-294.
    10. Daniel Wilhelm, 2018. "Testing for the presence of measurement error," CeMMAP working papers CWP45/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    11. Mochen Yang & Edward McFowland & Gordon Burtch & Gediminas Adomavicius, 2022. "Achieving Reliable Causal Inference with Data-Mined Variables: A Random Forest Approach to the Measurement Error Problem," INFORMS Joural on Data Science, INFORMS, vol. 1(2), pages 138-155, October.
    12. Jooyong Shim & Changha Hwang & Kyungha Seok, 2014. "Composite support vector quantile regression estimation," Computational Statistics, Springer, vol. 29(6), pages 1651-1665, December.
    13. Cawley, John, 2015. "An economy of scales: A selective review of obesity's economic causes, consequences, and solutions," Journal of Health Economics, Elsevier, vol. 43(C), pages 244-268.
    14. Lian, Heng & Meng, Jie & Fan, Zengyan, 2015. "Simultaneous estimation of linear conditional quantiles with penalized splines," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 1-21.
    15. Hao Dong & Yuya Sasaki, 2022. "Estimation of Average Derivatives of Latent Regressors: With an Application to Inference on Buffer-Stock Saving," Papers 2209.05914, arXiv.org.
    16. Lorenzo Almada & Ian McCarthy & Rusty Tchernis, 2016. "What Can We Learn about the Effects of Food Stamps on Obesity in the Presence of Misreporting?," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 98(4), pages 997-1017.
    17. Aurélie Bertrand & Ingrid Van Keilegom & Catherine Legrand, 2019. "Flexible parametric approach to classical measurement error variance estimation without auxiliary data," Biometrics, The International Biometric Society, vol. 75(1), pages 297-307, March.
    18. Shih-Kang Chao & Wolfgang Karl Härdle & Hien Pham-Thu, 2014. "Credit Risk Calibration based on CDS Spreads," SFB 649 Discussion Papers SFB649DP2014-026, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
    19. Park, Jinho, 2017. "Solution path for quantile regression with epsilon-insensitive loss in a reproducing kernel Hilbert space," Statistics & Probability Letters, Elsevier, vol. 126(C), pages 205-211.
    20. Bertrand, Aurelie & Van Keilegom, Ingrid & Legrand, Catherine, 2017. "Flexible parametric approach to classical measurement error variance estimation without auxiliary data," LIDAM Discussion Papers ISBA 2017025, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jrinsu:v:88:y:2021:i:3:p:625-663. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/ariaaea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.