IDEAS home Printed from https://ideas.repec.org/p/hhs/nhhfms/2020_015.html
   My bibliography  Save this paper

Fraud detection by a multinomial model: Separating honesty from unobserved fraud

Author

Listed:
  • Andersson, Jonas

    (Dept. of Business and Management Science, Norwegian School of Economics)

  • Olden, Andreas

    (Dept. of Business and Management Science, Norwegian School of Economics)

  • Rusina, Aija

    (Dept. of Business and Management Science, Norwegian School of Economics)

Abstract

In this paper we investigate the EM-estimator of the model by Caudill et al. (2005). The purpose of the model is to identify items, e.g. individuals or companies, that are wrongly classified as honest; an example of this is the detection of tax evasion. Normally, we observe two groups of items, labeled fraudulent and honest, but suspect that many of the observationally honest items are, in fact, fraudulent. The items observed as honest are therefore divided into two unobserved groups, honestH, representing the truly honest, and honestF, representing the items that are observed as honest, but that are actually fraudulent. By using a multinomial logit model and assuming commonality between the observed fraudulent and the unobserved honestF, Caudill et al. (2005) present a method that uses the EM-algorithm to separate them. By means of a Monte Carlo study, we investigate how well the method performs, and under what circumstances. We also study how well bootstrapped standard errors estimates the standard deviation of the parameter estimators.

Suggested Citation

  • Andersson, Jonas & Olden, Andreas & Rusina, Aija, 2020. "Fraud detection by a multinomial model: Separating honesty from unobserved fraud," Discussion Papers 2020/15, Norwegian School of Economics, Department of Business and Management Science.
  • Handle: RePEc:hhs:nhhfms:2020_015
    as

    Download full text from publisher

    File URL: https://hdl.handle.net/11250/2721233
    File Function: Full text
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jing Ai & Patrick L. Brockett & Linda L. Golden & Montserrat Guillén, 2013. "A Robust Unsupervised Method for Fraud Rate Estimation," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 80(1), pages 121-143, March.
    2. Alan D. Olinsky & Paul M. Mangiameli & Shaw K. Chen, 1996. "Statistical Support of Forensic Auditing," Interfaces, INFORMS, vol. 26(6), pages 95-104, December.
    3. Hausman, J. A. & Abrevaya, Jason & Scott-Morton, F. M., 1998. "Misclassification of the dependent variable in a discrete-response setting," Journal of Econometrics, Elsevier, vol. 87(2), pages 239-269, September.
    4. Jing Ai & Patrick Brockett & Linda Golden, 2009. "Assessing Consumer Fraud Risk in Insurance Claims," North American Actuarial Journal, Taylor & Francis Journals, vol. 13(4), pages 438-458.
    5. Steven B. Caudill & Mercedes Ayuso & Montserrat Guillén, 2005. "Fraud Detection Using a Multinomial Logit Model With Missing Information," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 72(4), pages 539-550, December.
    6. Artis, Manuel & Ayuso, Mercedes & Guillen, Montserrat, 1999. "Modelling different types of automobile insurance fraud behaviour in the Spanish market," Insurance: Mathematics and Economics, Elsevier, vol. 24(1-2), pages 67-81, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Urbina, Jilber & Guillén, Montserrat, 2013. "An application of capital allocation principles to operational risk," MPRA Paper 75726, University Library of Munich, Germany, revised Dec 2013.
    2. Jing Ai & Patrick L. Brockett & Linda L. Golden & Montserrat Guillén, 2013. "A Robust Unsupervised Method for Fraud Rate Estimation," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 80(1), pages 121-143, March.
    3. Pulina, Manuela & Paba, Antonello, 2010. "A discrete choice approach to model credit card fraud," MPRA Paper 20019, University Library of Munich, Germany.
    4. Ming-Jyh Wang & Chieh-Hua Wen & Lawrence W Lan, 2010. "Modelling Different Types of Bundled Automobile Insurance Choice Behaviour: The Case of Taiwan*," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 35(2), pages 290-308, April.
    5. Steven B. Caudill & Mercedes Ayuso & Montserrat Guillén, 2005. "Fraud Detection Using a Multinomial Logit Model With Missing Information," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 72(4), pages 539-550, December.
    6. Bermúdez, Ll. & Pérez, J.M. & Ayuso, M. & Gómez, E. & Vázquez, F.J., 2008. "A Bayesian dichotomous model with asymmetric link for fraud in insurance," Insurance: Mathematics and Economics, Elsevier, vol. 42(2), pages 779-786, April.
    7. Denisa BANULESCU-RADU & Meryem YANKOL-SCHALCK, 2021. "Fraud detection in the era of Machine Learning: a household insurance case," LEO Working Papers / DR LEO 2904, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    8. Nguimkeu, Pierre & Denteh, Augustine & Tchernis, Rusty, 2019. "On the estimation of treatment effects with endogenous misreporting," Journal of Econometrics, Elsevier, vol. 208(2), pages 487-506.
    9. Adele Bergin, 2015. "Employer Changes and Wage Changes: Estimation with Measurement Error in a Binary Variable," LABOUR, CEIS, vol. 29(2), pages 194-223, June.
    10. Kocięcki, Andrzej & Kolasa, Marcin, 2023. "A solution to the global identification problem in DSGE models," Journal of Econometrics, Elsevier, vol. 236(2).
    11. John K. Dagsvik & Zhiyang Jia, 2016. "Labor Supply as a Choice Among Latent Jobs: Unobserved Heterogeneity and Identification," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(3), pages 487-506, April.
    12. Esmeralda Ramalho, 2004. "Covariate Measurement Error in Endogenous Stratified Samples," Economics Working Papers 2_2004, University of Évora, Department of Economics (Portugal).
    13. Acerenza, Santiago & Ban, Kyunghoon & Kedagni, Desire, 2021. "Marginal Treatment Effects with Misclassified Treatment," ISU General Staff Papers 202106180700001132, Iowa State University, Department of Economics.
    14. Donal O'Neill & Olive Sweetman, 2013. "Estimating Obesity Rates in Europe in the Presence of Self-Reporting Errors," Economics Department Working Paper Series n236-13.pdf, Department of Economics, National University of Ireland - Maynooth.
    15. Yuko Mori & Takashi Kurosaki, 2011. "Does Political Reservation Affect Voting Behavior? Empirical Evidence from India," Global COE Hi-Stat Discussion Paper Series gd11-205, Institute of Economic Research, Hitotsubashi University.
    16. van Soest, Arthur & Hurd, Michael, 2008. "A Test for Anchoring and Yea-Saying in Experimental Consumption Data," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 126-136, March.
    17. Kyung Min Kang & Robert A. Moffitt, 2019. "The Effect of SNAP and School Food Programs on Food Security, Diet Quality, and Food Spending: Sensitivity to Program Reporting Error," Southern Economic Journal, John Wiley & Sons, vol. 86(1), pages 156-201, July.
    18. Maximiliano Dvorkin, 2021. "International trade and labor reallocation: misclassification errors, mobility, and switching costs," Working Papers 2021-014, Federal Reserve Bank of St. Louis, revised Dec 2023.
    19. Ranney, Christine K. & Gomez, Miguel I., 2010. "Food Stamps, Food Insufficiency and Health of the Elderly," Working Papers 126968, Cornell University, Department of Applied Economics and Management.
    20. Yokoo, Hide-Fumi & Arimura, Toshi H. & Chattopadhyay, Mriduchhanda & Katayama, Hajime, 2023. "Subjective risk belief function in the field: Evidence from cooking fuel choices and health in India," Journal of Development Economics, Elsevier, vol. 161(C).

    More about this item

    Keywords

    Fraud detection; EM-algorithm; multinomial logit model; Monte Carlo study;
    All these keywords.

    JEL classification:

    • C00 - Mathematical and Quantitative Methods - - General - - - General
    • C10 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hhs:nhhfms:2020_015. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Stein Fossen (email available below). General contact details of provider: https://edirc.repec.org/data/dfnhhno.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.