
Stacked generalizations in imbalanced fraud data sets using resampling methods

Author

Listed:
  • Kathleen R Kerwin
  • Nathaniel D Bastian

Abstract

Predicting fraud is challenging because of inherent issues in the structure of fraud data: the crimes are committed through trickery or deceit, and the modus operandi is a moving target that keeps changing to circumvent human and system controls. Fraud is also a national security challenge, since criminals continually exploit the electronic financial system to defraud consumers and businesses by finding weaknesses in the system, including in audit controls. This study applies stacked generalization with meta (or super) learners in two steps. In step one, the performance of the base algorithms is improved by minimizing each algorithm's error rate to reduce its bias in the learning set. In step two, the results are input into the meta learner, which produces a stacked, blended output (with the weakest algorithms learning better). A fundamental property of fraud data is that it is inherently not systematic, and an optimal resampling methodology has not yet been identified. A test harness built for all permutations of algorithm and sample-set pairs demonstrates that the complex, intrinsic data structures are all thoroughly tested. A comparative analysis that applies stacked generalization to fraud data provides useful insight into finding the optimal mathematical formula for imbalanced fraud data sets, which is necessary to improve fraud detection for national security.
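The two-step stacking procedure and the test harness over algorithm and sample-set pairs can be illustrated with a minimal, standard-library Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the synthetic data (`make_data`), the decision-stump base learners (`fit_stump`), the lookup-table meta learner inside `fit_stack`, the two random resampling schemes, and the F1 metric. The abstract does not say which algorithms, resampling methods, or metrics the authors used.

```python
import random
from functools import partial
from itertools import product
from statistics import mean


def make_data(n_legit, n_fraud, seed):
    """Synthetic rows (features, label); fraud (label 1) is rare and shifted."""
    rng = random.Random(seed)
    fraud = [([rng.gauss(1.5, 1.0), rng.gauss(1.0, 1.0)], 1) for _ in range(n_fraud)]
    legit = [([rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)], 0) for _ in range(n_legit)]
    return fraud + legit


def split_classes(rows):
    """Return (minority, majority) row lists; both classes are assumed present."""
    pos = [r for r in rows if r[1] == 1]
    neg = [r for r in rows if r[1] == 0]
    return (pos, neg) if len(pos) <= len(neg) else (neg, pos)


def no_resampling(rows, rng):
    return list(rows)


def oversample(rows, rng):
    """Random oversampling: duplicate minority rows until the classes balance."""
    minority, majority = split_classes(rows)
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    return list(rows) + extra


def undersample(rows, rng):
    """Random undersampling: keep only as many majority rows as minority rows."""
    minority, majority = split_classes(rows)
    return minority + rng.sample(majority, len(minority))


def fit_stump(rows, j):
    """Base learner: threshold feature j at the midpoint of the class means."""
    m0 = mean(x[j] for x, y in rows if y == 0)
    m1 = mean(x[j] for x, y in rows if y == 1)
    t = (m0 + m1) / 2
    return lambda x: 1 if (x[j] > t) == (m1 > m0) else 0


def fit_stack(rows, base_fits, k=5):
    """Step one: out-of-fold base predictions. Step two: meta learner on them."""
    votes = {}
    for fold in range(k):
        train = [r for i, r in enumerate(rows) if i % k != fold]
        held_out = [r for i, r in enumerate(rows) if i % k == fold]
        models = [fit(train) for fit in base_fits]
        for x, y in held_out:
            votes.setdefault(tuple(m(x) for m in models), []).append(y)
    # Meta learner: majority label for each pattern of base predictions.
    table = {z: int(2 * sum(ys) >= len(ys)) for z, ys in votes.items()}
    models = [fit(rows) for fit in base_fits]  # refit base learners on all rows
    return lambda x: table.get(tuple(m(x) for m in models), 0)


def f1_score(model, rows):
    """F1 on the fraud class."""
    tp = sum(1 for x, y in rows if y == 1 and model(x) == 1)
    fp = sum(1 for x, y in rows if y == 0 and model(x) == 1)
    fn = sum(1 for x, y in rows if y == 1 and model(x) == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def run_harness(seed=0):
    """Score every (algorithm, resampling method) pair on a held-out test set."""
    train = make_data(n_legit=285, n_fraud=15, seed=seed)
    test = make_data(n_legit=95, n_fraud=5, seed=seed + 1)
    stumps = [partial(fit_stump, j=0), partial(fit_stump, j=1)]
    algorithms = {
        "stump_f0": stumps[0],
        "stump_f1": stumps[1],
        "stacked": partial(fit_stack, base_fits=stumps),
    }
    samplers = {"none": no_resampling, "over": oversample, "under": undersample}
    results = {}
    for (a_name, fit), (s_name, sampler) in product(algorithms.items(), samplers.items()):
        resampled = sampler(train, random.Random(seed))  # resample training data only
        results[(a_name, s_name)] = f1_score(fit(resampled), test)
    return results


if __name__ == "__main__":
    for pair, score in sorted(run_harness().items()):
        print(pair, round(score, 3))
```

Only the training rows are resampled, so the test set keeps its original imbalance. One simplification to be aware of: after oversampling, duplicated minority rows can land in different folds inside `fit_stack`, which makes the out-of-fold predictions somewhat optimistic; a more careful setup would resample inside each fold.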

Suggested Citation

  • Kathleen R Kerwin & Nathaniel D Bastian, 2021. "Stacked generalizations in imbalanced fraud data sets using resampling methods," The Journal of Defense Modeling and Simulation, vol. 18(3), pages 175-192, July.
  • Handle: RePEc:sae:joudef:v:18:y:2021:i:3:p:175-192
    DOI: 10.1177/1548512920962219

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/1548512920962219
    Download Restriction: no

