IDEAS home Printed from https://ideas.repec.org/a/wsi/jikmxx/v15y2016i04ns0219649216500428.html
   My bibliography  Save this article

Deriving Correlated Sets of Website Features for Phishing Detection: A Computational Intelligence Approach

Author

Listed:
  • Fadi Thabtah

    (Applied Business and Computing, Nelson Marlborough Institute of Technology, Auckland, New Zealand)

  • Neda Abdelhamid

    (Information Technology, Auckland Institute of Studies, Auckland, New Zealand)

Abstract

Classification is one of the major tasks in data mining which aims to build classifiers for decision making. One of the most recent online threats is phishing, which has caused significant losses to online shoppers, electronic businesses and financial institutions. A common way of phishing is impersonating online websites to deceive online users and steal their financial information. One way to guide the anti-phishing classification method is to preliminarily identify a minimal set of related features so the search space can be reduced. The aim of this paper is to compare different features assessment techniques in the website phishing context in order to determine the minimal set of features for detecting phishing activities. Experimental results on real phishing datasets consisting of 30 features has been conducted using three known features selection methods. New features cutoffs have been identified after statistical analysis utilising three data mining classification methods. We have been able to identify new clusters of features that when used together are able to detect phishing activities. Further, important correlations among common features have been derived.

Suggested Citation

  • Fadi Thabtah & Neda Abdelhamid, 2016. "Deriving Correlated Sets of Website Features for Phishing Detection: A Computational Intelligence Approach," Journal of Information & Knowledge Management (JIKM), World Scientific Publishing Co. Pte. Ltd., vol. 15(04), pages 1-17, December.
  • Handle: RePEc:wsi:jikmxx:v:15:y:2016:i:04:n:s0219649216500428
    DOI: 10.1142/S0219649216500428
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219649216500428
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219649216500428?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Anthony Gramaje & Fadi Thabtah & Neda Abdelhamid & Sayan Kumar Ray, 2021. "Patient Discharge Classification Using Machine Learning Techniques," Annals of Data Science, Springer, vol. 8(4), pages 755-767, December.
    2. Fadi Thabtah & Li Zhang & Neda Abdelhamid, 2019. "NBA Game Result Prediction Using Feature Analysis and Machine Learning," Annals of Data Science, Springer, vol. 6(1), pages 103-116, March.
    3. Majed Rajab, 2019. "Visualisation Model Based on Phishing Features," Journal of Information & Knowledge Management (JIKM), World Scientific Publishing Co. Pte. Ltd., vol. 18(01), pages 1-17, March.
    4. Firuz Kamalov & Fadi Thabtah, 2017. "A Feature Selection Method Based on Ranked Vector Scores of Features for Classification," Annals of Data Science, Springer, vol. 4(4), pages 483-502, December.
    5. Fadi Thabtah & Firuz Kamalov, 2017. "Phishing Detection: A Case Analysis on Classifiers with Rules Using Machine Learning," Journal of Information & Knowledge Management (JIKM), World Scientific Publishing Co. Pte. Ltd., vol. 16(04), pages 1-16, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:jikmxx:v:15:y:2016:i:04:n:s0219649216500428. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/jikm/jikm.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.