IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v16y2024i11p414-d1516985.html
   My bibliography  Save this article

An Effective Ensemble Approach for Preventing and Detecting Phishing Attacks in Textual Form

Author

Listed:
  • Zaher Salah

    (Department of Information Technology, Faculty of Prince Al-Hussein Bin Abdullah II for Information Technology, The Hashemite University, Zarqa 13133, Jordan)

  • Hamza Abu Owida

    (Department of Medical Engineering, Faculty of Engineering, Al-Ahliyya Amman University, Ammman 19328, Jordan)

  • Esraa Abu Elsoud

    (Department of Computer Science, Faculty of Information Technology, Zarqa University, Zarqa 13100, Jordan)

  • Esraa Alhenawi

    (Department of Computer Science, Faculty of Information Technology, Zarqa University, Zarqa 13100, Jordan)

  • Suhaila Abuowaida

    (Department of Computer Science, Faculty of Information Technology, Al Al-Bayt University, Mafraq 25113, Jordan)

  • Nawaf Alshdaifat

    (Faculty of Information Technology, Applied Science Private University, Amman 11937, Jordan)

Abstract

Phishing email assaults have been a prevalent cybercriminal tactic for many decades. Various detectors have been suggested over time that rely on textual information. However, to address the growing prevalence of phishing emails, more sophisticated techniques are required to use all aspects of emails to improve the detection capabilities of machine learning classifiers. This paper presents a novel approach to detecting phishing emails. The proposed methodology combines ensemble learning techniques with various variables, such as word frequency, the presence of specific keywords or phrases, and email length, to improve detection accuracy. We provide two approaches for the planned task; The first technique employs ensemble learning soft voting, while the second employs weighted ensemble learning. Both strategies use distinct machine learning algorithms to concurrently process the characteristics, reducing their complexity and enhancing the model’s performance. An extensive assessment and analysis are conducted, considering unique criteria designed to minimize biased and inaccurate findings. Our empirical experiments demonstrates that using ensemble learning to merge attributes in the evolution of phishing emails showcases the competitive performance of ensemble learning over other machine learning algorithms. This superiority is underscored by achieving an F1-score of 0.90 in the weighted ensemble method and 0.85 in the soft voting method, showcasing the effectiveness of this approach.

Suggested Citation

  • Zaher Salah & Hamza Abu Owida & Esraa Abu Elsoud & Esraa Alhenawi & Suhaila Abuowaida & Nawaf Alshdaifat, 2024. "An Effective Ensemble Approach for Preventing and Detecting Phishing Attacks in Textual Form," Future Internet, MDPI, vol. 16(11), pages 1-24, November.
  • Handle: RePEc:gam:jftint:v:16:y:2024:i:11:p:414-:d:1516985
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/16/11/414/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/16/11/414/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Roy, Atin & Chakraborty, Subrata, 2023. "Support vector machine in structural reliability analysis: A review," Reliability Engineering and System Safety, Elsevier, vol. 233(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiaan Zhang & Wenxin Liu & Zhenzhen Wang & Ruiqing Fan, 2024. "Electric Vehicle Power Consumption Modelling Method Based on Improved Ant Colony Optimization-Support Vector Regression," Energies, MDPI, vol. 17(17), pages 1-17, August.
    2. Zhao, Enyong & Wang, Qihan & Alamdari, Mehrisadat Makki & Gao, Wei, 2023. "Advanced virtual model assisted most probable point capturing method for engineering structures," Reliability Engineering and System Safety, Elsevier, vol. 239(C).
    3. Mehmet Kayakuş & Fatma Yiğit Açikgöz & Mirela Nicoleta Dinca & Onder Kabas, 2024. "Sustainable Brand Reputation: Evaluation of iPhone Customer Reviews with Machine Learning and Sentiment Analysis," Sustainability, MDPI, vol. 16(14), pages 1-17, July.
    4. Yu, Shui & Ren, Yuyao & Wu, Xiao & Guo, Peng & Li, Yun, 2024. "Dynamic pruning-based Bayesian support vector regression for reliability analysis," Reliability Engineering and System Safety, Elsevier, vol. 244(C).
    5. Yanlin Wang & Zhijun Cheng & Zichen Wang, 2024. "Multi-Output Bayesian Support Vector Regression Considering Dependent Outputs," Mathematics, MDPI, vol. 12(18), pages 1-20, September.
    6. Yu, Shui & Wu, Xiao & Zhao, Dongyu & Li, Yun, 2024. "A two-level surrogate framework for demand-objective time-variant reliability-based design optimization," Reliability Engineering and System Safety, Elsevier, vol. 244(C).
    7. Li, Guosheng & Ma, Shuaichao & Zhang, Dequan & Yang, Leping & Zhang, Weihua & Wu, Zeping, 2024. "An efficient sequential anisotropic RBF reliability analysis method with fast cross-validation and parallelizability," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    8. Xin, Fukang & Wang, Pan & Wang, Qirui & Li, Lei & Cheng, Lei & Lei, Huajin & Ma, Fangyun, 2024. "Parallel adaptive ensemble of metamodels combined with hypersphere sampling for rare failure events," Reliability Engineering and System Safety, Elsevier, vol. 246(C).
    9. Tian, Jilun & Zhang, Jiusi & Jiang, Yuchen & Wu, Shimeng & Luo, Hao & Yin, Shen, 2024. "A novel generalized source-free domain adaptation approach for cross-domain industrial fault diagnosis," Reliability Engineering and System Safety, Elsevier, vol. 243(C).
    10. Fang, Chen & Xu, You-Lin & Li, Yongle & Li, Jinrong, 2024. "Serviceability analysis of sea-crossing bridges under correlated wind and wave loads," Reliability Engineering and System Safety, Elsevier, vol. 246(C).
    11. Li, Chen & Wen, Jiong-Ran & Wan, Jing & Taylan, Osman & Fei, Cheng-Wei, 2024. "Adaptive directed support vector machine method for the reliability evaluation of aeroengine structure," Reliability Engineering and System Safety, Elsevier, vol. 246(C).
    12. Pei, Pei & Zhou, Tong, 2023. "One-step look-ahead policy for active learning reliability analysis," Reliability Engineering and System Safety, Elsevier, vol. 236(C).
    13. Ouyang, Linhan & Che, Yushuai & Park, Chanseok & Chen, Yuejian, 2024. "A novel active learning Gaussian process modeling-based method for time-dependent reliability analysis considering mixed variables," Reliability Engineering and System Safety, Elsevier, vol. 244(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:16:y:2024:i:11:p:414-:d:1516985. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.