IDEAS home Printed from https://ideas.repec.org/a/spr/telsys/v76y2021i1d10.1007_s11235-020-00739-w.html
   My bibliography  Save this article

Phishing website detection using support vector machines and nature-inspired optimization algorithms

Author

Listed:
  • Sagnik Anupam

    (DPS RK Puram)

  • Arpan Kumar Kar

    (Indian Institute of Technology)

Abstract

Phishing websites are amongst the biggest threats Internet users face today, and existing methods like blacklisting, using SSL certificates, etc. often fail to keep up with the increasing number of threats. This paper aims to utilise different properties of a website URL, and use a machine learning model to classify websites as phishing and non-phishing. These properties include the IP address length, the authenticity of the HTTPs request being sent by the website, usage of pop-up windows to enter data, Server Form Handler status, etc. A Support Vector Machine binary classifier trained on an existing dataset has been used to predict if a website was a legitimate website or not, by finding an optimum hyperplane to separate the two categories. This optimum hyperplane is found with the help of four optimization algorithms, the Bat Algorithm, the Firefly Algorithm, the Grey Wolf Optimiser algorithm and the Whale Optimization Algorithm, which are inspired by various natural phenomena. Amongst the four nature-inspired optimization algorithms, it has been determined that the Grey Wolf Optimiser algorithm’s performance is significantly better than that of the Firefly Algorithm, but there is no significant difference while comparing the performance of any other pair of algorithms. However, all four nature-inspired optimization algorithms perform significantly better than the grid-search optimized Random Forest classifier model described in earlier research.

Suggested Citation

  • Sagnik Anupam & Arpan Kumar Kar, 2021. "Phishing website detection using support vector machines and nature-inspired optimization algorithms," Telecommunication Systems: Modelling, Analysis, Design and Management, Springer, vol. 76(1), pages 17-32, January.
  • Handle: RePEc:spr:telsys:v:76:y:2021:i:1:d:10.1007_s11235-020-00739-w
    DOI: 10.1007/s11235-020-00739-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11235-020-00739-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11235-020-00739-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Laura Auria & Rouslan A. Moro, 2008. "Support Vector Machines (SVM) as a Technique for Solvency Analysis," Discussion Papers of DIW Berlin 811, DIW Berlin, German Institute for Economic Research.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhang Ling & Zhang Jia Hao, 2022. "Intrusion Detection Using Normalized Mutual Information Feature Selection and Parallel Quantum Genetic Algorithm," International Journal on Semantic Web and Information Systems (IJSWIS), IGI Global, vol. 18(1), pages 1-24, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aurelia Rybak & Aleksandra Rybak & Spas D. Kolev, 2023. "Modeling the Photovoltaic Power Generation in Poland in the Light of PEP2040: An Application of Multiple Regression," Energies, MDPI, vol. 16(22), pages 1-17, November.
    2. Kyuhan Lee & Jinsoo Park & Iljoo Kim & Youngseok Choi, 2018. "Predicting movie success with machine learning techniques: ways to improve accuracy," Information Systems Frontiers, Springer, vol. 20(3), pages 577-588, June.
    3. Mostafaei, Kamran & maleki, Shaho & Zamani Ahmad Mahmoudi, Mohammad & Knez, Dariusz, 2022. "Risk management prediction of mining and industrial projects by support vector machine," Resources Policy, Elsevier, vol. 78(C).
    4. Yoon-Joo Park, 2018. "Predicting the Helpfulness of Online Customer Reviews across Different Product Types," Sustainability, MDPI, vol. 10(6), pages 1-20, May.
    5. Abdulkadir Atalan, 2023. "Forecasting drinking milk price based on economic, social, and environmental factors using machine learning algorithms," Agribusiness, John Wiley & Sons, Ltd., vol. 39(1), pages 214-241, January.
    6. Junlong Zhang & Youbin He & Yuan Zhang & Weifeng Li & Junjie Zhang, 2022. "Well-Logging-Based Lithology Classification Using Machine Learning Methods for High-Quality Reservoir Identification: A Case Study of Baikouquan Formation in Mahu Area of Junggar Basin, NW China," Energies, MDPI, vol. 15(10), pages 1-15, May.
    7. Ni, Ji & Chen, Bowei & Allinson, Nigel M. & Ye, Xujiong, 2020. "A hybrid model for predicting human physical activity status from lifelogging data," European Journal of Operational Research, Elsevier, vol. 281(3), pages 532-542.
    8. Petrakova Aleksandra & Merkurjeva Galina & Affenzeller Michael, 2015. "Heterogeneous versus Homogeneous Machine Learning Ensembles," Information Technology and Management Science, Sciendo, vol. 18(1), pages 135-140, December.
    9. Mukhtar Sani & Maxime Piffard & Vincent Heiries, 2023. "Fault Detection for PEM Fuel Cells via Analytical Redundancy: A Critical Review and Prospects," Energies, MDPI, vol. 16(14), pages 1-16, July.
    10. Wang, Xinlin & Ahn, Sung-Hoon, 2020. "Real-time prediction and anomaly detection of electrical load in a residential community," Applied Energy, Elsevier, vol. 259(C).
    11. Söhnke M. Bartram & Jürgen Branke & Mehrshad Motahari, 2020. "Artificial intelligence in asset management," Working Papers 20202001, Cambridge Judge Business School, University of Cambridge.
    12. Hemraj Verma & Garima Verma, 2020. "Prediction Model for Bollywood Movie Success: A Comparative Analysis of Performance of Supervised Machine Learning Algorithms," The Review of Socionetwork Strategies, Springer, vol. 14(1), pages 1-17, April.
    13. Thierry Delahaye & Rodrigo Acuna-Agost & Nicolas Bondoux & Anh-Quan Nguyen & Mourad Boudia, 2017. "Data-driven models for itinerary preferences of air travelers and application for dynamic pricing optimization," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 16(6), pages 621-639, December.
    14. Yee-Fan Tan & Lee-Yeng Ong & Meng-Chew Leow & Yee-Xian Goh, 2021. "Exploring Time-Series Forecasting Models for Dynamic Pricing in Digital Signage Advertising," Future Internet, MDPI, vol. 13(10), pages 1-24, September.
    15. Nehrebecka Natalia, 2018. "Predicting the Default Risk of Companies. Comparison of Credit Scoring Models: Logit Vs Support Vector Machines," Econometrics. Advances in Applied Data Analysis, Sciendo, vol. 22(2), pages 54-73, June.
    16. Changju Lee & Sunghoon Lee, 2022. "Exploring the Contributions by Transportation Features to Urban Economy: An Experiment of a Scalable Tree-Boosting Algorithm with Big Data," Land, MDPI, vol. 11(4), pages 1-30, April.
    17. Amarda Cano, 2020. "Evolution of Public Debt in Albania during 1990-2017 and its impact on the Economic Growth," European Journal of Marketing and Economics Articles, Revistia Research and Publishing, vol. 4, January -.
    18. Swati Anand & Kushendra Mishra, 2022. "Identifying potential millennial customers for financial institutions using SVM," Journal of Financial Services Marketing, Palgrave Macmillan, vol. 27(4), pages 335-345, December.
    19. Hossein Hassani & Emmanuel S. Silva & Marine Combe & Demetra Andreou & Mansi Ghodsi & Mohammad Reza Yeganegi & Rodolphe E. Gozlan, 2019. "A Support Vector Machine Based Approach for Predicting the Risk of Freshwater Disease Emergence in England," Stats, MDPI, vol. 2(1), pages 1-15, February.
    20. Kyuhan Lee & Jinsoo Park & Iljoo Kim & Youngseok Choi, 0. "Predicting movie success with machine learning techniques: ways to improve accuracy," Information Systems Frontiers, Springer, vol. 0, pages 1-12.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:telsys:v:76:y:2021:i:1:d:10.1007_s11235-020-00739-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.