IDEAS home Printed from https://ideas.repec.org/a/spr/infosf/v12y2010i2d10.1007_s10796-008-9131-2.html
   My bibliography  Save this article

An SVM-based machine learning method for accurate internet traffic classification

Author

Listed:
  • Ruixi Yuan

    (Tsinghua University)

  • Zhu Li

    (Tsinghua University)

  • Xiaohong Guan

    (Tsinghua University
    Xi’an Jiaotong University)

  • Li Xu

    (Beijing Jiaotong University
    Old Dominion University)

Abstract

Accurate and timely traffic classification is critical in network security monitoring and traffic engineering. Traditional methods based on port numbers and protocols have proven to be ineffective in terms of dynamic port allocation and packet encapsulation. The signature matching methods, on the other hand, require a known signature set and processing of packet payload, can only handle the signatures of a limited number of IP packets in real-time. A machine learning method based on SVM (supporting vector machine) is proposed in this paper for accurate Internet traffic classification. The method classifies the Internet traffic into broad application categories according to the network flow parameters obtained from the packet headers. An optimized feature set is obtained via multiple classifier selection methods. Experimental results using traffic from campus backbone show that an accuracy of 99.42% is achieved with the regular biased training and testing samples. An accuracy of 97.17% is achieved when un-biased training and testing samples are used with the same feature set. Furthermore, as all the feature parameters are computable from the packet headers, the proposed method is also applicable to encrypted network traffic.

Suggested Citation

  • Ruixi Yuan & Zhu Li & Xiaohong Guan & Li Xu, 2010. "An SVM-based machine learning method for accurate internet traffic classification," Information Systems Frontiers, Springer, vol. 12(2), pages 149-156, April.
  • Handle: RePEc:spr:infosf:v:12:y:2010:i:2:d:10.1007_s10796-008-9131-2
    DOI: 10.1007/s10796-008-9131-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-008-9131-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10796-008-9131-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ling Li & Ricardo Valerdi & John N. Warfield, 2008. "Advances in enterprise information systems," Information Systems Frontiers, Springer, vol. 10(5), pages 499-501, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wendi Zhang & Bin Li & Alan Wee-Chung Liew & Eduardo Roca & Tarlok Singh, 2023. "Predicting the returns of the US real estate investment trust market: evidence from the group method of data handling neural network," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-33, December.
    2. Xueling Li & Yujie Long & Meixi Fan & Yong Chen, 2022. "Drilling down artificial intelligence in entrepreneurial management: A bibliometric perspective," Systems Research and Behavioral Science, Wiley Blackwell, vol. 39(3), pages 379-396, May.
    3. Hong Jiang & Jinlong Gai & Shukuan Zhao & Peggy E. Chaudhry & Sohail S. Chaudhry, 2022. "Applications and development of artificial intelligence system from the perspective of system science: A bibliometric review," Systems Research and Behavioral Science, Wiley Blackwell, vol. 39(3), pages 361-378, May.
    4. Shouhui Pan & Li Wang & Kaiyi Wang & Zhuming Bi & Siqing Shan & Bo Xu, 2014. "A Knowledge Engineering Framework for Identifying Key Impact Factors from Safety‐Related Accident Cases," Systems Research and Behavioral Science, Wiley Blackwell, vol. 31(3), pages 383-397, May.
    5. Doruk Şen & Cem Çağrı Dönmez & Umman Mahir Yıldırım, 0. "A Hybrid Bi-level Metaheuristic for Credit Scoring," Information Systems Frontiers, Springer, vol. 0, pages 1-11.
    6. Ajaya K. Swain & Valeria R. Garza, 2023. "Key Factors in Achieving Service Level Agreements (SLA) for Information Technology (IT) Incident Resolution," Information Systems Frontiers, Springer, vol. 25(2), pages 819-834, April.
    7. Doruk Şen & Cem Çağrı Dönmez & Umman Mahir Yıldırım, 2020. "A Hybrid Bi-level Metaheuristic for Credit Scoring," Information Systems Frontiers, Springer, vol. 22(5), pages 1009-1019, October.
    8. Chulhwan Chris Bang, 2015. "Information systems frontiers: Keyword analysis and classification," Information Systems Frontiers, Springer, vol. 17(1), pages 217-237, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chulhwan Chris Bang, 2015. "Information systems frontiers: Keyword analysis and classification," Information Systems Frontiers, Springer, vol. 17(1), pages 217-237, February.
    2. Fang Liu & Zhuming Bi & Eric L. Xu & Qin Ga & Quanyu Yang & Yingzhong Yang & Lan Ma & Tana Wuren & Rili Ge, 2015. "An integrated systems approach to plateau ecosystem management—a scientific application in Qinghai and Tibet plateau," Information Systems Frontiers, Springer, vol. 17(2), pages 337-350, April.
    3. Petter Gottschalk, 2010. "Knowledge management technology for organized crime risk assessment," Information Systems Frontiers, Springer, vol. 12(3), pages 267-275, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infosf:v:12:y:2010:i:2:d:10.1007_s10796-008-9131-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.