
Weighted relaxed support vector machines

Author

Listed:
  • Onur Şeref

    (Virginia Polytechnic Institute and State University)

  • Talayeh Razzaghi

    (University of Central Florida)

  • Petros Xanthopoulos

    (University of Central Florida)

Abstract

Classification of imbalanced data is challenging when outliers exist. In this paper, we propose a supervised learning method that simultaneously classifies imbalanced data and reduces the influence of outliers. The proposed method is a cost-sensitive extension of relaxed support vector machines (RSVM), in which the restricted penalty-free slack is split independently between the two classes in proportion to the number of samples in each class, with different weights for each class; hence the name weighted relaxed support vector machines (WRSVM). We compare the classification results of WRSVM with those of SVM, WSVM and RSVM on public benchmark datasets with imbalanced classes and outlier noise, and show that WRSVM produces more accurate and robust classification results.
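
The idea in the abstract can be illustrated with a small sketch. It is not the formulation from the paper, which this page does not reproduce. It assumes a fixed linear scorer, a hinge loss, inverse-frequency class weights, and a hypothetical parameter `slack_per_sample` that sets how much penalty-free slack each sample contributes to its class budget. All function names are made up for illustration.

```python
from collections import Counter

def split_free_slack(labels, slack_per_sample):
    """Give each class a free-slack budget proportional to its sample count (assumed rule)."""
    counts = Counter(labels)
    return {c: slack_per_sample * n for c, n in counts.items()}

def inverse_frequency_weights(labels):
    """Assumed cost-sensitive weights: n / (k * n_c), so the rarer class costs more."""
    counts = Counter(labels)
    n, k = len(labels), len(counts)
    return {c: n / (k * m) for c, m in counts.items()}

def penalized_loss(scores, labels, slack_per_sample):
    """Weighted hinge loss after each class spends its own free-slack budget.

    Within one class every sample has the same weight, so the best use of the
    budget leaves max(0, total hinge loss of the class - budget) to be penalized.
    """
    budgets = split_free_slack(labels, slack_per_sample)
    weights = inverse_frequency_weights(labels)
    hinge_total = {c: 0.0 for c in budgets}
    for s, y in zip(scores, labels):
        hinge_total[y] += max(0.0, 1.0 - y * s)
    return sum(weights[c] * max(0.0, hinge_total[c] - budgets[c]) for c in budgets)

# Toy data: 8 majority samples (one outlier scored -3.0) and 2 minority samples.
labels = [1, 1, 1, 1, 1, 1, 1, 1, -1, -1]
scores = [2.0, 2.0, 2.0, 2.0, 2.0, 2.0, 2.0, -3.0, -2.0, 0.5]

print(penalized_loss(scores, labels, slack_per_sample=0.0))  # no free slack
print(penalized_loss(scores, labels, slack_per_sample=0.5))  # per-class free slack
```

In this toy case the majority-class budget absorbs the outlier's violation entirely, while the minority class keeps its own, separately sized budget and its larger weight. In an actual training procedure the classifier and the slack allocation would be optimized jointly rather than evaluated for fixed scores as here.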

Suggested Citation

  • Onur Şeref & Talayeh Razzaghi & Petros Xanthopoulos, 2017. "Weighted relaxed support vector machines," Annals of Operations Research, Springer, vol. 249(1), pages 235-271, February.
  • Handle: RePEc:spr:annopr:v:249:y:2017:i:1:d:10.1007_s10479-014-1711-6
    DOI: 10.1007/s10479-014-1711-6

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-014-1711-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Citations



    Cited by:

    1. Talayeh Razzaghi & Ilya Safro & Joseph Ewing & Ehsan Sadrfaridpour & John D. Scott, 2019. "Predictive models for bariatric surgery risks with imbalanced medical datasets," Annals of Operations Research, Springer, vol. 280(1), pages 1-18, September.
    2. Che Xu & Wenjun Chang & Weiyong Liu, 2023. "Data-driven decision model based on local two-stage weighted ensemble learning," Annals of Operations Research, Springer, vol. 325(2), pages 995-1028, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Orestis P. Panagopoulos & Petros Xanthopoulos & Talayeh Razzaghi & Onur Şeref, 2019. "Relaxed support vector regression," Annals of Operations Research, Springer, vol. 276(1), pages 191-210, May.
    2. Panagopoulos, Orestis P. & Pappu, Vijay & Xanthopoulos, Petros & Pardalos, Panos M., 2016. "Constrained subspace classifier for high dimensional datasets," Omega, Elsevier, vol. 59(PA), pages 40-46.
    3. Ching-Hsin Wang & Feng-Chia Li, 2020. "Economic design under gamma shock model of the control chart for sustainable operations," Annals of Operations Research, Springer, vol. 290(1), pages 169-190, July.
    4. Yen-Chun Chou & Howard Hao-Chun Chuang, 2018. "A predictive investigation of first-time customer retention in online reservation services," Service Business, Springer;Pan-Pacific Business Association, vol. 12(4), pages 685-699, December.
    5. Songul Cinaroglu, 2020. "Modelling unbalanced catastrophic health expenditure data by using machine‐learning methods," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 27(4), pages 168-181, October.
    6. Wolfgang Härdle & Yuh-Jye Lee & Dorothea Schäfer & Yi-Ren Yeh, 2009. "Variable selection and oversampling in the use of smooth support vector machines for predicting the default risk of companies," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 28(6), pages 512-534.
    7. Ximing Wang & Neng Fan & Panos M. Pardalos, 2018. "Robust chance-constrained support vector machines with second-order moment information," Annals of Operations Research, Springer, vol. 263(1), pages 45-68, April.
    8. Talayeh Razzaghi & Oleg Roderick & Ilya Safro & Nicholas Marko, 2016. "Multilevel Weighted Support Vector Machine for Classification on Healthcare Data with Missing Values," PLOS ONE, Public Library of Science, vol. 11(5), pages 1-18, May.
    9. Danijel Bratina & Armand Faganel, 2023. "Using Supervised Machine Learning Methods for RFM Segmentation: A Casino Direct Marketing Communication Case," Tržište/Market, Faculty of Economics and Business, University of Zagreb, vol. 35(1), pages 7-22.
    10. Wolfgang Härdle & Yuh-Jye Lee & Dorothea Schäfer & Yi-Ren Yeh, 2007. "The Default Risk of Firms Examined with Smooth Support Vector Machines," Discussion Papers of DIW Berlin 757, DIW Berlin, German Institute for Economic Research.
    11. Yang, YouLong & Che, JinXing & Li, YanYing & Zhao, YanJun & Zhu, SuLing, 2016. "An incremental electric load forecasting model based on support vector regression," Energy, Elsevier, vol. 113(C), pages 796-808.
    12. C. Chatzinakos & L. Pitsoulis & G. Zioutas, 2016. "Optimization techniques for robust multivariate location and scatter estimation," Journal of Combinatorial Optimization, Springer, vol. 31(4), pages 1443-1460, May.
    13. Petros Xanthopoulos & Mario Guarracino & Panos Pardalos, 2014. "Robust generalized eigenvalue classifier with ellipsoidal uncertainty," Annals of Operations Research, Springer, vol. 216(1), pages 327-342, May.
    14. Saeed Ketabchi & Hossein Moosaei & Mohamad Razzaghi & Panos M. Pardalos, 2019. "An improvement on parametric ν-support vector algorithm for classification," Annals of Operations Research, Springer, vol. 276(1), pages 155-168, May.
    15. Murtaza Nasir & Nichalin Summerfield & Ali Dag & Asil Oztekin, 2020. "A service analytic approach to studying patient no-shows," Service Business, Springer;Pan-Pacific Business Association, vol. 14(2), pages 287-313, June.
    16. Shuguang He & Wei Jiang & Houtao Deng, 2018. "A distance-based control chart for monitoring multivariate processes using support vector machines," Annals of Operations Research, Springer, vol. 263(1), pages 191-207, April.
    17. Emel Şeyma Küçükaşcı & Mustafa Gökçe Baydoğan & Z. Caner Taşkın, 2022. "Multiple instance classification via quadratic programming," Journal of Global Optimization, Springer, vol. 83(4), pages 639-670, August.
    18. Farnè, Matteo & Vouldis, Angelos T., 2018. "A methodology for automised outlier detection in high-dimensional datasets: an application to euro area banks' supervisory data," Working Paper Series 2171, European Central Bank.
    19. Liu, Jianyong & Wang, Yong & Fu, Chengqun & Guo, Jie & Yu, Qin, 2016. "A robust regression based on weighted LSSVM and penalized trimmed squares," Chaos, Solitons & Fractals, Elsevier, vol. 89(C), pages 328-334.
    20. Chunneng Huang & Tianjun Fu & Hsinchun Chen, 2010. "Text‐based video content classification for online video‐sharing sites," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(5), pages 891-906, May.
