IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v249y2017i1d10.1007_s10479-014-1711-6.html
   My bibliography  Save this article

Weighted relaxed support vector machines

Author

Listed:
  • Onur Şeref

    (Virginia Polytechnic Institute and State University)

  • Talayeh Razzaghi

    (University of Central Florida)

  • Petros Xanthopoulos

    (University of Central Florida)

Abstract

Classification of imbalanced data is challenging when outliers exist. In this paper, we propose a supervised learning method to simultaneously classify imbalanced data and reduce the influence of outliers. The proposed method is a cost-sensitive extension of the relaxed support vector machines (RSVM), where the restricted penalty free-slack is split independently between the two classes in proportion to the number samples in each class with different weights, hence given the name weighted relaxed support vector machines (WRSVM). We compare classification results of WRSVM with SVM, WSVM and RSVM on public benchmark datasets with imbalanced classes and outlier noise, and show that WRSVM produces more accurate and robust classification results.

Suggested Citation

  • Onur Şeref & Talayeh Razzaghi & Petros Xanthopoulos, 2017. "Weighted relaxed support vector machines," Annals of Operations Research, Springer, vol. 249(1), pages 235-271, February.
  • Handle: RePEc:spr:annopr:v:249:y:2017:i:1:d:10.1007_s10479-014-1711-6
    DOI: 10.1007/s10479-014-1711-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-014-1711-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-014-1711-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hsinchun Chen & Ganesan Shankaranarayanan & Linlin She & Anand Iyer, 1998. "A machine learning approach to inductive query by examples: An experiment using relevance feedback, ID3, genetic algorithms, and simulated annealing," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 49(8), pages 693-705.
    2. Gitae Kim & Bongsug Chae & David Olson, 2013. "A support vector machine (SVM) approach to imbalanced datasets of customer responses: comparison with other customer response models," Service Business, Springer;Pan-Pacific Business Association, vol. 7(1), pages 167-182, March.
    3. Huang, Chien-Ming & Lee, Yuh-Jye & Lin, Dennis K.J. & Huang, Su-Yun, 2007. "Model selection for support vector machines via uniform design," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 335-346, September.
    4. Mohammad Poursaeidi & O. Kundakcioglu, 2014. "Robust support vector machines for multiple instance learning," Annals of Operations Research, Springer, vol. 216(1), pages 205-227, May.
    5. Claudio Cifarelli & Mario R. Guarracino & Onur Seref & Salvatore Cuciniello & Panos M. Pardalos, 2007. "Incremental Classification with Generalized Eigenvalues," Journal of Classification, Springer;The Classification Society, vol. 24(2), pages 205-219, September.
    6. Petros Xanthopoulos & Mario Guarracino & Panos Pardalos, 2014. "Robust generalized eigenvalue classifier with ellipsoidal uncertainty," Annals of Operations Research, Springer, vol. 216(1), pages 327-342, May.
    7. G. Zioutas & L. Pitsoulis & A. Avramidis, 2009. "Quadratic mixed integer programming and support vectors for deleting outliers in robust regression," Annals of Operations Research, Springer, vol. 166(1), pages 339-353, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Talayeh Razzaghi & Ilya Safro & Joseph Ewing & Ehsan Sadrfaridpour & John D. Scott, 2019. "Predictive models for bariatric surgery risks with imbalanced medical datasets," Annals of Operations Research, Springer, vol. 280(1), pages 1-18, September.
    2. Che Xu & Wenjun Chang & Weiyong Liu, 2023. "Data-driven decision model based on local two-stage weighted ensemble learning," Annals of Operations Research, Springer, vol. 325(2), pages 995-1028, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Orestis P. Panagopoulos & Petros Xanthopoulos & Talayeh Razzaghi & Onur Şeref, 2019. "Relaxed support vector regression," Annals of Operations Research, Springer, vol. 276(1), pages 191-210, May.
    2. Panagopoulos, Orestis P. & Pappu, Vijay & Xanthopoulos, Petros & Pardalos, Panos M., 2016. "Constrained subspace classifier for high dimensional datasets," Omega, Elsevier, vol. 59(PA), pages 40-46.
    3. Ching-Hsin Wang & Feng-Chia Li, 2020. "Economic design under gamma shock model of the control chart for sustainable operations," Annals of Operations Research, Springer, vol. 290(1), pages 169-190, July.
    4. Yen-Chun Chou & Howard Hao-Chun Chuang, 2018. "A predictive investigation of first-time customer retention in online reservation services," Service Business, Springer;Pan-Pacific Business Association, vol. 12(4), pages 685-699, December.
    5. Songul Cinaroglu, 2020. "Modelling unbalanced catastrophic health expenditure data by using machine‐learning methods," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 27(4), pages 168-181, October.
    6. Wolfgang Härdle & Yuh-Jye Lee & Dorothea Schäfer & Yi-Ren Yeh, 2009. "Variable selection and oversampling in the use of smooth support vector machines for predicting the default risk of companies," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 28(6), pages 512-534.
    7. Ximing Wang & Neng Fan & Panos M. Pardalos, 2018. "Robust chance-constrained support vector machines with second-order moment information," Annals of Operations Research, Springer, vol. 263(1), pages 45-68, April.
    8. Yang, YouLong & Che, JinXing & Li, YanYing & Zhao, YanJun & Zhu, SuLing, 2016. "An incremental electric load forecasting model based on support vector regression," Energy, Elsevier, vol. 113(C), pages 796-808.
    9. Petros Xanthopoulos & Mario Guarracino & Panos Pardalos, 2014. "Robust generalized eigenvalue classifier with ellipsoidal uncertainty," Annals of Operations Research, Springer, vol. 216(1), pages 327-342, May.
    10. Shuguang He & Wei Jiang & Houtao Deng, 2018. "A distance-based control chart for monitoring multivariate processes using support vector machines," Annals of Operations Research, Springer, vol. 263(1), pages 191-207, April.
    11. Chunneng Huang & Tianjun Fu & Hsinchun Chen, 2010. "Text‐based video content classification for online video‐sharing sites," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(5), pages 891-906, May.
    12. M. R. Guarracino & S. Cuciniello & P. M. Pardalos, 2009. "Classification and Characterization of Gene Expression Data with Generalized Eigenvalues," Journal of Optimization Theory and Applications, Springer, vol. 141(3), pages 533-545, June.
    13. I. Edhem Sakarya & O. Erhun Kundakcioglu, 2023. "Multi-instance learning by maximizing the area under receiver operating characteristic curve," Journal of Global Optimization, Springer, vol. 85(2), pages 351-375, February.
    14. Luca Insolia & Ana Kenney & Francesca Chiaromonte & Giovanni Felici, 2022. "Simultaneous feature selection and outlier detection with optimality guarantees," Biometrics, The International Biometric Society, vol. 78(4), pages 1592-1603, December.
    15. Mike G. Tsionas, 2021. "Multi-criteria optimization in regression," Annals of Operations Research, Springer, vol. 306(1), pages 7-25, November.
    16. De Brabanter, K. & De Brabanter, J. & Suykens, J.A.K. & De Moor, B., 2010. "Optimized fixed-size kernel models for large data sets," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1484-1504, June.
    17. Wolfgang Härdle & Yuh-Jye Lee & Dorothea Schäfer & Yi-Ren Yeh, 2007. "The Default Risk of Firms Examined with Smooth Support Vector Machines," Discussion Papers of DIW Berlin 757, DIW Berlin, German Institute for Economic Research.
    18. Chuang, S.C. & Hung, Y.C., 2010. "Uniform design over general input domains with applications to target region estimation in computer experiments," Computational Statistics & Data Analysis, Elsevier, vol. 54(1), pages 219-232, January.
    19. Kazim Topuz & Hasmet Uner & Asil Oztekin & Mehmet Bayram Yildirim, 2018. "Predicting pediatric clinic no-shows: a decision analytic framework using elastic net and Bayesian belief network," Annals of Operations Research, Springer, vol. 263(1), pages 479-499, April.
    20. Vera L. Miguéis & Ana S. Camanho & José Borges, 2017. "Predicting direct marketing response in banking: comparison of class imbalance methods," Service Business, Springer;Pan-Pacific Business Association, vol. 11(4), pages 831-849, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:249:y:2017:i:1:d:10.1007_s10479-014-1711-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.