A novel embedded min-max approach for feature selection in nonlinear Support Vector Machine classification

Authors
  • Jiménez-Cordero, Asunción
  • Morales, Juan Miguel
  • Pineda, Salvador

Abstract

In recent years, feature selection has become a challenging problem in several machine learning fields, such as classification problems. Support Vector Machine (SVM) is a well-known technique applied in classification tasks. Various methodologies have been proposed in the literature to select the most relevant features in SVM. Unfortunately, all of them either deal with the feature selection problem in the linear classification setting or propose ad-hoc approaches that are difficult to implement in practice. In contrast, we propose an embedded feature selection method based on a min-max optimization problem, where a trade-off between model complexity and classification accuracy is sought. By leveraging duality theory, we equivalently reformulate the min-max problem and solve it without further ado using off-the-shelf software for nonlinear optimization. The efficiency and usefulness of our approach are tested on several benchmark data sets in terms of accuracy, number of selected features and interpretability.

Suggested Citation

  • Jiménez-Cordero, Asunción & Morales, Juan Miguel & Pineda, Salvador, 2021. "A novel embedded min-max approach for feature selection in nonlinear Support Vector Machine classification," European Journal of Operational Research, Elsevier, vol. 293(1), pages 24-35.
  • Handle: RePEc:eee:ejores:v:293:y:2021:i:1:p:24-35
    DOI: 10.1016/j.ejor.2020.12.009

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221720310195
    Download Restriction: Full text for ScienceDirect subscribers only

    References listed on IDEAS

    1. Ghaddar, Bissan & Naoum-Sawaya, Joe, 2018. "High dimensional data classification and feature selection using support vector machines," European Journal of Operational Research, Elsevier, vol. 265(3), pages 993-1004.
    2. Blanquero, R. & Carrizosa, E. & Jiménez-Cordero, A. & Martín-Barragán, B., 2019. "Functional-bandwidth kernel for Support Vector Machine with Functional Data: An alternating optimization algorithm," European Journal of Operational Research, Elsevier, vol. 275(1), pages 195-207.
    3. Bertolazzi, P. & Felici, G. & Festa, P. & Fiscon, G. & Weitschek, E., 2016. "Integer programming models for feature selection: New extensions and a randomized solution algorithm," European Journal of Operational Research, Elsevier, vol. 250(2), pages 389-399.
    4. Li, An-Da & He, Zhen & Wang, Qing & Zhang, Yang, 2019. "Key quality characteristics selection for imbalanced production data using a two-phase bi-objective feature selection method," European Journal of Operational Research, Elsevier, vol. 274(3), pages 978-989.

    Citations

    Cited by:

    1. Mi, Yunlong & Quan, Pei & Shi, Yong & Wang, Zongrun, 2022. "Concept-cognitive computing system for dynamic classification," European Journal of Operational Research, Elsevier, vol. 301(1), pages 287-299.
    2. Yang, Dongchuan & Guo, Ju-e & Li, Yanzhao & Sun, Shaolong & Wang, Shouyang, 2023. "Short-term load forecasting with an improved dynamic decomposition-reconstruction-ensemble approach," Energy, Elsevier, vol. 263(PA).
    3. Labbé, Martine & Landete, Mercedes & Leal, Marina, 2023. "Dendrograms, minimum spanning trees and feature selection," European Journal of Operational Research, Elsevier, vol. 308(2), pages 555-567.
    4. Goodell, John W. & Ben Jabeur, Sami & Saâdaoui, Foued & Nasir, Muhammad Ali, 2023. "Explainable artificial intelligence modeling to forecast bitcoin prices," International Review of Financial Analysis, Elsevier, vol. 88(C).
    5. Díaz, Verónica & Montoya, Ricardo & Maldonado, Sebastián, 2023. "Preference estimation under bounded rationality: Identification of attribute non-attendance in stated-choice data using a support vector machines approach," European Journal of Operational Research, Elsevier, vol. 304(2), pages 797-812.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiang, He & Tao, Changqi & Dong, Yao & Xiong, Ren, 2021. "Robust low-rank multiple kernel learning with compound regularization," European Journal of Operational Research, Elsevier, vol. 295(2), pages 634-647.
    2. Li, An-Da & He, Zhen & Wang, Qing & Zhang, Yang, 2019. "Key quality characteristics selection for imbalanced production data using a two-phase bi-objective feature selection method," European Journal of Operational Research, Elsevier, vol. 274(3), pages 978-989.
    3. Zhang, Yishi & Zhu, Ruilin & Chen, Zhijun & Gao, Jie & Xia, De, 2021. "Evaluating and selecting features via information theoretic lower bounds of feature inner correlations for high-dimensional data," European Journal of Operational Research, Elsevier, vol. 290(1), pages 235-247.
    4. Gao, Zheming & Fang, Shu-Cherng & Luo, Jian & Medhin, Negash, 2021. "A kernel-free double well potential support vector machine with applications," European Journal of Operational Research, Elsevier, vol. 290(1), pages 248-262.
    5. Gambella, Claudio & Ghaddar, Bissan & Naoum-Sawaya, Joe, 2021. "Optimization problems for machine learning: A survey," European Journal of Operational Research, Elsevier, vol. 290(3), pages 807-828.
    6. Subhadip Sarkar, 2023. "ABC classification using extended R-model, SVM and Lorenz curve," OPSEARCH, Springer;Operational Research Society of India, vol. 60(3), pages 1433-1455, September.
    7. Yifat Douek-Pinkovich & Irad Ben-Gal & Tal Raviv, 2021. "The generalized test collection problem," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(2), pages 372-386, July.
    8. Jiapeng Liu & Miłosz Kadziński & Xiuwu Liao & Xiaoxin Mao, 2021. "Data-Driven Preference Learning Methods for Value-Driven Multiple Criteria Sorting with Interacting Criteria," INFORMS Journal on Computing, INFORMS, vol. 33(2), pages 586-606, May.
    9. Manlio Gaudioso & Giovanni Giallombardo & Giovanna Miglionico, 2023. "Sparse optimization via vector k-norm and DC programming with an application to feature selection for support vector machines," Computational Optimization and Applications, Springer, vol. 86(2), pages 745-766, November.
    10. Ni, Ji & Chen, Bowei & Allinson, Nigel M. & Ye, Xujiong, 2020. "A hybrid model for predicting human physical activity status from lifelogging data," European Journal of Operational Research, Elsevier, vol. 281(3), pages 532-542.
    11. Basna Mohammed Salih Hasan & Nawzat Sadiq Ahmed, 2021. "Feature selection technique applied in Medical application by Supervised algorithm: A Review," International Journal of Science and Business, IJSAB International, vol. 5(3), pages 190-203.
    12. You-Shyang Chen & Ying-Hsun Hung & Yu-Sheng Lin, 2023. "A Study to Identify Long-Term Care Insurance Using Advanced Intelligent RST Hybrid Models with Two-Stage Performance Evaluation," Mathematics, MDPI, vol. 11(13), pages 1-34, July.
    13. Davila-Pena, Laura & García-Jurado, Ignacio & Casas-Méndez, Balbina, 2022. "Assessment of the influence of features on a classification problem: An application to COVID-19 patients," European Journal of Operational Research, Elsevier, vol. 299(2), pages 631-641.
    14. Daehan Won & Hasan Manzour & Wanpracha Chaovalitwongse, 2020. "Convex Optimization for Group Feature Selection in Networked Data," INFORMS Journal on Computing, INFORMS, vol. 32(1), pages 182-198, January.
    15. Pi, J. & Wang, Honggang & Pardalos, Panos M., 2021. "A dual reformulation and solution framework for regularized convex clustering problems," European Journal of Operational Research, Elsevier, vol. 290(3), pages 844-856.
    16. He Jiang, 2023. "Robust forecasting in spatial autoregressive model with total variation regularization," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(2), pages 195-211, March.
    17. Jimenez-Marquez, Jose Luis & Gonzalez-Carrasco, Israel & Lopez-Cuadrado, Jose Luis & Ruiz-Mezcua, Belen, 2019. "Towards a big data framework for analyzing social media content," International Journal of Information Management, Elsevier, vol. 44(C), pages 1-12.
    18. Emilio Carrizosa & Cristina Molero-Río & Dolores Romero Morales, 2021. "Mathematical optimization in classification and regression trees," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(1), pages 5-33, April.
    19. Víctor Blanco & Alberto Japón & Justo Puerto, 2020. "Optimal arrangements of hyperplanes for SVM-based multiclass classification," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(1), pages 175-199, March.
    20. He Jiang & Weihua Zheng, 2022. "Deep learning with regularized robust long‐ and short‐term memory network for probabilistic short‐term load forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(6), pages 1201-1216, September.
