IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v230y2013i3p581-595.html
   My bibliography  Save this article

A memetic approach to construct transductive discrete support vector machines

Author

Listed:
  • Brandner, Hubertus
  • Lessmann, Stefan
  • Voß, Stefan

Abstract

Transductive learning involves the construction and application of prediction models to classify a fixed set of decision objects into discrete groups. It is a special case of classification analysis with important applications in web-mining, corporate planning and other areas. This paper proposes a novel transductive classifier that is based on the philosophy of discrete support vector machines. We formalize the task to estimate the class labels of decision objects as a mixed integer program. A memetic algorithm is developed to solve the mathematical program and to construct a transductive support vector machine classifier, respectively. Empirical experiments on synthetic and real-world data evidence the effectiveness of the new approach and demonstrate that it identifies high quality solutions in short time. Furthermore, the results suggest that the class predictions following from the memetic algorithm are significantly more accurate than the predictions of a CPLEX-based reference classifier. Comparisons to other transductive and inductive classifiers provide further support for our approach and suggest that it performs competitive with respect to several benchmarks.

Suggested Citation

  • Brandner, Hubertus & Lessmann, Stefan & Voß, Stefan, 2013. "A memetic approach to construct transductive discrete support vector machines," European Journal of Operational Research, Elsevier, vol. 230(3), pages 581-595.
  • Handle: RePEc:eee:ejores:v:230:y:2013:i:3:p:581-595
    DOI: 10.1016/j.ejor.2013.05.010
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221713004098
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2013.05.010?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Buckinx, Wouter & Van den Poel, Dirk, 2005. "Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting," European Journal of Operational Research, Elsevier, vol. 164(1), pages 252-268, July.
    2. Carlotta Orsenigo & Carlo Vercellis, 2009. "Multicategory classification via discrete support vector machines," Computational Management Science, Springer, vol. 6(1), pages 101-114, February.
    3. William Gehrlein & Bret Wagner, 1997. "A two-stage least cost credit scoring model," Annals of Operations Research, Springer, vol. 74(0), pages 159-171, November.
    4. P. S. Bradley & Usama M. Fayyad & O. L. Mangasarian, 1999. "Mathematical Programming for Data Mining: Formulations and Challenges," INFORMS Journal on Computing, INFORMS, vol. 11(3), pages 217-238, August.
    5. Koehler, Gary J., 1991. "Improper linear discriminant classifiers," European Journal of Operational Research, Elsevier, vol. 50(2), pages 188-198, January.
    6. Paul Rubin, 1997. "Solving mixed integer classification problems by decomposition," Annals of Operations Research, Springer, vol. 74(0), pages 51-64, November.
    7. Jiayin Qi & Li Zhang & Yanping Liu & Ling Li & Yongpin Zhou & Yao Shen & Liang Liang & Huaizu Li, 2009. "ADTreesLogit model for customer churn prediction," Annals of Operations Research, Springer, vol. 168(1), pages 247-265, April.
    8. Lessmann, Stefan & Voß, Stefan, 2009. "A reference model for customer-centric data mining with support vector machines," European Journal of Operational Research, Elsevier, vol. 199(2), pages 520-530, December.
    9. Gestel, Tony Van & Baesens, Bart & Suykens, Johan A.K. & Van den Poel, Dirk & Baestaens, Dirk-Emma & Willekens, Marleen, 2006. "Bayesian kernel based classification for financial distress detection," European Journal of Operational Research, Elsevier, vol. 172(3), pages 979-1003, August.
    10. Bot, Radu Ioan & Lorenz, Nicole, 2011. "Optimization problems in statistical learning: Duality and optimality conditions," European Journal of Operational Research, Elsevier, vol. 213(2), pages 395-404, September.
    11. P. S. Bradley & O. L. Mangasarian & W. N. Street, 1998. "Feature Selection via Mathematical Programming," INFORMS Journal on Computing, INFORMS, vol. 10(2), pages 209-217, May.
    12. Sexton, Randall S. & McMurtrey, Shannon & Cleavenger, Dean, 2006. "Knowledge discovery using a neural network simultaneous optimization algorithm on a real world classification problem," European Journal of Operational Research, Elsevier, vol. 168(3), pages 1009-1018, February.
    13. Xue Bai & Rema Padman & Joseph Ramsey & Peter Spirtes, 2008. "Tabu Search-Enhanced Graphical Models for Classification in High Dimensions," INFORMS Journal on Computing, INFORMS, vol. 20(3), pages 423-437, August.
    14. O. L. Mangasarian, 1965. "Linear and Nonlinear Separation of Patterns by Linear Programming," Operations Research, INFORMS, vol. 13(3), pages 444-452, June.
    15. West, David & Mangiameli, Paul & Rampal, Rohit & West, Vivian, 2005. "Ensemble strategies for a medical diagnostic decision support system: A breast cancer diagnosis application," European Journal of Operational Research, Elsevier, vol. 162(2), pages 532-551, April.
    16. Crone, Sven F. & Lessmann, Stefan & Stahlbock, Robert, 2006. "The impact of preprocessing on data mining: An evaluation of classifier sensitivity in direct marketing," European Journal of Operational Research, Elsevier, vol. 173(3), pages 781-800, September.
    17. Frank Plastria & Steven De Bruyne & Emilio Carrizosa, 2010. "Alternating local search based VNS for linear classification," Annals of Operations Research, Springer, vol. 174(1), pages 121-134, February.
    18. Olvi L. Mangasarian & W. Nick Street & William H. Wolberg, 1995. "Breast Cancer Diagnosis and Prognosis Via Linear Programming," Operations Research, INFORMS, vol. 43(4), pages 570-577, August.
    19. Baesens, Bart & Viaene, Stijn & Van den Poel, Dirk & Vanthienen, Jan & Dedene, Guido, 2002. "Bayesian neural network learning for repeat purchase modelling in direct marketing," European Journal of Operational Research, Elsevier, vol. 138(1), pages 191-211, April.
    20. Dimitris Bertsimas & Romy Shioda, 2007. "Classification and Regression via Integer Optimization," Operations Research, INFORMS, vol. 55(2), pages 252-271, April.
    21. Talla Nobibon, Fabrice & Leus, Roel & Spieksma, Frits C.R., 2011. "Optimization models for targeted offers in direct marketing: Exact and heuristic algorithms," European Journal of Operational Research, Elsevier, vol. 210(3), pages 670-683, May.
    22. Verbeke, Wouter & Dejaeger, Karel & Martens, David & Hur, Joon & Baesens, Bart, 2012. "New insights into churn prediction in the telecommunication sector: A profit driven data mining approach," European Journal of Operational Research, Elsevier, vol. 218(1), pages 211-229.
    23. Orsenigo, Carlotta & Vercellis, Carlo, 2004. "Discrete support vector decision trees via tabu search," Computational Statistics & Data Analysis, Elsevier, vol. 47(2), pages 311-322, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jann Michael Weinand & Kenneth Sorensen & Pablo San Segundo & Max Kleinebrahm & Russell McKenna, 2020. "Research trends in combinatorial optimisation," Papers 2012.01294, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pedro Duarte Silva, A., 2017. "Optimization approaches to Supervised Classification," European Journal of Operational Research, Elsevier, vol. 261(2), pages 772-788.
    2. Lessmann, Stefan & Voß, Stefan, 2009. "A reference model for customer-centric data mining with support vector machines," European Journal of Operational Research, Elsevier, vol. 199(2), pages 520-530, December.
    3. Chen, Zhen-Yu & Fan, Zhi-Ping & Sun, Minghe, 2012. "A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data," European Journal of Operational Research, Elsevier, vol. 223(2), pages 461-472.
    4. Fan, Zhi-Ping & Sun, Minghe, 2015. "Behavior-aware user response modeling in social media: Learning from diverse heterogeneous dataAuthor-Name: Chen, Zhen-Yu," European Journal of Operational Research, Elsevier, vol. 241(2), pages 422-434.
    5. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    6. K. W. De Bock & D. Van Den Poel, 2012. "Reconciling Performance and Interpretability in Customer Churn Prediction using Ensemble Learning based on Generalized Additive Models," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/805, Ghent University, Faculty of Economics and Business Administration.
    7. B Baesens & C Mues & D Martens & J Vanthienen, 2009. "50 years of data mining and OR: upcoming trends and challenges," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 60(1), pages 16-23, May.
    8. Meisel, Stephan & Mattfeld, Dirk, 2010. "Synergies of Operations Research and Data Mining," European Journal of Operational Research, Elsevier, vol. 206(1), pages 1-10, October.
    9. R Fildes & K Nikolopoulos & S F Crone & A A Syntetos, 2008. "Forecasting and operational research: a review," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 59(9), pages 1150-1172, September.
    10. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    11. R. Chandrasekaran & Young U. Ryu & Varghese S. Jacob & Sungchul Hong, 2005. "Isotonic Separation," INFORMS Journal on Computing, INFORMS, vol. 17(4), pages 462-474, November.
    12. W. Nick Street, 2005. "Oblique Multicategory Decision Trees Using Nonlinear Programming," INFORMS Journal on Computing, INFORMS, vol. 17(1), pages 25-31, February.
    13. Balaji Padmanabhan & Alexander Tuzhilin, 2003. "On the Use of Optimization for Data Mining: Theoretical Interactions and eCRM Opportunities," Management Science, INFORMS, vol. 49(10), pages 1327-1343, October.
    14. P. S. Bradley & Usama M. Fayyad & O. L. Mangasarian, 1999. "Mathematical Programming for Data Mining: Formulations and Challenges," INFORMS Journal on Computing, INFORMS, vol. 11(3), pages 217-238, August.
    15. Olafsson, Sigurdur & Li, Xiaonan & Wu, Shuning, 2008. "Operations research and data mining," European Journal of Operational Research, Elsevier, vol. 187(3), pages 1429-1448, June.
    16. Tang, Leilei & Thomas, Lyn & Fletcher, Mary & Pan, Jiazhu & Marshall, Andrew, 2014. "Assessing the impact of derived behavior information on customer attrition in the financial service industry," European Journal of Operational Research, Elsevier, vol. 236(2), pages 624-633.
    17. Koen W. de Bock & Arno de Caigny, 2021. "Spline-rule ensemble classifiers with structured sparsity regularization for interpretable customer churn modeling," Post-Print hal-03391564, HAL.
    18. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    19. Arno de Caigny & Kristof Coussement & Koen de Bock, 2020. "Leveraging fine-grained transaction data for customer life event predictions," Post-Print hal-02507998, HAL.
    20. K. Coussement & D. Van Den Poel, 2008. "Improving Customer Attrition Prediction by Integrating Emotions from Client/Company Interaction Emails and Evaluating Multiple Classifiers," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 08/527, Ghent University, Faculty of Economics and Business Administration.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:230:y:2013:i:3:p:581-595. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.