IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v317y2024i2p303-316.html
   My bibliography  Save this article

Interpretable generalized additive neural networks

Author

Listed:
  • Kraus, Mathias
  • Tschernutter, Daniel
  • Weinzierl, Sven
  • Zschech, Patrick

Abstract

We propose Interpretable Generalized Additive Neural Networks (IGANN), a novel machine learning model that uses gradient boosting and tailored neural networks to obtain high predictive performance while being interpretable to humans. We derive an efficient training algorithm based on the theory of extreme learning machines, that allows reducing the training process to solving a sequence of regularized linear regressions. We analyze the algorithm theoretically, provide insights into the rate of change of so-called shape functions, and show that the computational complexity of the training process scales linearly with the number of samples in the training dataset. We implement IGANN in PyTorch, which allows the model to be trained on graphics processing units (GPUs) to speed up training. We demonstrate favorable results in a variety of numerical experiments and showcase IGANN’s value in three real-world case studies for productivity prediction, credit scoring, and criminal recidivism prediction.

Suggested Citation

  • Kraus, Mathias & Tschernutter, Daniel & Weinzierl, Sven & Zschech, Patrick, 2024. "Interpretable generalized additive neural networks," European Journal of Operational Research, Elsevier, vol. 317(2), pages 303-316.
  • Handle: RePEc:eee:ejores:v:317:y:2024:i:2:p:303-316
    DOI: 10.1016/j.ejor.2023.06.032
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221723005027
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2023.06.032?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Coussement, Kristof & Benoit, Dries Frederik & Van den Poel, Dirk, 2009. "Improved Marketing Decision Making in a Customer Churn Prediction Context Using Generalized Additive Models," Working Papers 2009/18, Hogeschool-Universiteit Brussel, Faculteit Economie en Management.
    2. Kraus, Mathias & Feuerriegel, Stefan & Oztekin, Asil, 2020. "Deep learning in business analytics and operations research: Models, applications and managerial implications," European Journal of Operational Research, Elsevier, vol. 281(3), pages 628-641.
    3. Carbonneau, Real & Laframboise, Kevin & Vahidov, Rustam, 2008. "Application of machine learning techniques for supply chain demand forecasting," European Journal of Operational Research, Elsevier, vol. 184(3), pages 1140-1154, February.
    4. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    5. De Bock, Koen W. & Coussement, Kristof & Van den Poel, Dirk, 2010. "Ensemble classification based on generalized additive models," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1535-1546, June.
    6. Christian Janiesch & Patrick Zschech & Kai Heinrich, 2021. "Machine learning and deep learning," Electronic Markets, Springer;IIM University of St. Gallen, vol. 31(3), pages 685-695, September.
    7. Dragos Florin Ciocan & Velibor V. Mišić, 2022. "Interpretable Optimal Stopping," Management Science, INFORMS, vol. 68(3), pages 1616-1638, March.
    8. Philipp Borchert & Kristof Coussement & Arno de Caigny & Jochen de Weerdt, 2023. "Extending business failure prediction models with textual website content using deep learning," Post-Print hal-03976762, HAL.
    9. Julian Senoner & Torbjørn Netland & Stefan Feuerriegel, 2022. "Using Explainable Artificial Intelligence to Improve Process Quality: Evidence from Semiconductor Manufacturing," Management Science, INFORMS, vol. 68(8), pages 5704-5723, August.
    10. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    11. Arno de Caigny & Kristof Coussement & Koen W. de Bock, 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," Post-Print hal-01741661, HAL.
    12. Bastos, João A. & Matos, Sara M., 2022. "Explainable models of credit losses," European Journal of Operational Research, Elsevier, vol. 301(1), pages 386-394.
    13. Mitrović, Sandra & Baesens, Bart & Lemahieu, Wilfried & De Weerdt, Jochen, 2018. "On the operational efficiency of different feature types for telco Churn prediction," European Journal of Operational Research, Elsevier, vol. 267(3), pages 1141-1155.
    14. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. De Bock, Koen W. & Coussement, Kristof & Caigny, Arno De & Słowiński, Roman & Baesens, Bart & Boute, Robert N. & Choi, Tsan-Ming & Delen, Dursun & Kraus, Mathias & Lessmann, Stefan & Maldonado, Sebast, 2024. "Explainable AI for Operational Research: A defining framework, methods, applications, and a research agenda," European Journal of Operational Research, Elsevier, vol. 317(2), pages 249-272.
    2. Koen W. de Bock & Kristof Coussement & Arno De Caigny & Roman Slowiński & Bart Baesens & Robert N Boute & Tsan-Ming Choi & Dursun Delen & Mathias Kraus & Stefan Lessmann & Sebastián Maldonado & David , 2023. "Explainable AI for Operational Research: A Defining Framework, Methods, Applications, and a Research Agenda," Post-Print hal-04219546, HAL.
    3. Koen W. de Bock & Arno de Caigny, 2021. "Spline-rule ensemble classifiers with structured sparsity regularization for interpretable customer churn modeling," Post-Print hal-03391564, HAL.
    4. Thuy, Arthur & Benoit, Dries F., 2024. "Explainability through uncertainty: Trustworthy decision-making with neural networks," European Journal of Operational Research, Elsevier, vol. 317(2), pages 330-340.
    5. Sobrie, Léon & Verschelde, Marijn & Roets, Bart, 2024. "Explainable real-time predictive analytics on employee workload in digital railway control rooms," European Journal of Operational Research, Elsevier, vol. 317(2), pages 437-448.
    6. Louis Geiler & Séverine Affeldt & Mohamed Nadif, 2022. "A survey on machine learning methods for churn prediction," Post-Print hal-03824873, HAL.
    7. Liu, Zhenkun & Zhang, Ying & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Hufang & Gao, Yuyang & Chen, Yinghao, 2024. "Profit-driven fusion framework based on bagging and boosting classifiers for potential purchaser prediction," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).
    8. Lewlisa Saha & Hrudaya Kumar Tripathy & Tarek Gaber & Hatem El-Gohary & El-Sayed M. El-kenawy, 2023. "Deep Churn Prediction Method for Telecommunication Industry," Sustainability, MDPI, vol. 15(5), pages 1-21, March.
    9. Borchert, Philipp & Coussement, Kristof & De Caigny, Arno & De Weerdt, Jochen, 2023. "Extending business failure prediction models with textual website content using deep learning," European Journal of Operational Research, Elsevier, vol. 306(1), pages 348-357.
    10. Schaeffer, Satu Elisa & Rodriguez Sanchez, Sara Veronica, 2020. "Forecasting client retention — A machine-learning approach," Journal of Retailing and Consumer Services, Elsevier, vol. 52(C).
    11. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    12. Stevens, Alexander & De Smedt, Johannes, 2024. "Explainability in process outcome prediction: Guidelines to obtain interpretable and faithful models," European Journal of Operational Research, Elsevier, vol. 317(2), pages 317-329.
    13. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    14. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W. & Lessmann, Stefan, 2020. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1563-1578.
    15. Youngkeun Choi & Jae W. Choi, 2023. "Assessing the Predictive Performance of Machine Learning in Direct Marketing Response," International Journal of E-Business Research (IJEBR), IGI Global, vol. 19(1), pages 1-12, January.
    16. Arno de Caigny & Kristof Coussement & Koen de Bock, 2020. "Leveraging fine-grained transaction data for customer life event predictions," Post-Print hal-02507998, HAL.
    17. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    18. Chen, Yan & Zhang, Lei & Zhao, Yulu & Xu, Bing, 2022. "Implementation of penalized survival models in churn prediction of vehicle insurance," Journal of Business Research, Elsevier, vol. 153(C), pages 162-171.
    19. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    20. Bram Janssens & Matthias Bogaert & Astrid Bagué & Dirk Van den Poel, 2024. "B2Boost: instance-dependent profit-driven modelling of B2B churn," Annals of Operations Research, Springer, vol. 341(1), pages 267-293, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:317:y:2024:i:2:p:303-316. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.