IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v325y2023i2d10.1007_s10479-022-04599-2.html
   My bibliography  Save this article

Data-driven decision model based on local two-stage weighted ensemble learning

Author

Listed:
  • Che Xu

    (Hefei University of Technology
    Key Laboratory of Process Optimization and Intelligent Decision-Making, Ministry of Education
    Ministry of Education Engineering Research Center for Intelligent Decision-Making and Information System Technologies)

  • Wenjun Chang

    (Hefei University of Technology
    Key Laboratory of Process Optimization and Intelligent Decision-Making, Ministry of Education
    Ministry of Education Engineering Research Center for Intelligent Decision-Making and Information System Technologies)

  • Weiyong Liu

    (University of Science and Technology of China)

Abstract

To improve the decision performance using historical decision data, this paper proposes a data-driven decision model based on local two-stage weighted ensemble learning. The assessments of historical alternatives are collected from a multicriteria framework. For each new alternative, a set of its similar alternatives is determined from historical alternatives using the K-Nearest Neighbor technique, and then a set of base classifiers (BCs) is generated by the historical assessments. Based on ensemble error and diversity of BCs in predicting the similar historical alternatives of the new alternative, a local two-stage weighted ensemble method is developed to learn the optimal BC weights for the new alternative. Such a learning process not only considers the changes of BCs’ competence in facing different alternatives (instances) but also avoids falling into the dilemma of balancing the accuracy and diversity of BCs. By combining the continuous outputs of different BCs with the learned BC weights, the weighted ensemble outputs are obtained for the similar historical alternatives of the new alternative. Based on these outputs and the assessments of those similar historical alternatives on criteria, a linear optimization model is constructed to learn criterion weights. Using the learned criterion weights, the interpretable decision is performed. The advantages of the proposed decision model against four traditional decision models are validated by a real case study for the diagnosis of thyroid nodules. Thirty real datasets examine the competence of the proposed weighted ensemble method against mainstream ensemble methods and combination rules.

Suggested Citation

  • Che Xu & Wenjun Chang & Weiyong Liu, 2023. "Data-driven decision model based on local two-stage weighted ensemble learning," Annals of Operations Research, Springer, vol. 325(2), pages 995-1028, June.
  • Handle: RePEc:spr:annopr:v:325:y:2023:i:2:d:10.1007_s10479-022-04599-2
    DOI: 10.1007/s10479-022-04599-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-022-04599-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-022-04599-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Talayeh Razzaghi & Ilya Safro & Joseph Ewing & Ehsan Sadrfaridpour & John D. Scott, 2019. "Predictive models for bariatric surgery risks with imbalanced medical datasets," Annals of Operations Research, Springer, vol. 280(1), pages 1-18, September.
    2. Wang, Haifeng & Zheng, Bichen & Yoon, Sang Won & Ko, Hoo Sang, 2018. "A support vector machine-based ensemble algorithm for breast cancer diagnosis," European Journal of Operational Research, Elsevier, vol. 267(2), pages 687-699.
    3. Guo, Mengzhuo & Zhang, Qingpeng & Liao, Xiuwu & Chen, Frank Youhua & Zeng, Daniel Dajun, 2021. "A hybrid machine learning framework for analyzing human decision-making through learning preferences," Omega, Elsevier, vol. 101(C).
    4. Chao Fu & Weiyong Liu & Wenjun Chang, 2020. "Data-driven multiple criteria decision making for diagnosis of thyroid cancer," Annals of Operations Research, Springer, vol. 293(2), pages 833-862, October.
    5. César Alfaro & Javier Cano-Montero & Javier Gómez & Javier M. Moguerza & Felipe Ortega, 2016. "A multi-stage method for content classification and opinion mining on weblog comments," Annals of Operations Research, Springer, vol. 236(1), pages 197-213, January.
    6. L. Fernandes & A. Fischer & J. Júdice & C. Requejo & J. Soares, 1998. "A block active set algorithm for large-scalequadratic programming with box constraints," Annals of Operations Research, Springer, vol. 81(0), pages 75-96, June.
    7. Guo, Mengzhuo & Liao, Xiuwu & Liu, Jiapeng & Zhang, Qingpeng, 2020. "Consumer preference analysis: A data-driven multiple criteria approach integrating online information," Omega, Elsevier, vol. 96(C).
    8. Onur Şeref & Talayeh Razzaghi & Petros Xanthopoulos, 2017. "Weighted relaxed support vector machines," Annals of Operations Research, Springer, vol. 249(1), pages 235-271, February.
    9. Jiamin Wang, 2012. "Robust optimization analysis for multiple attribute decision making problems with imprecise information," Annals of Operations Research, Springer, vol. 197(1), pages 109-122, August.
    10. Ling Tang & Shuai Wang & Kaijian He & Shouyang Wang, 2015. "A novel mode-characteristic-based decomposition ensemble model for nuclear energy consumption forecasting," Annals of Operations Research, Springer, vol. 234(1), pages 111-132, November.
    11. César Alfaro & Javier Cano-Montero & Javier Gómez & Javier Moguerza & Felipe Ortega, 2016. "A multi-stage method for content classification and opinion mining on weblog comments," Annals of Operations Research, Springer, vol. 236(1), pages 197-213, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, Han & Gupta, Ashish & Zhang, Jie & Flor, Nick, 2020. "Who will use augmented reality? An integrated approach based on text analytics and field survey," European Journal of Operational Research, Elsevier, vol. 281(3), pages 502-516.
    2. Hui Yuan & Wei Xu & Qian Li & Raymond Lau, 2018. "Topic sentiment mining for sales performance prediction in e-commerce," Annals of Operations Research, Springer, vol. 270(1), pages 553-576, November.
    3. Justin Longo & Alan Rodney Dobell, 2018. "The Limits of Policy Analytics: Early Examples and the Emerging Boundary of Possibilities," Politics and Governance, Cogitatio Press, vol. 6(4), pages 5-17.
    4. Matteo Cinelli & Valerio Ficcadenti & Jessica Riccioni, 2020. "The interconnectedness of the economic content in the speeches of the US Presidents," Papers 2002.07880, arXiv.org.
    5. Chao Fu & Dongyue Wang & Wenjun Chang, 2023. "Data-driven analysis of influence between radiologists for diagnosis of breast lesions," Annals of Operations Research, Springer, vol. 328(1), pages 419-449, September.
    6. Matteo Cinelli & Valerio Ficcadenti & Jessica Riccioni, 2021. "The interconnectedness of the economic content in the speeches of the US Presidents," Annals of Operations Research, Springer, vol. 299(1), pages 593-615, April.
    7. Long Mai & Bac Le, 2021. "Joint sentence and aspect-level sentiment analysis of product comments," Annals of Operations Research, Springer, vol. 300(2), pages 493-513, May.
    8. Horst Treiblmaier & Patrick Mair, 2021. "Textual Data Science for Logistics and Supply Chain Management," Logistics, MDPI, vol. 5(3), pages 1-15, August.
    9. Tong, Huagang & Zhu, Jianjun, 2023. "A parallel approach with the strategy-proof mechanism for large-scale group decision making: An application in industrial internet," European Journal of Operational Research, Elsevier, vol. 311(1), pages 173-195.
    10. Hakim Akeb & Aldo Lévy & Mohamed Rdali, 2022. "A quantitative method for opinion ratings and analysis: an event study," Annals of Operations Research, Springer, vol. 313(2), pages 625-638, June.
    11. Manrui Jiang & Lifen Jia & Zhensong Chen & Wei Chen, 2022. "The two-stage machine learning ensemble models for stock price prediction by combining mode decomposition, extreme learning machine and improved harmony search algorithm," Annals of Operations Research, Springer, vol. 309(2), pages 553-585, February.
    12. Wu, Xingli & Liao, Huchang, 2021. "Modeling personalized cognition of customers in online shopping," Omega, Elsevier, vol. 104(C).
    13. Abdur Rasool & Chayut Bunterngchit & Luo Tiejian & Md. Ruhul Islam & Qiang Qu & Qingshan Jiang, 2022. "Improved Machine Learning-Based Predictive Models for Breast Cancer Diagnosis," IJERPH, MDPI, vol. 19(6), pages 1-19, March.
    14. Meimei Xia & Jian Chen & Juliang Zhang, 2015. "Multi-criteria decision making based on relative measures," Annals of Operations Research, Springer, vol. 229(1), pages 791-811, June.
    15. Huang, Yumeng & Dai, Xingyu & Wang, Qunwei & Zhou, Dequn, 2021. "A hybrid model for carbon price forecastingusing GARCH and long short-term memory network," Applied Energy, Elsevier, vol. 285(C).
    16. Joanna Błajda & Edyta Barnaś & Anna Kucab, 2022. "Application of Personalized Education in the Mobile Medical App for Breast Self-Examination," IJERPH, MDPI, vol. 19(8), pages 1-21, April.
    17. Astorino, Annabella & Avolio, Matteo & Fuduli, Antonio, 2022. "A maximum-margin multisphere approach for binary Multiple Instance Learning," European Journal of Operational Research, Elsevier, vol. 299(2), pages 642-652.
    18. Meshwa Rameshbhai Savalia & Jaiprakash Vinodkumar Verma, 2023. "Classifying Malignant and Benign Tumors of Breast Cancer: A Comparative Investigation Using Machine Learning Techniques," International Journal of Reliable and Quality E-Healthcare (IJRQEH), IGI Global, vol. 12(1), pages 1-19, January.
    19. Baldomero-Naranjo, Marta & Martínez-Merino, Luisa I. & Rodríguez-Chía, Antonio M., 2020. "Tightening big Ms in integer programming formulations for support vector machines with ramp loss," European Journal of Operational Research, Elsevier, vol. 286(1), pages 84-100.
    20. Zhao, Meng & Li, Bin & Ren, Jiali & Hao, Zhihua, 2023. "Competition equilibrium of ride-sourcing platforms and optimal government subsidies considering customers’ green preference under peak carbon dioxide emissions," International Journal of Production Economics, Elsevier, vol. 255(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:325:y:2023:i:2:d:10.1007_s10479-022-04599-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.