Machine Learning for Data Center Optimizations: Feature Selection Using Shapley Additive exPlanation (SHAP)

Author

Listed:
  • Yibrah Gebreyesus

    (School of Computer Science, University College of Dublin, D04 V1W8 Dublin, Ireland)

  • Damian Dalton

    (School of Computer Science, University College of Dublin, D04 V1W8 Dublin, Ireland)

  • Sebastian Nixon

    (School of Computer Science, Wolaita Sodo University, Wolaita P.O. Box 138, Ethiopia)

  • Davide De Chiara

    (ENEA-R.C. Portici, 80055 Portici (NA), Italy)

  • Marta Chinnici

    (ENEA-R.C. Casaccia, 00196 Rome, Italy)

Abstract

The need for artificial intelligence (AI) and machine learning (ML) models to optimize data center (DC) operations grows as the volume of operations management data increases rapidly. Such models can help operators better understand their DC operations and make informed decisions in advance to maintain service reliability and availability. Applications include developing models that optimize energy efficiency, identifying inefficient resource utilization and scheduling policies, and predicting outages. In addition to model hyperparameter tuning, feature subset selection (FSS) is critical for identifying the features relevant to modeling DC operations effectively: it provides insight into the data, improves model performance, and reduces computational expense. Hence, this paper introduces the Shapley Additive exPlanation (SHAP) values method, a class of additive feature attribution values that is rarely discussed in the literature as a means of identifying relevant features. We compared its effectiveness with several commonly used, importance-based feature selection methods. The methods were tested on real DC operations data streams obtained from the ENEA CRESCO6 cluster with 20,832 cores. To demonstrate the effectiveness of SHAP compared to other methods, we selected the top ten most important features from each method, retrained the predictive models, and evaluated their performance using the mean absolute error (MAE), root mean squared error (RMSE), and mean absolute percentage error (MAPE) evaluation criteria. The results presented in this paper demonstrate that the predictive models trained using features selected with the SHAP-assisted method performed well, with a lower error and a reasonable execution time compared to other methods.
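
The selection procedure summarized in the abstract (rank features by an additive attribution, keep the top-ranked ones, then score the retrained model by error) can be illustrated with a minimal sketch. This is not the authors' implementation: it computes exact Shapley values by brute force against a single background row, which is only feasible for a handful of features, whereas practical SHAP tooling relies on approximations. The toy model `toy_predict`, the sample rows, and all function names are hypothetical and chosen for illustration only.

```python
from itertools import combinations
from math import factorial, sqrt


def shapley_values(predict, x, background):
    """Exact Shapley values for one sample x (cost grows as 2**n).

    Features outside a coalition take their value from `background`,
    a single reference row (an assumption made for this sketch).
    """
    n = len(x)

    def value(coalition):
        z = [x[j] if j in coalition else background[j] for j in range(n)]
        return predict(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight |S|! (n - |S| - 1)! / n! for coalitions of this size
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def rank_features(predict, rows, background):
    """Return feature indices sorted by mean |Shapley value|, largest first."""
    n = len(background)
    totals = [0.0] * n
    for x in rows:
        for i, p in enumerate(shapley_values(predict, x, background)):
            totals[i] += abs(p)
    mean_abs = [t / len(rows) for t in totals]
    order = sorted(range(n), key=lambda i: mean_abs[i], reverse=True)
    return order, mean_abs


def mae(y, y_hat):
    """Mean absolute error."""
    return sum(abs(a - b) for a, b in zip(y, y_hat)) / len(y)


def rmse(y, y_hat):
    """Root mean squared error."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(y, y_hat)) / len(y))


def mape(y, y_hat):
    """Mean absolute percentage error, in percent; assumes no zero targets."""
    return 100.0 * sum(abs((a - b) / a) for a, b in zip(y, y_hat)) / len(y)


def toy_predict(z):
    """Hypothetical stand-in model; feature 2 is deliberately irrelevant."""
    return 3.0 * z[0] + 0.5 * z[1] - 2.0 * z[3]


rows = [[1.0, 2.0, 5.0, 1.0], [2.0, 0.0, -3.0, 0.5], [0.0, 1.0, 4.0, 2.0]]
background = [sum(col) / len(rows) for col in zip(*rows)]  # column means

order, mean_abs = rank_features(toy_predict, rows, background)
top_k = order[:2]  # the paper keeps the top ten; two suffice for four features
print("ranking:", order, "selected:", top_k)
```

For this toy model the two highest-ranked features are indices 0 and 3, and the unused feature 2 receives zero attribution. In a fuller workflow the model would be retrained on the selected columns and the three error functions applied to its held-out predictions.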

Suggested Citation

  • Yibrah Gebreyesus & Damian Dalton & Sebastian Nixon & Davide De Chiara & Marta Chinnici, 2023. "Machine Learning for Data Center Optimizations: Feature Selection Using Shapley Additive exPlanation (SHAP)," Future Internet, MDPI, vol. 15(3), pages 1-17, February.
  • Handle: RePEc:gam:jftint:v:15:y:2023:i:3:p:88-:d:1076144

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/15/3/88/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/15/3/88/
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhenxiang Cao & Liqing Peng, 2023. "The Impact of Digital Economics on Environmental Quality: A System Dynamics Approach," SAGE Open, vol. 13(4), pages 21582440231, December.
    2. Steffen Dalsgaard, 2022. "Can IT Resolve the Climate Crisis? Sketching the Role of an Anthropology of Digital Technology," Sustainability, MDPI, vol. 14(10), pages 1-17, May.
    3. Axenbeck, Janna & Niebel, Thomas, 2021. "Climate Protection Potentials of Digitalized Production Processes: Microeconometric Evidence," 23rd ITS Biennial Conference, Online Conference / Gothenburg 2021. Digital societies and industrial transformations: Policies, markets, and technologies in a post-Covid world 238007, International Telecommunications Society (ITS).
    4. Lange, Steffen & Pohl, Johanna & Santarius, Tilman, 2020. "Digitalization and energy consumption. Does ICT reduce energy demand?," Ecological Economics, Elsevier, vol. 176(C).
    5. Damar Novtahaning & Hasnain Ali Shah & Jae-Mo Kang, 2022. "Deep Learning Ensemble-Based Automated and High-Performing Recognition of Coffee Leaf Disease," Agriculture, MDPI, vol. 12(11), pages 1-16, November.
    6. Francisco Olmo-García & Fernando Javier Crecente-Romero & María Teresa Val-Núñez & María Sarabia-Alegría, 2023. "Entrepreneurial activity in an environment of digital transformation: an analysis of relevant factors in the euro area," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-10, December.
    7. Beata Bieszk-Stolorz & Krzysztof Dmytrów, 2022. "Assessment of the Similarity of the Situation in the EU Labour Markets and Their Changes in the Face of the COVID-19 Pandemic," Sustainability, MDPI, vol. 14(6), pages 1-20, March.
    8. Paraschiv Dorel Mihai & Țițan Emilia & Manea Daniela Ioana & Ionescu Crina Dana & Mihai Mihaela & Șerban Octavian, 2022. "The change in e-commerce in the context of the Coronavirus pandemic," Management & Marketing, Sciendo, vol. 17(2), pages 220-233, June.
    9. Stefano Cascone, 2024. "Eco-Innovative Construction: Integrating Green Roofs Design within the BIM Framework," Sustainability, MDPI, vol. 16(5), pages 1-19, February.
    10. Robert Ferritto, 2024. "Will policy help close the digital gender divide? An intersectional feminist policy analysis of Ethiopia's national digital policy," Development Policy Review, Overseas Development Institute, vol. 42(2), March.
    11. Mariusz Czupich & Justyna Łapińska & Vojtěch Bartoš, 2022. "Environmental Sustainability Assessment of the European Union’s Capital Cities," IJERPH, MDPI, vol. 19(7), pages 1-18, April.
    12. Anders S. G. Andrae & Mengjun Xia & Jianli Zhang & Xiaoming Tang, 2016. "Practical Eco-Design and Eco-Innovation of Consumer Electronics—the Case of Mobile Phones," Challenges, MDPI, vol. 7(1), pages 1-19, February.
    13. Muhammad Fahad & Arsalan Shahid & Ravi Reddy Manumachu & Alexey Lastovetsky, 2019. "A Comparative Study of Methods for Measurement of Energy of Computing," Energies, MDPI, vol. 12(11), pages 1-42, June.
    14. Tilman Santarius & Johanna Pohl & Steffen Lange, 2020. "Digitalization and the Decoupling Debate: Can ICT Help to Reduce Environmental Impacts While the Economy Keeps Growing?," Sustainability, MDPI, vol. 12(18), pages 1-20, September.
    15. Anna Wichowska, 2021. "The degree of fiscal decentralization in European Union countries in different stages of the economic cycle," Entrepreneurship and Sustainability Issues, VsI Entrepreneurship and Sustainability Center, vol. 9(2), pages 198-208, December.
    16. John Martinovic & Markus Hähnel & Guntram Scheithauer & Waltenegus Dargie, 2022. "An introduction to stochastic bin packing-based server consolidation with conflicts," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(2), pages 296-331, July.
    17. Balcerzak, Adam P. & Zinecker, Marek & Skalický, Roman & Rogalska, Elżbieta & Doubravský, Karel, 2023. "Technology-oriented start-ups and valuation: A novel approach based on specific contract terms," Technological Forecasting and Social Change, Elsevier, vol. 197(C).
    18. Anders S. G. Andrae & Mikko Samuli Vaija, 2017. "Precision of a Streamlined Life Cycle Assessment Approach Used in Eco-Rating of Mobile Phones," Challenges, MDPI, vol. 8(2), pages 1-24, August.
    19. Bilan, Yuriy & Oliinyk, Olena & Mishchuk, Halyna & Skare, Marinko, 2023. "Impact of information and communications technology on the development and use of knowledge," Technological Forecasting and Social Change, Elsevier, vol. 191(C).
    20. Salil Bharany & Sandeep Sharma & Osamah Ibrahim Khalaf & Ghaida Muttashar Abdulsahib & Abeer S. Al Humaimeedy & Theyazn H. H. Aldhyani & Mashael Maashi & Hasan Alkahtani, 2022. "A Systematic Survey on Energy-Efficient Techniques in Sustainable Cloud Computing," Sustainability, MDPI, vol. 14(10), pages 1-89, May.
