IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0269685.html
   My bibliography  Save this article

Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting

Author

Listed:
  • Rajan Vohra
  • Abir Hussain
  • Anil Kumar Dudyala
  • Jankisharan Pahareeya
  • Wasiq Khan

Abstract

Anemia is one of the most pressing public health issues in the world with iron deficiency a major public health issue worldwide. The highest prevalence of anemia is in developing countries. The complete blood count is a blood test used to diagnose the prevalence of anemia. While earlier studies have framed the problem of diagnosis as a binary classification problem, this paper frames it as a multi class (three classes) classification problem with mild, moderate and severe classes. The three classes for the anemia classification (mild, moderate, severe) are so chosen as the world health organization (WHO) guidelines formalize this categorization based on the Haemoglobin (HGB) values of the chosen sample of patients in the Complete Blood Count (CBC) patient data set. Complete blood count test data was collected in an outpatient clinical setting in India. We used Feature selection with Majority voting to identify the key attributes in the input patient data set. In addition, since the original data set was imbalanced we used Synthetic Minority Oversampling Technique (SMOTE) to balance the data set. Four data sets including the original data set were used to perform the data experiments. Six standard machine learning algorithms were utilised to test our four data sets, performing multi class classification. Benchmarking these algorithms was performed and tabulated using both10 fold cross validation and hold out methods. The experimental results indicated that multilayer perceptron network was predominantly giving good recall values across mild and moderate class which are early and middle stages of the disease. With a good prediction model at early stages, medical intervention can provide preventive measure from further deterioration into severe stage or recommend the use of supplements to overcome this problem.

Suggested Citation

  • Rajan Vohra & Abir Hussain & Anil Kumar Dudyala & Jankisharan Pahareeya & Wasiq Khan, 2022. "Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting," PLOS ONE, Public Library of Science, vol. 17(7), pages 1-18, July.
  • Handle: RePEc:plo:pone00:0269685
    DOI: 10.1371/journal.pone.0269685
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0269685
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0269685&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0269685?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Ben Taieb, Souhaib & Hyndman, Rob J., 2014. "A gradient boosting approach to the Kaggle load forecasting competition," International Journal of Forecasting, Elsevier, vol. 30(2), pages 382-394.
    2. Dudyala Anil Kumar & V. Ravi, 2008. "Predicting credit card customer churn in banks using data mining," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 1(1), pages 4-28.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fan, Cheng & Sun, Yongjun & Zhao, Yang & Song, Mengjie & Wang, Jiayuan, 2019. "Deep learning-based feature engineering methods for improved building energy prediction," Applied Energy, Elsevier, vol. 240(C), pages 35-45.
    2. Prpić, John & Shukla, Prashant P. & Kietzmann, Jan H. & McCarthy, Ian P., 2015. "How to work a crowd: Developing crowd capital through crowdsourcing," Business Horizons, Elsevier, vol. 58(1), pages 77-85.
    3. Zhang, Ning & Li, Zhiying & Zou, Xun & Quiring, Steven M., 2019. "Comparison of three short-term load forecast models in Southern California," Energy, Elsevier, vol. 189(C).
    4. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    5. Arno de Caigny & Kristof Coussement & Koen W. de Bock & Stefan Lessmann, 2019. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," Post-Print hal-02275958, HAL.
    6. Ahmad, Tanveer & Chen, Huanxin, 2018. "Potential of three variant machine-learning models for forecasting district level medium-term and long-term energy demand in smart grid environment," Energy, Elsevier, vol. 160(C), pages 1008-1020.
    7. Verstraete, Gylian & Aghezzaf, El-Houssaine & Desmet, Bram, 2019. "A data-driven framework for predicting weather impact on high-volume low-margin retail products," Journal of Retailing and Consumer Services, Elsevier, vol. 48(C), pages 169-177.
    8. Ivana Kiprijanovska & Simon Stankoski & Igor Ilievski & Slobodan Jovanovski & Matjaž Gams & Hristijan Gjoreski, 2020. "HousEEC: Day-Ahead Household Electrical Energy Consumption Forecasting Using Deep Learning," Energies, MDPI, vol. 13(10), pages 1-29, May.
    9. Roach, Cameron, 2019. "Reconciled boosted models for GEFCom2017 hierarchical probabilistic load forecasting," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1439-1450.
    10. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W. & Lessmann, Stefan, 2020. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1563-1578.
    11. Döpke, Jörg & Fritsche, Ulrich & Pierdzioch, Christian, 2017. "Predicting recessions with boosted regression trees," International Journal of Forecasting, Elsevier, vol. 33(4), pages 745-759.
    12. Prpić, John, 2017. "A Framework for Policy Crowdsourcing," SocArXiv pmfdx, Center for Open Science.
    13. Theresa Maria Rausch & Tobias Albrecht & Daniel Baier, 2022. "Beyond the beaten paths of forecasting call center arrivals: on the use of dynamic harmonic regression with predictor variables," Journal of Business Economics, Springer, vol. 92(4), pages 675-706, May.
    14. Smirnov, Dmitry & Huchzermeier, Arnd, 2020. "Analytics for labor planning in systems with load-dependent service times," European Journal of Operational Research, Elsevier, vol. 287(2), pages 668-681.
    15. Makridakis, Spyros & Hyndman, Rob J. & Petropoulos, Fotios, 2020. "Forecasting in social settings: The state of the art," International Journal of Forecasting, Elsevier, vol. 36(1), pages 15-28.
    16. Seungwook Kim & Daeyoung Choi & Eunjung Lee & Wonjong Rhee, 2017. "Churn prediction of mobile and online casual games using play log data," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-19, July.
    17. Prpić, John, 2017. "The Fundamentals of Policy Crowdsourcing," SocArXiv wdtvh, Center for Open Science.
    18. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    19. Maher Selim & Ryan Zhou & Wenying Feng & Peter Quinsey, 2021. "Estimating Energy Forecasting Uncertainty for Reliable AI Autonomous Smart Grid Design," Energies, MDPI, vol. 14(1), pages 1-15, January.
    20. Naudé, Wim & Bray, Amy & Lee, Celina, 2021. "Crowdsourcing Artificial Intelligence in Africa: Findings from a Machine Learning Contest," IZA Discussion Papers 14545, Institute of Labor Economics (IZA).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0269685. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.