IDEAS home Printed from https://ideas.repec.org/a/spr/ijsaem/v13y2022i5d10.1007_s13198-022-01720-3.html
   My bibliography  Save this article

Machine learning algorithms for predicting smokeless tobacco status among women in Northeastern States, India

Author

Listed:
  • Kh. Jitenkumar Singh

    (ICMR)

  • A. Jiran Meitei

    (Maharaja Agrasen College, University of Delhi)

  • Nongzaimayum Tawfeeq Alee

    (Amity University Maharashtra)

  • Mosoniro Kriina

    (ICMR)

  • Nirendrakumar Singh Haobijam

    (Jawaharlal Nehru Institute of Medical Sciences)

Abstract

Use of smokeless tobacco (SLT) in women is very high and serious public health issue in the northeast states, India. Prediction on status of SLT use among women is a key to policy making and resource planning at district and community level in this region. This study aims to predict the status of smokeless tobacco use among women in northeast states of India by applying several machine learning (ML) algorithms. We used publicly available National Family Health Survey, 2015–16 data. Eight ML algorithms were used for the prediction on status of SLT use. Precision, specificity, sensitivity, accuracy, and Cohen’s kappa statistic were performed as a part of the systematic assessment of the algorithms. Result of this study reveals that the best classification performance was accomplished with random forest (RF) algorithm accuracy of 79.51% [77.65–81.37], sensitivity of 87.75% [86.55–88.95], specificity of 65.19% [65.18–65.20], precision of 81.39%, F-measure of 84.35 and Cohen’s Kappa was 0.545 [0.529–0.558]. It was concluded that the algorithm of random forest was found superior and performed much better as compared to the rest ML algorithms in predicting the status on smokeless tobacco use in women of northeast states, India. Finally, this research finding recommends application of RF algorithm for classification and feature selection to predict the status of smokeless tobacco as a core interest.

Suggested Citation

  • Kh. Jitenkumar Singh & A. Jiran Meitei & Nongzaimayum Tawfeeq Alee & Mosoniro Kriina & Nirendrakumar Singh Haobijam, 2022. "Machine learning algorithms for predicting smokeless tobacco status among women in Northeastern States, India," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 13(5), pages 2629-2639, October.
  • Handle: RePEc:spr:ijsaem:v:13:y:2022:i:5:d:10.1007_s13198-022-01720-3
    DOI: 10.1007/s13198-022-01720-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13198-022-01720-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13198-022-01720-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. repec:cdl:ctcres:qt1g16k8b9 is not listed on IDEAS
    2. Tobias Cagala, 2017. "Improving data quality and closing data gaps with machine learning," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Data needs and Statistics compilation for macroprudential analysis, volume 46, Bank for International Settlements.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Davide Nicola Continanza & Andrea del Monaco & Marco di Lucido & Daniele Figoli & Pasquale Maddaloni & Filippo Quarta & Giuseppe Turturiello, 2023. "Stacking machine learning models for anomaly detection: comparing AnaCredit to other banking data sets," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Data science in central banking: applications and tools, volume 59, Bank for International Settlements.
    2. Fabio Zambuto & Simona Arcuti & Roberto Sabatini & Daniele Zambuto, 2021. "Application of classification algorithms for the assessment of confirmation to quality remarks," Questioni di Economia e Finanza (Occasional Papers) 631, Bank of Italy, Economic Research and International Relations Area.
    3. José María Serena Garralda & Bruno Tissot, 2018. "Central banks and trade repositories derivatives data," IFC Reports 7, Bank for International Settlements.
    4. Teng, Sin Yong & Touš, Michal & Leong, Wei Dong & How, Bing Shen & Lam, Hon Loong & Máša, Vítězslav, 2021. "Recent advances on industrial data-driven energy savings: Digital twins and infrastructures," Renewable and Sustainable Energy Reviews, Elsevier, vol. 135(C).
    5. Fabio Zambuto, 2021. "Quality checks on granular banking data: an experimental approach based on machine learning," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Micro data for the macro world, volume 53, Bank for International Settlements.
    6. Okiriza Wibisono & Hidayah Dhini Ari & Anggraini Widjanarti & Alvin Andhika Zulen & Bruno Tissot, 2019. "The use of big data analytics and artificial intelligence in central banking – An overview," IFC Bulletins chapters, in: Bank for International Settlements (ed.), The use of big data analytics and artificial intelligence in central banking, volume 50, Bank for International Settlements.
    7. Francesco Cusano & Giuseppe Marinelli & Stefano Piermattei, 2022. "Learning from revisions: an algorithm to detect errors in banks’ balance sheet statistical reporting," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(6), pages 4025-4059, December.
    8. Claudia Biancotti & Alfonso Rosolia & Giovanni Veronese & Robert Kirchner & Francois Mouriaux, 2021. "Covid-19 and official statistics: a wakeup call?," Questioni di Economia e Finanza (Occasional Papers) 605, Bank of Italy, Economic Research and International Relations Area.
    9. Francesco Cusano & Giuseppe Marinelli & Stefano Piermattei, 2021. "Learning from revisions: a tool for detecting potential errors in banks' balance sheet statistical reporting," Questioni di Economia e Finanza (Occasional Papers) 611, Bank of Italy, Economic Research and International Relations Area.
    10. Ezgi Deryol & Duygu Konukçu & Robert Szemere & Bruno Tissot, 2019. "Mind the data gap: commercial property prices for policy," IFC Reports 8, Bank for International Settlements.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:ijsaem:v:13:y:2022:i:5:d:10.1007_s13198-022-01720-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.