IDEAS home Printed from https://ideas.repec.org/a/spr/jcomop/v40y2020i1d10.1007_s10878-020-00578-0.html

Classification optimization for training a large dataset with Naïve Bayes

Authors

Listed:
  • Thi Thanh Sang Nguyen

    (International University – Vietnam National University)

  • Pham Minh Thu Do

    (International University – Vietnam National University)

Abstract

Book classification is widely used in digital libraries, and book rating prediction is important for serving readers better. Commonly used techniques include decision trees, Naïve Bayes (NB), and neural networks. Mining book data also depends on feature selection, data pre-processing, and data preparation. This paper proposes solutions for optimizing knowledge representation and for selecting features in order to enhance book classification, and it identifies appropriate classification algorithms. Several experiments were conducted, and NB was found to provide the best prediction results. With appropriate strategies for feature selection, data type selection, and data transformation, the accuracy and performance of NB can be improved so that it outperforms other classification algorithms.
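
To make the combination of ideas in the abstract concrete, the following is a minimal sketch of a categorical Naïve Bayes classifier preceded by a simple data transformation (binning a numeric column) and a manual feature selection step. It is not the authors' method or code: the toy book records, the column names, the cut points, the `CategoricalNB` class, and the `discretize` and `select` helpers are all hypothetical, and Laplace (add-one) smoothing is an assumption chosen for illustration.

```python
import math
from collections import Counter, defaultdict


class CategoricalNB:
    """Naive Bayes over categorical features with Laplace (add-one) smoothing."""

    def fit(self, rows, labels):
        self.classes = sorted(set(labels))
        self.class_counts = Counter(labels)
        # value_counts[(class, feature index)] -> Counter of observed values
        self.value_counts = defaultdict(Counter)
        # distinct values seen per feature, used in the smoothing denominator
        self.values = [set() for _ in rows[0]]
        for row, label in zip(rows, labels):
            for j, v in enumerate(row):
                self.value_counts[(label, j)][v] += 1
                self.values[j].add(v)
        return self

    def log_scores(self, row):
        """Return log P(class) + sum of log P(feature value | class) per class."""
        total = sum(self.class_counts.values())
        scores = {}
        for c in self.classes:
            s = math.log(self.class_counts[c] / total)
            for j, v in enumerate(row):
                num = self.value_counts[(c, j)][v] + 1
                den = self.class_counts[c] + len(self.values[j])
                s += math.log(num / den)
            scores[c] = s
        return scores

    def predict(self, row):
        scores = self.log_scores(row)
        return max(self.classes, key=lambda c: scores[c])


def discretize(value, cut_points):
    """Data transformation: map a numeric value to the index of its bin."""
    return sum(value >= c for c in cut_points)


def select(rows, keep):
    """Feature selection: keep only the listed column indices."""
    return [tuple(row[j] for j in keep) for row in rows]


# Hypothetical toy records: (genre, page count, format, rating class).
raw = [
    ("fiction", 320, "paperback", "high"),
    ("fiction", 280, "hardcover", "high"),
    ("fiction", 150, "paperback", "low"),
    ("textbook", 900, "hardcover", "low"),
    ("textbook", 750, "paperback", "low"),
    ("textbook", 400, "hardcover", "high"),
]

CUTS = [200, 600]  # assumed bin edges for page count
labels = [r[3] for r in raw]
# Transform the numeric column into a categorical bin index.
transformed = [(g, discretize(p, CUTS), f) for g, p, f, _ in raw]
# Keep genre and page bin; drop format (a hand-picked selection for illustration).
features = select(transformed, keep=[0, 1])

model = CategoricalNB().fit(features, labels)
pred_fiction = model.predict(("fiction", discretize(300, CUTS)))
pred_textbook = model.predict(("textbook", discretize(800, CUTS)))
print(pred_fiction, pred_textbook)
```

Binning the page count turns a numeric attribute into a categorical one, which is one way to read "data type selection" and "data transformation" in the abstract; a real study would choose bins and features from the data (for example, by an information-based criterion) rather than by hand as done here.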

Suggested Citation

  • Thi Thanh Sang Nguyen & Pham Minh Thu Do, 2020. "Classification optimization for training a large dataset with Naïve Bayes," Journal of Combinatorial Optimization, Springer, vol. 40(1), pages 141-169, July.
  • Handle: RePEc:spr:jcomop:v:40:y:2020:i:1:d:10.1007_s10878-020-00578-0
    DOI: 10.1007/s10878-020-00578-0

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10878-020-00578-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10878-020-00578-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.