IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0198884.html
   My bibliography  Save this article

Detecting opinion spams through supervised boosting approach

Author

Listed:
  • Mohamad Hazim
  • Nor Badrul Anuar
  • Mohd Faizal Ab Razak
  • Nor Aniza Abdullah

Abstract

Product reviews are the individual’s opinions, judgement or belief about a certain product or service provided by certain companies. Such reviews serve as guides for these companies to plan and monitor their business ventures in terms of increasing productivity or enhancing their product/service qualities. Product reviews can also increase business profits by convincing future customers about the products which they have interest in. In the mobile application marketplace such as Google Playstore, reviews and star ratings are used as indicators of the application quality. However, among all these reviews, hereby also known as opinions, spams also exist, to disrupt the online business balance. Previous studies used the time series and neural network approach (which require a lot of computational power) to detect these opinion spams. However, the detection performance can be restricted in terms of accuracy because the approach focusses on basic, discrete and document level features only thereby, projecting little statistical relationships. Aiming to improve the detection of opinion spams in mobile application marketplace, this study proposes using statistical based features that are modelled through the supervised boosting approach such as the Extreme Gradient Boost (XGBoost) and the Generalized Boosted Regression Model (GBM) to evaluate two multilingual datasets (i.e. English and Malay language). From the evaluation done, it was found that the XGBoost is most suitable for detecting opinion spams in the English dataset while the GBM Gaussian is most suitable for the Malay dataset. The comparative analysis also indicates that the implementation of the proposed statistical based features had achieved a detection accuracy rate of 87.43 per cent on the English dataset and 86.13 per cent on the Malay dataset.

Suggested Citation

  • Mohamad Hazim & Nor Badrul Anuar & Mohd Faizal Ab Razak & Nor Aniza Abdullah, 2018. "Detecting opinion spams through supervised boosting approach," PLOS ONE, Public Library of Science, vol. 13(6), pages 1-23, June.
  • Handle: RePEc:plo:pone00:0198884
    DOI: 10.1371/journal.pone.0198884
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0198884
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0198884&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0198884?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Firdaus Afifi & Nor Badrul Anuar & Shahaboddin Shamshirband & Kim-Kwang Raymond Choo, 2016. "DyHAP: Dynamic Hybrid ANFIS-PSO Approach for Predicting Mobile Malware," PLOS ONE, Public Library of Science, vol. 11(9), pages 1-21, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ngai, Eric W.T. & Wu, Yuanyuan, 2022. "Machine learning in marketing: A literature review, conceptual framework, and research agenda," Journal of Business Research, Elsevier, vol. 145(C), pages 35-48.
    2. Ahmad Firdaus & Mohd Faizal Ab Razak & Ali Feizollah & Ibrahim Abaker Targio Hashem & Mohamad Hazim & Nor Badrul Anuar, 2019. "The rise of “blockchain”: bibliometric analysis of blockchain study," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(3), pages 1289-1331, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yong Fang & Yuetian Zeng & Beibei Li & Liang Liu & Lei Zhang, 2020. "DeepDetectNet vs RLAttackNet: An adversarial method to improve deep learning-based static malware detection model," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-32, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0198884. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.