Detecting opinion spams through supervised boosting approach

My bibliography Save this article

Detecting opinion spams through supervised boosting approach

Author

Listed:

Mohamad Hazim
Nor Badrul Anuar
Mohd Faizal Ab Razak
Nor Aniza Abdullah

Registered:

Abstract

Product reviews are the individual’s opinions, judgement or belief about a certain product or service provided by certain companies. Such reviews serve as guides for these companies to plan and monitor their business ventures in terms of increasing productivity or enhancing their product/service qualities. Product reviews can also increase business profits by convincing future customers about the products which they have interest in. In the mobile application marketplace such as Google Playstore, reviews and star ratings are used as indicators of the application quality. However, among all these reviews, hereby also known as opinions, spams also exist, to disrupt the online business balance. Previous studies used the time series and neural network approach (which require a lot of computational power) to detect these opinion spams. However, the detection performance can be restricted in terms of accuracy because the approach focusses on basic, discrete and document level features only thereby, projecting little statistical relationships. Aiming to improve the detection of opinion spams in mobile application marketplace, this study proposes using statistical based features that are modelled through the supervised boosting approach such as the Extreme Gradient Boost (XGBoost) and the Generalized Boosted Regression Model (GBM) to evaluate two multilingual datasets (i.e. English and Malay language). From the evaluation done, it was found that the XGBoost is most suitable for detecting opinion spams in the English dataset while the GBM Gaussian is most suitable for the Malay dataset. The comparative analysis also indicates that the implementation of the proposed statistical based features had achieved a detection accuracy rate of 87.43 per cent on the English dataset and 86.13 per cent on the Malay dataset.

Suggested Citation

Mohamad Hazim & Nor Badrul Anuar & Mohd Faizal Ab Razak & Nor Aniza Abdullah, 2018. "Detecting opinion spams through supervised boosting approach," PLOS ONE, Public Library of Science, vol. 13(6), pages 1-23, June.

Handle: RePEc:plo:pone00:0198884
DOI: 10.1371/journal.pone.0198884

Download full text from publisher

References listed on IDEAS

Firdaus Afifi & Nor Badrul Anuar & Shahaboddin Shamshirband & Kim-Kwang Raymond Choo, 2016. "DyHAP: Dynamic Hybrid ANFIS-PSO Approach for Predicting Mobile Malware," PLOS ONE, Public Library of Science, vol. 11(9), pages 1-21, September.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Amna Iqbal & Muhammad Younas & Muhammad Kashif Hanif & Muhammad Murad & Rabia Saleem & Muhammad Aater Javed, 2025. "An intelligent spam detection framework using fusion of spammer behavior and linguistic," PLOS ONE, Public Library of Science, vol. 20(2), pages 1-29, February.
Ajay Kumar & Ram D. Gopal & Ravi Shankar & Kim Hua Tan, 2022. "Fraudulent review detection model focusing on emotional expressions and explicit aspects : investigating the potential of feature engineering," Post-Print hal-03630420, HAL.
Ngai, Eric W.T. & Wu, Yuanyuan, 2022. "Machine learning in marketing: A literature review, conceptual framework, and research agenda," Journal of Business Research, Elsevier, vol. 145(C), pages 35-48.
Ahmad Firdaus & Mohd Faizal Ab Razak & Ali Feizollah & Ibrahim Abaker Targio Hashem & Mohamad Hazim & Nor Badrul Anuar, 2019. "The rise of “blockchain”: bibliometric analysis of blockchain study," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(3), pages 1289-1331, September.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Yong Fang & Yuetian Zeng & Beibei Li & Liang Liu & Lei Zhang, 2020. "DeepDetectNet vs RLAttackNet: An adversarial method to improve deep learning-based static malware detection model," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-32, April.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0198884. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Detecting opinion spams through supervised boosting approach

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data