IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v4y2019i3p121-d256191.html
   My bibliography  Save this article

Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders

Author

Listed:
  • Matteo Bodini

    (Dipartimento di Informatica “Giovanni Degli Antoni”, Università degli Studi di Milano, Via Celoria 18, 20133 Milano, Italy)

Abstract

Interactions between online users are growing more and more in recent years, due to the latest developments of the web. People share online comments, opinions, and reviews about many topics. Aspect extraction is the automatic process of understanding the topic (the aspect) of such comments, which has obtained huge interest from commercial and academic points of view. For instance, reviews available in webshops (like eBay, Amazon, Aliexpress, etc.) can help the customers in purchasing products and automatic analysis of reviews would be useful, as sometimes it is almost impossible to read all the available ones. In recent years, aspect extraction in the Bangla language has been regarded more and more as a task of growing importance. In the previous literature, a few methods have been introduced to classify Bangla texts according to the aspect they were focused on. This kind of research is limited mainly due to the lack of publicly available datasets for aspect extraction in the Bangla language. We take into account the only two publicly available datasets, recently published, collected for the task of aspect extraction in the Bangla language. Then, we introduce several classification methods based on stacked auto-encoders, as far as we know never exploited in the task of aspect extraction in Bangla, and we achieve better aspect classification performance with respect to the state-of-the-art: the experiments show an average improvement of 0.17 , 0.31 and 0.30 (across the two datasets), respectively in precision, recall and F1-score, reported in the state-of-the-art works that tackled the problem.

Suggested Citation

  • Matteo Bodini, 2019. "Aspect Extraction from Bangla Reviews Through Stacked Auto-Encoders," Data, MDPI, vol. 4(3), pages 1-20, August.
  • Handle: RePEc:gam:jdataj:v:4:y:2019:i:3:p:121-:d:256191
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/4/3/121/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/4/3/121/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Md. Atikur Rahman & Emon Kumar Dey, 2018. "Datasets for Aspect-Based Sentiment Analysis in Bangla and Its Baseline Evaluation," Data, MDPI, vol. 3(2), pages 1-10, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.

      Corrections

      All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:4:y:2019:i:3:p:121-:d:256191. See general information about how to correct material in RePEc.

      If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

      If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

      If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

      For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

      Please note that corrections may take a couple of weeks to filter through the various RePEc services.

      IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.