IDEAS home Printed from https://ideas.repec.org/a/epw/ejmath/v3y2022i3id14119.html

Keyword Extraction – Comparison of Latent Dirichlet Allocation and Latent Semantic Analysis

Author

Listed:
  • Bhuvaneshwari Kondeti

    (Osmania University, India)

  • Jyothirani S. A

    (Osmania University, India)

  • Haragopal V. V

    (BITS-Pilani, India)

Abstract

The main aim of the present study is to compare the keywords extracted from abstracts and full length text of scientific research papers. In addition to that, here, we compare Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) to identify better performer for keyword extraction. This comparative study is divided into three levels, In the first level, scientific research articles on topics such as Indian Economic growth, GDP, Economic Slowdown etc. were collected and abstracts and full length text was extracted from the sources and pre-processed to remove the words and characters which were not useful to obtain the semantic structures or necessary patterns to make the meaningful corpus. In the second level, the pre-processed data were converted into a bag of words and numerical statistic TF-IDF (Term Frequency – Inverse Document Frequency) is used to assess how relevant a word is to a document in a corpus. In the third level, in order to study the feasibility of the Natural Language Processing (NLP) techniques, Latent Semantic analysis (LSA) and Latent Dirichlet Allocations (LDA) methods were applied over the resultant corpus.

Suggested Citation

Handle: RePEc:epw:ejmath:v:3:y:2022:i:3:id:14119
DOI: 10.24018/ejmath.2022.3.3.119
as

Download full text from publisher

File URL: https://eu-opensci.org/index.php/ejmath/article/view/14119
File Function: Abstract page
Download Restriction: no

File URL: https://eu-opensci.org/index.php/ejmath/article/download/14119/3171
File Function: Full text
Download Restriction: no

File URL: https://libkey.io/10.24018/ejmath.2022.3.3.119?utm_source=ideas
LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
---><---

More about this item

Keywords

;
;
;
;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:epw:ejmath:v:3:y:2022:i:3:id:14119. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

We have no bibliographic references for this item. You can help adding them by using this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Support Team (email available below). General contact details of provider: https://eu-opensci.org/index.php/ejmath .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.