IDEAS home Printed from https://ideas.repec.org/a/igg/jban00/v3y2016i4p64-82.html
   My bibliography  Save this article

Document Retrieval using Efficient Indexing Techniques: A Review

Author

Listed:
  • Shweta Gupta

    (Department of Computer Science and Engineering, Ajay Kumar Garg Engineering College, Ghaziabad, India & Dr. A.P.J. Abdul Kalam Technical University, Uttar Pradesh, India)

  • Sunita Yadav

    (Department of Computer Science and Engineering, Ajay Kumar Garg Engineering College, Ghaziabad, India & Dr. A.P.J. Abdul Kalam Technical University, Uttar Pradesh, India)

  • Rajesh Prasad

    (Department of Computer Science, Yobe State University, Damaturu, Nigeria)

Abstract

Document retrieval plays a crucial role in retrieving relevant documents. Relevancy depends upon the occurrences of query keywords in a document. Several documents include a similar key terms and hence they need to be indexed. Most of the indexing techniques are either based on inverted index or full-text index. Inverted index create lists and support word-based pattern queries. While full-text index handle queries comprise of any sequence of characters rather than just words. Problems arise when text cannot be separated as words in some western languages. Also, there are difficulties in space used by compressed versions of full-text indexes. Recently, one of the unique data structure called wavelet tree has been popular in the text compression and indexing. It indexes words or characters of the text documents and help in retrieving top ranked documents more efficiently. This paper presents a review on most recent efficient indexing techniques used in document retrieval.

Suggested Citation

  • Shweta Gupta & Sunita Yadav & Rajesh Prasad, 2016. "Document Retrieval using Efficient Indexing Techniques: A Review," International Journal of Business Analytics (IJBAN), IGI Global, vol. 3(4), pages 64-82, October.
  • Handle: RePEc:igg:jban00:v:3:y:2016:i:4:p:64-82
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJBAN.2016100104
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jban00:v:3:y:2016:i:4:p:64-82. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.