IDEAS home Printed from https://ideas.repec.org/a/igg/jirr00/v8y2018i2p57-77.html
   My bibliography  Save this article

Named Entity Based Ranking with Term Proximity for XML Retrieval

Author

Listed:
  • Abubakar Roko

    (University Putra Malaysia, Selangor, Malaysia)

  • Shyamala Doraisamy

    (University Putra Malaysia, Selangor, Malaysia)

  • Azreen Azman

    (University Putra Malaysia, Selangor, Malaysia)

  • Azrul Hazri Jantan

    (University Putra Malaysia, Selangor, Malaysia)

Abstract

In this article, an indexing scheme that includes the named entity category for each indexed term is proposed. Based on this, two methods are proposed, one to infer the semantics of an XML element based on its data content, called the confidence value of the element, and the second method computes the proximity scores of the query terms. The confidence value of an element is obtained based on the probability of a named entity category in the data content of the underlying XML element. The proximity score of the query terms measures the proximity and ordering of the query term within an XML element. The article then shows how a ranking function uses the confidence value of an XML element and proximity score to mitigate the impact of higher frequency terms and compute the relevance between a keyword query and an XML fragment. Finally, a keyword search system is introduced and experiments show that the proposed system outperforms existing approaches in terms of search quality and achieve a higher efficiency.

Suggested Citation

  • Abubakar Roko & Shyamala Doraisamy & Azreen Azman & Azrul Hazri Jantan, 2018. "Named Entity Based Ranking with Term Proximity for XML Retrieval," International Journal of Information Retrieval Research (IJIRR), IGI Global, vol. 8(2), pages 57-77, April.
  • Handle: RePEc:igg:jirr00:v:8:y:2018:i:2:p:57-77
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJIRR.2018040104
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jirr00:v:8:y:2018:i:2:p:57-77. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.