
Semantically Enriched Variable Length Markov Chain Model for Analysis of User Web Navigation Sessions

Author

Listed:
  • Suresh Shirgave

    (Department of Computer Science and Engineering, Textile and Engineering Institute, Rajwada, Ichalkaranji, Maharashtra 416115, India)

  • Prakash Kulkarni

    (Department of Computer Science and Engineering, Walchand College of Engineering, Vishrambag, Sangli, Maharashtra 416115, India)

  • José Borges

    (INESC TEC, Faculty of Engineering, University of Porto, R. Dr. Roberto Frias, Porto 4200-465, Portugal)

Abstract

The rapid growth of the World Wide Web has resulted in intricate Web sites, demanding enhanced user skills to find the required information and more sophisticated tools that are able to generate apt recommendations. Markov Chains have been widely used to generate next-page recommendations; however, the accuracy of such models is limited. Herein, we propose the novel Semantic Variable Length Markov Chain Model (SVLMC) that combines the fields of Web Usage Mining and Semantic Web by enriching the Markov transition probability matrix with rich semantic information extracted from Web pages. We show that the method is able to enhance the prediction accuracy relative to usage-based higher order Markov models and to semantic higher order Markov models based on an ontology of concepts. In addition, the proposed model is able to handle the problem of ambiguous predictions. An extensive experimental evaluation was conducted on two real-world data sets and on one partially generated data set. The results show that the proposed model is able to achieve 15–20% better accuracy than the usage-based Markov model, 8–15% better than the semantic ontology Markov model and 7–12% better than the semantic-pruned Selective Markov Model. In summary, the SVLMC is the first work proposing the integration of a rich set of detailed semantic information into higher order Web usage Markov models, and experimental results reveal that the inclusion of detailed semantic data enhances the prediction ability of Markov models.
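
For readers unfamiliar with the usage-based baseline the abstract refers to, the following is a minimal sketch of a first-order Markov next-page predictor built from navigation sessions. It is not the SVLMC method: the abstract does not say how semantic information enters the transition matrix, so the optional `similarity` dictionary and the linear `weight` blend are purely hypothetical stand-ins, as are the page names and sessions. The sketch only illustrates how an ambiguous prediction (two candidates with equal usage probability) might be resolved once some extra page-level signal is available.

```python
from collections import Counter, defaultdict


def transition_counts(sessions):
    """Count page-to-page transitions over all navigation sessions."""
    counts = defaultdict(Counter)
    for session in sessions:
        for current_page, next_page in zip(session, session[1:]):
            counts[current_page][next_page] += 1
    return counts


def transition_probabilities(counts):
    """Normalise each row of counts into first-order transition probabilities."""
    probs = {}
    for page, followers in counts.items():
        total = sum(followers.values())
        probs[page] = {nxt: c / total for nxt, c in followers.items()}
    return probs


def predict_next(probs, page, similarity=None, weight=0.0):
    """Return the highest-scoring next page, or None if the page was never left.

    `similarity` is a hypothetical dict mapping (page, candidate) to a score
    in [0, 1]; blending it linearly with the usage probability is an
    assumption made for illustration, not the published model.
    """
    candidates = probs.get(page, {})
    if not candidates:
        return None

    def score(candidate):
        usage = candidates[candidate]
        semantic = similarity.get((page, candidate), 0.0) if similarity else 0.0
        return (1 - weight) * usage + weight * semantic

    # Sorting first makes ties resolve alphabetically instead of arbitrarily.
    return max(sorted(candidates), key=score)


# Toy sessions (invented for this example).
sessions = [
    ["home", "products", "cart"],
    ["home", "products", "reviews"],
    ["home", "about"],
]
probs = transition_probabilities(transition_counts(sessions))

# Hypothetical semantic similarity scores between pages.
sim = {("products", "reviews"): 0.9, ("products", "cart"): 0.2}

usage_only = predict_next(probs, "products")
blended = predict_next(probs, "products", similarity=sim, weight=0.3)
```

Here "cart" and "reviews" each follow "products" once, so usage counts alone cannot separate them and the tie is broken alphabetically. With the illustrative similarity scores, the blended score favours "reviews".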

Suggested Citation

  • Suresh Shirgave & Prakash Kulkarni & José Borges, 2014. "Semantically Enriched Variable Length Markov Chain Model for Analysis of User Web Navigation Sessions," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 13(04), pages 721-753.
  • Handle: RePEc:wsi:ijitdm:v:13:y:2014:i:04:n:s0219622014500643
    DOI: 10.1142/S0219622014500643

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219622014500643
    Download Restriction: Access to full text is restricted to subscribers