IDEAS home Printed from https://ideas.repec.org/a/abq/ijist1/v6y2024i2p565-581.html
   My bibliography  Save this article

Contrasting Impact of Start State on Performance of AReinforcement Learning Recommender System

Author

Listed:
  • Sidra Hassan

    (Department of Software Engineering, University of Engineering and Technology, Taxila, Pakistan)

Abstract

A recommendation problem and RL problem are very similar, as both tryto increase user satisfaction in a certain environment. Typical recommender systems mainly rely on history of the user to give future recommendations and doesn’t adapt well tocurrent changing user demands. RL can be used to evolve with currently changing user demands by considering a reward function as feedback. In this paper,recommendation problem is modeled as an RL problem using a squared grid environment, with each grid cell representing a unique stategenerated by a biclusteringalgorithm Bibit. These biclusters are sorted according to their overlapping and then mapped to a squared grid. An RL agent then moveson this grid to obtain recommendations. However, the agent hasto decide themost pertinent start state that can give best recommendations. To decide the start state of the agent, a contrastingimpact of different start states on the performance of RL agent-based RSs is required. For this purpose, we applied seven different similarity measures to determine the start state ofthe RL agent. These similarity measures are diverse, attributed tothe fact that some may not use rating values, some may only use rating values, or some may use global parameters like average rating value or standard deviation in rating values. Evaluation is performed on ML-100K and FilmTrust datasets under different environment settings. Results provedthat careful selection of start state can greatly improve the performance of RL-based recommender systems.

Suggested Citation

  • Sidra Hassan, 2024. "Contrasting Impact of Start State on Performance of AReinforcement Learning Recommender System," International Journal of Innovations in Science & Technology, 50sea, vol. 6(2), pages 565-581, May.
  • Handle: RePEc:abq:ijist1:v:6:y:2024:i:2:p:565-581
    as

    Download full text from publisher

    File URL: https://journal.50sea.com/index.php/IJIST/article/view/803/1389
    Download Restriction: no

    File URL: https://journal.50sea.com/index.php/IJIST/article/view/803
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Shuang-Bo Sun & Zhi-Heng Zhang & Xin-Ling Dong & Heng-Ru Zhang & Tong-Jun Li & Lin Zhang & Fan Min, 2017. "Integrating Triangle and Jaccard similarities for recommendation," PLOS ONE, Public Library of Science, vol. 12(8), pages 1-16, August.
    2. Hael Al-bashiri & Mansoor Abdullateef Abdulgabber & Awanis Romli & Hasan Kahtan, 2018. "An improved memory-based collaborative filtering method based on the TOPSIS technique," PLOS ONE, Public Library of Science, vol. 13(10), pages 1-26, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fuyu Xu & Kate Beard, 2021. "A comparison of prospective space-time scan statistics and spatiotemporal event sequence based clustering for COVID-19 surveillance," PLOS ONE, Public Library of Science, vol. 16(6), pages 1-23, June.
    2. Mubbashir Ayub & Mustansar Ali Ghazanfar & Zahid Mehmood & Tanzila Saba & Riad Alharbey & Asmaa Mahdi Munshi & Mayda Abdullateef Alrige, 2019. "Modeling user rating preference behavior to improve the performance of the collaborative filtering based recommender systems," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-29, August.
    3. Junmei Feng & Xiaoyi Fengs & Ning Zhang & Jinye Peng, 2018. "An improved collaborative filtering method based on similarity," PLOS ONE, Public Library of Science, vol. 13(9), pages 1-18, September.
    4. Gourav Jain & Tripti Mahara & S. C.Sharma, 2023. "Effective time context based collaborative filtering recommender system inspired by Gower’s coefficient," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 14(1), pages 429-447, February.
    5. Mohammed Mghari & Abdelilah Mhamedi & Abdelaaziz El Hibaoui, 2025. "Contextual Product Recommendation Using Transformer-Based Models: Uncovering Product Dependencies in Transactional Data," SN Operations Research Forum, Springer, vol. 6(3), pages 1-25, September.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:abq:ijist1:v:6:y:2024:i:2:p:565-581. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Iqra Nazeer (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.