Contrasting Impact of Start State on Performance of AReinforcement Learning Recommender System

Abstract

A recommendation problem and RL problem are very similar, as both try to increase user satisfaction in a certain environment. Typical recommender systems mainly rely on history of the user to give future recommendations and doesn’t adapt well to current changing user demands. RL can be used to evolve with currently changing user demands by considering a reward function as feedback. In this paper, recommendation problem is modeled as an RL problem using a squared grid environment, with each grid cell representing a unique state generated by a biclustering algorithm Bibit. These biclusters are sorted according to their overlapping and then mapped to a squared grid. An RL agent then moves on this grid to obtain recommendations. However, the agent has to decide the most pertinent start state that can give best recommendations. To decide the start state of the agent, a contrasting impact of different start states on the performance of RL agent-based RSs is required. For this purpose, we applied seven different similarity measures to determine the start state of the RL agent. These similarity measures are diverse, attributed to the fact that some may not use rating values, some may only use rating values, or some may use global parameters like average rating value or standard deviation in rating values. Evaluation is performed on ML-100K and FilmTrust datasets under different environment settings. Results proved that careful selection of start state can greatly improve the performance of RL-based recommender systems,

Authors and Affiliations

Sidra Hassan, Mubbashir Ayub, Muhammad Waqar, Tasawer Khan

Keywords

Related Articles

Assessment of Public Participation Modalities through Social Media Platforms for Approval of Private Housing Schemes: Case Studies under LDA Lahore, Pakistan

Public participation through social media networks in Private housing scheme (PHS) projects is essential for fostering a feeling of community and avoiding resistance to the planning of housing scheme initiatives. It mi...

Towards End-to-End Speech Recognition System for Pashto Language Using Transformer Model

The conventional use of Hidden Markov Models (HMMs), and Gaussian Mixture Models (GMMs) for speech recognition posed setup challenges and inefficiency. This paper adopts the Transformer model for Pashto continuous sp...

Lightweight Cryptography Algorithms for Internet of ThingsEnabled Networks.A Comparative Study

The rapid advancement of technology has facilitated the interconnection of numerous devices, enabling the collection of vast amounts of data. Consequently, ensuring security within IoT networks has become a top priorit...

Particle Filter Based Multi-sensor Fusion for Remaining Service Life Estimation of Energized LV-Aerial Bundled Cables

Aerial Bundled Cables (ABC) consist of several wires that contain numerous layers of thermal insulation, which reduces the risk of theft. Nonetheless, there have been regular reports of rapid degeneration of such cable...

Fabrication of Smart Syringe Infusion Device: A Solution for Healthcare Industry

Accurate medication delivery is essential in intensive care units, where precision in drug delivery is crucial. In order to address the need for increased accuracy and efficiency in workflow, this study proposes a semi...

Download PDF file
  • EP ID EP760327
  • DOI -
  • Views 12
  • Downloads 0

How To Cite

Sidra Hassan, Mubbashir Ayub, Muhammad Waqar, Tasawer Khan (2024). Contrasting Impact of Start State on Performance of AReinforcement Learning Recommender System. International Journal of Innovations in Science and Technology, 6(2), -. https://europub.co.uk/articles/-A-760327