An Analysis of Q-Learning Algorithms with Strategies of Reward Function
Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 2
Abstract
Q-Learning is a Reinforcement Learning technique that learns an action-value function giving the expected utility of taking a given action in a given state and following the optimal policy thereafter. One of its strengths is that it can compare the expected utility of the available actions without requiring a model of the environment. Reinforcement Learning is an approach in which the agent needs no teacher to learn how to solve a problem. The only signal the agent uses to learn from its actions in a reinforcement environment is the so-called reward, a number that tells the agent whether its last action was good or not. As a recent form of Reinforcement Learning algorithm that needs no environment model, Q-Learning can also be used on-line. This paper discusses different strategies for Q-Learning algorithms and the reward function.
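As an illustration of the model-free update described above, the following is a minimal sketch of tabular Q-Learning in Python. It is not taken from the paper: the chain-world environment, the reward of 1.0 at the rightmost state, the epsilon-greedy exploration, and the names q_update, step, and train are all assumptions chosen for the example.

```python
import random


def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Apply one Q-Learning update in place and return the new value.

    alpha (learning rate) and gamma (discount factor) are illustrative defaults.
    """
    best_next = max(q[next_state])            # value of the greedy next action
    td_target = reward + gamma * best_next    # bootstrapped estimate of the return
    q[state][action] += alpha * (td_target - q[state][action])
    return q[state][action]


def step(state, action, n_states):
    """Hypothetical chain world: action 1 moves right, action 0 moves left.

    The only reward (1.0) is given on reaching the rightmost state,
    which also ends the episode.
    """
    if action == 1:
        next_state = min(state + 1, n_states - 1)
    else:
        next_state = max(state - 1, 0)
    done = next_state == n_states - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(n_states=5, episodes=200, epsilon=0.2, seed=0):
    """Learn a Q-table from reward alone, without a model of the environment."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy choice; ties are broken at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action, n_states)
            q_update(q, state, action, reward, next_state)
            state = next_state
    return q


if __name__ == "__main__":
    for s, row in enumerate(train()):
        print(s, [round(v, 3) for v in row])
```

In this sketch the agent only ever sees the reward returned by step, never the transition rule itself, which is the sense in which the method needs no model and can run on-line, one update per observed transition.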
Authors and Affiliations
Ms. S. Manju, Dr. Ms. M. Punithavalli