An Analysis of Q-Learning Algorithms with Strategies of Reward Function

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 2

Abstract

Q-Learning is a Reinforcement Learning technique that works by learning an action-value function that gives the expected utility of taking a given action in a given state and following a fixed policy thereafter. One of the strengths of Q-Learning is that it is able to compare the expected utility of the available actions without requiring a model of the environment. Reinforcement Learning is an approach where the agent needs no teacher to learn how to solve a problem. The only signal used by the agent to learn from his actions in reinforcement environment is the so called reward, a number which tells the agent if his last action was good (or) not. Q-Learning is a recent form of Reinforcement Learning algorithm that does not need a model of its environment and can be used on-line. This paper discusses about the different strategies of Q-Learning algorithms and reward function.

Authors and Affiliations

Ms. S. Manju, , Dr. Ms. M. Punithavalli,

Keywords

Related Articles

Parsing Complementizer Phrases In Machine Translation System

Every language has a finite number of words and finite number of rules but infinite number of sentences. Sentences are not formed by the words alone but by structural units known as constituents. Analysis of the sentence...

Recognizing faces with single sample per subject using fusion of transforms

Face recognition has attracted attention of the researchers. Face recognition becomes challenging if various factors are considered such as varying illumination, pose, facial expression and somewhat occlusion. The face r...

Modeling Virtual Meetings within Software Engineering Environment

It is a common scenario to see project’s stakeholders, such as managers, team leaders, and developers carrying out their meeting in the online environment without a suitable preparation and facilitation For instance, sta...

A New Multi Fractal Dimension Method for Face Recognition with Fewer Features under Expression Variations

In this work, a new method is presented as a mingle of Principal Component Analysis (PCA) and Multi-Fractal Dimension analysis (MFD) for feature extraction. Proposed method makes use of best decision taken from both the...

REMOTE SENSING IMAGE COMPRESSION USING 3D-SPIHT ALGORITHM AND 3D-OWT

Remote Sensing is the gathering of information about a place from a distance. Such information can occur by sensors or satellite, without making any direct contact with that object. We present a new technique for the com...

Download PDF file
  • EP ID EP102749
  • DOI -
  • Views 163
  • Downloads 0

How To Cite

Ms. S. Manju, , Dr. Ms. M. Punithavalli, (2011). An Analysis of Q-Learning Algorithms with Strategies of Reward Function. International Journal on Computer Science and Engineering, 3(2), 814-820. https://europub.co.uk/articles/-A-102749