An Analysis of Q-Learning Algorithms with Strategies of Reward Function
Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 2
Abstract
Q-Learning is a Reinforcement Learning technique that works by learning an action-value function that gives the expected utility of taking a given action in a given state and following a fixed policy thereafter. One of the strengths of Q-Learning is that it is able to compare the expected utility of the available actions without requiring a model of the environment. Reinforcement Learning is an approach where the agent needs no teacher to learn how to solve a problem. The only signal used by the agent to learn from his actions in reinforcement environment is the so called reward, a number which tells the agent if his last action was good (or) not. Q-Learning is a recent form of Reinforcement Learning algorithm that does not need a model of its environment and can be used on-line. This paper discusses about the different strategies of Q-Learning algorithms and reward function.
Authors and Affiliations
Ms. S. Manju, , Dr. Ms. M. Punithavalli,
Resource-Aware Load Balancing Scheme using Multi-objective Optimization in Cloud Computing
Cloud computing is a service based, on-demand, pay per use model consisting of an interconnected and virtualizes resources delivered over internet. In cloud computing, usually there are number of jobs that need to be exe...
Coverage Analysis In Wireless Sensor Network
A WSN can be composed of homogeneous or heterogeneous sensor nodes also termed as motes, which adapts the same or different coordination, sensing and computation abilities, respectively. Node deployment is a fundamental...
Meta-Content framework for back index generation
Book reading is a common thing which every one of us does in our life. A common strategy to spot a page for reading is to use front index and back index. A front index generally contains the sections and subsections topi...
Improved CBIR using Multileveled Block Truncation Coding
The paper presents improved content based image retrieval (CBIR) techniques based on multilevel Block truncation coding using multiple threshold values. Block truncation Coding based features is one of the CBIR methods...
Simulation of A Novel Scalable Group Key Management Protocol for Mobile Adhoc Networks
A Mobile Adhoc Network (MANET) is a collection of autonomous nodes that communicate with each other ,most frequently using a multi-hop wireless network. Secure and multicast group communication is an active area of resea...