Enhancement in Decision Making with Improved Performance by Multiagent Learning Algorithms
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 3
Abstract
Abstract:The output of the system is a sequence of actions in some applications. There is no such measure as the best action in any in-between state; an action is excellent if it is part of a good policy. A single action is notimportant; the policy is important that is the sequence of correct actions to reach the goal. In such a case, machine learning program should be able to assess the goodness of policies and learn from past good actionsequences to be able to generate a policy. A multi-agent environment is one in which there is more than one agent, where they interact with one another, and further, where there are restrictions on that environment such that agents may not at any given time know everything about the world that other agents know. Two features of multi-agent learning which establish its study as a separate field from ordinary machine learning. Parallelism, scalability, simpler construction and cost effectiveness are main characteristics of multi-agent systems. Multiagent learning model is given in this paper. Two multiagent learning algorithms i. e. Strategy Sharing & Joint Rewards algorithm are implemented. In Strategy Sharing algorithm simple averaging of Q tables is taken. Each Q-learning agent learns from all of its teammates by taking the average of Q-tables. Joint reward learning algorithm combines the Q learning with the idea of joint rewards. Paper shows result and performance comparison of the two multiagent learning algorithms.
Authors and Affiliations
Deepak A. Vidhate , Dr. Parag Kulkarni
An Efficient Approach of Segmentation and Blind Deconvolution in Image Restoration
Abstract :This paper introduces the concept of Blind Deconvolution for restoration of a digital image and small segments of a single image that has been degraded due to some noise. Concept of Image Restoration isused in...
A New Approach and Algorithm for Baseline Detection of Arabic Handwriting
Abstract : Automatic baseline detection of handwritten Arabic words is a crucial task for OCR. It is extensively used in many preprocessing processes such as text normalization, skew/slant correction, and letters segment...
A Short Range Wireless Communication Using Android NFC API
In this paper, we are proposing the implementation of short range wireless communication using Android’s NFC API. Near Field Communication is set of protocols used for communication between two android powered and NFC en...
“WiMAX-WLAN Interface usingTORA, DSR and OLSR protocols with their evaluation under Wormhole Attack on VOICE and HTTP applications”
Abstract: We are in advanced world of internet with new technologies in now these days. So many new wireless networks technologies have been emerged. WiMAX is one of the advanced technologies from those. Due to adv...
Video Segmentation Using Global Motion Estimation and Compensation
Abstract : Video has to be segmented into objects for content-based processing. A number of video object segmentation algorithms have been proposed such as semiautomatic and automatic. Semiautomatic methods adds burden t...