Fast Iterative model for Sequential-Selection-Based Applications

Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2014, Vol 12, Issue 7

Abstract

Accelerated multi-armed bandit (MAB) model in Reinforcement-Learning for on-line sequential selection problems is presented. This iterative model utilizes an automatic step size calculation that improves the performance of MAB algorithm under different conditions such as, variable variance of reward and larger set of usable actions. As result of these modifications, number of optimal selections will be maximized and stability of the algorithm under mentioned conditions may be amplified. This adaptive model with automatic step size computation may attractive for on-line applications in which,  variance of observations vary with time and re-tuning their step size are unavoidable where, this re-tuning is not a simple task. The proposed model governed by upper confidence bound (UCB) approach in iterative form with automatic step size computation. It called adaptive UCB (AUCB) that may use in industrial robotics, autonomous control and intelligent selection or prediction tasks in the economical engineering applications under lack of information.

Authors and Affiliations

Khosrow Amirizadeh, Rajeswari Mandava

Keywords

Related Articles

A REVIEW ON A NOVEL APPROACH FOR DATA COLLECTION IN WSN

Wireless sensor networks have become increasingly popular due to their wide range of application. Clustering sensor nodes organizing them hierarchically have proven to be an effective method to provide better data aggreg...

An Environment for detection of Bugs through SVM

Mining technique finds hidden patterns from the data stored in the repositories and turn it into useful information and knowledge. Most open source software development projects include an open bug repository in which us...

EFFICIENT MANET- INTERNET INTEGRATION FOR MOBILE DEVICES

A mobile ad hoc network (MANET) consists of wireless mobile nodes without having a fixed infrastructure support. The communication between these mobile nodes is carried out without any centralized control. The communicat...

Performance Analysis of Advanced Hybrid Speech Coding Techniques in Time domain, Spectral domain and Perceptual domain

Speech coding is the art of creating a minimally redundant representation of the speech signal that can be efficiently transmitted or stored in digital media and decoding the signal with the best possible perceptual Qua...

Investigation, Formulation and Development of an Open GUI for the Touchscreen Smartphone

The use of touchscreens in handheld mobile devices, including mobile phones, PDA’s, media players and tablet PC’s, has rapidly increased in recent times. One of the most important aspects of these devices is the soft...

Download PDF file
  • EP ID EP650474
  • DOI 10.24297/ijct.v12i7.3092
  • Views 59
  • Downloads 0

How To Cite

Khosrow Amirizadeh, Rajeswari Mandava (2014). Fast Iterative model for Sequential-Selection-Based Applications. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 12(7), 3689-3696. https://europub.co.uk/articles/-A-650474