Q-Value Based Particle Swarm Optimization for Reinforcement Neuro-Fuzzy System Design
Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 10
Abstract
This paper proposes a combination of particle swarm optimization (PSO) and a Q-value based safe reinforcement learning scheme for neuro-fuzzy systems (NFS). The proposed Q-value based particle swarm optimization (QPSO) extends PSO-based NFS with reinforcement learning; that is, it gives PSO-based NFS a way to learn optimal control policies in environments where only weak reinforcement signals are available. The reinforcement learning scheme is designed according to Lyapunov principles and offers several practical benefits, including efficient learning and the ability to maintain a system's state within a desired operating range. In QPSO, the parameters of an NFS are encoded in a particle, and each particle is evaluated by its Q-value. The Q-value accumulates the reward received during a learning trial and serves as the fitness function for PSO evolution. During a trial, one particle is selected from the swarm; a corresponding NFS is then built and applied to the environment, which returns an immediate feedback reward. The applicability of QPSO is shown through simulations of single-link and double-link inverted pendulum systems.
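The idea of using reward accumulated over a trial as the PSO fitness can be illustrated with a minimal Python sketch. It is not the authors' implementation. Everything in it is an assumption made for illustration: a single linear feedback gain stands in for the neuro-fuzzy system, a one-dimensional unstable plant stands in for the inverted pendulum, the reward is +1 per step while the state stays within a limit, the rewards are summed without discounting, and the PSO coefficients `w`, `c1`, and `c2` are arbitrary common choices. The Lyapunov-based safety design described in the abstract is not modeled.

```python
import random


def run_trial(params, steps=200, limit=1.0):
    """Run one learning trial and return the accumulated reward.

    Assumption: params[0] is a linear feedback gain standing in for the
    NFS parameters that a particle would encode in the paper.
    """
    gain = params[0]
    x = 0.5                      # initial state of the toy plant
    total = 0.0
    for _ in range(steps):
        u = -gain * x            # control action from the stand-in controller
        x = 1.1 * x + 0.1 * u    # hypothetical unstable plant, not a pendulum
        if abs(x) > limit:       # state left the allowed range: trial fails
            break
        total += 1.0             # immediate reward for staying in range
    return total


def pso(fitness, dim=1, swarm_size=10, iters=30, seed=0):
    """Plain global-best PSO that maximizes `fitness`."""
    rng = random.Random(seed)
    w, c1, c2 = 0.7, 1.5, 1.5    # assumed inertia and acceleration weights
    pos = [[rng.uniform(-5.0, 5.0) for _ in range(dim)]
           for _ in range(swarm_size)]
    vel = [[0.0] * dim for _ in range(swarm_size)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(p) for p in pos]
    g = max(range(swarm_size), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]

    for _ in range(iters):
        for i in range(swarm_size):
            for d in range(dim):
                vel[i][d] = (w * vel[i][d]
                             + c1 * rng.random() * (pbest[i][d] - pos[i][d])
                             + c2 * rng.random() * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            # Each particle is scored by the reward accumulated in one trial.
            f = fitness(pos[i])
            if f > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], f
                if f > gbest_fit:
                    gbest, gbest_fit = pos[i][:], f
    return gbest, gbest_fit


if __name__ == "__main__":
    best, best_fit = pso(run_trial)
    print("best gain:", best, "accumulated reward:", best_fit)
```

In this toy setting, a particle whose gain stabilizes the plant survives all 200 steps and collects the maximum reward, while a destabilizing gain ends the trial early and scores lower, so the swarm is pulled toward controllers that keep the state in range.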
Authors and Affiliations
Yi-Chang Cheng, Sheng-Fuu Lin, Chi-Yao Hsu