Knowledge transfer between heterogeneous reinforcement learning agent
Journal Title: Science Paper Online - Year 2010, Vol 5, Issue 2
Abstract
Aiming at the problem of the existing knowledge transfer methods are only suitable for homogenous reinforcement learning agents, a kind of Q learning algorithm that can transfer knowledge between heterogeneous Agents with different state and action spaces. The main idea of the proposed Q learning algorithm can be described as the follows. Based on a task that was already learned by an old and a new Agent, a neural network was used to off-line learn a mapping relationship of Q value function between the two Agents. The constructed mapping of Q value function was then used to obtain Q value of the new Agent in a new task that was already learned by the old Agent while was not learned by the new Agent. The proposed Q learning algorithm can decrease the number of trials of the new Agent and so as to improve learning speed. Simulation results of 10×10 mazes illustrate the validity of the proposed Q learning algorithm.
Authors and Affiliations
Bo Liu, Ruhai Lei
流体介质中柔性平板振动的特征灵敏度分析
柔性结构在流体介质中振动,诱导流场对振动的影响不可忽略,分析结构振动特性的灵敏度也需要考虑诱导流场的影响。论文作者在偶极子配置法计算流场附加质量的基础上,利用灵敏度分析的直接法发展了一种考虑柔性平板在流体介质中振动的流固...
Tow-dimensional CFD simulation of bubbly flow in structured packings
Basic research of bubbly flow in structured packings is of great importance for the development of novel gas-liquid mass transfer and reaction equipments. Simplified two-dimensional model was constructed and the volume...
Adsorption performance study of pitch-based spherical activated carbon for several zymo-moleculars
Adsorption behavior of beneficial molecular -amylase and pepsase from aqueous solutions onto pitch-based spherical activated carbon (PSAC) with different BET surface area and pore structure has been studied by UV spectr...
多目标模糊决策模型在水资源配置<br /> 方案评价中的应用<br />
水资源的合理配置是个多目标决策问题。本文提出的多目标模糊决策模型,可以在考虑地区差异的前提下,对整个区域上的水资源配置进行综合评价,大大提高了决策的科学性。同时依据此模型设计了相应的水资源配置方案决策系统,使理论更好地应...
基于离散曲率的扫描线条图快速圆弧检测
本文提出一种从圆弧假设到圆弧验证模式的扫描线条图的圆弧检测方法。该方法首先提取扫描线条图的图像骨骼,使用分段线性多边形对线条图像骨骼进行近似表示,来达到简化计算和减少数据量的目的;再通过对骨骼图像的局部离散曲率的计算和统...