Agent-Based Convolution and Reinforcement Learning
Journal Title: BEST : International Journal of Management, Information Technology and Engineering ( BEST : IJMITE ) - Year 2017, Vol 5, Issue 12
The problem with current humanoid models such as DARwIn-OP or Boston Dynamics' Atlas is their limited up-time, which worsens as the number of joints increases. Because these models try to mimic human motion, they end up using many actuators, which in turn consume a lot of battery power. This paper discusses the creation of a new model of humanoid robot that does not try to mimic the bipedal walking gait used by humans. Instead, it uses a full model constructed from scratch around a model-free Deep Q-Learning (DQN) algorithm, which needs no walking sequence or walking model: it learns by trial and error, applying actions to the robot and observing the reward from each action. This enables an under-actuated robot to balance, walk forward, backward, and sideways, and rotate in place using only 4 actuators (two in each leg). The proposed model uses a Region-based Convolutional Neural Network (R-CNN) to detect the goal and inform the robot of its location. A full sensory system consisting of a camera and an Inertial Measurement Unit (IMU) gathers from the robot's environment the inputs required for reaching the goal. Thus, instead of treating the robot as a pre-programmed entity that performs a specific task, we treat it as an agent that can learn to take whatever actions lead toward a specific goal, guided by an evaluation function, so as to maximize a specific reward.
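The trial-and-error loop described in the abstract (apply an action, observe the reward, update the value estimate) can be illustrated with a minimal Q-learning sketch. This is not the authors' implementation: the abstract gives no action encoding, state representation, or hyperparameters, so the discrete action set `ACTIONS`, the learning rate `alpha`, the discount `gamma`, and the tabular storage are all assumptions made for illustration. A DQN would replace the table with a neural network trained toward the same target.

```python
import random

# Hypothetical discrete action set (an assumption, not from the paper):
# nudge one of the 4 actuators one step in either direction.
ACTIONS = [(joint, delta) for joint in range(4) for delta in (-1, +1)]

def td_target(reward, next_q_values, gamma=0.99, done=False):
    """Bellman target r + gamma * max_a' Q(s', a'); just r at episode end."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)

def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon explore at random, otherwise act greedily."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda i: q_values[i])

def q_update(q_table, state, action, reward, next_state,
             alpha=0.1, gamma=0.99, done=False):
    """One tabular Q-learning step; returns the updated Q(state, action)."""
    q_sa = q_table.setdefault(state, [0.0] * len(ACTIONS))
    next_q = q_table.setdefault(next_state, [0.0] * len(ACTIONS))
    target = td_target(reward, next_q, gamma, done)
    # Move the current estimate a fraction alpha toward the target.
    q_sa[action] += alpha * (target - q_sa[action])
    return q_sa[action]
```

In use, the agent would pick an action index with `epsilon_greedy`, apply it to the robot, read the resulting reward and next state from its sensors, and call `q_update`. No walking model is involved at any point, which is what "model-free" refers to here.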
Authors and Affiliations
Samer I. Mohamed, Amr Abdelnabi