Agent-Based Convolution and Reinforcement Learning

Abstract

Current humanoid robots such as Darwin-OP and Boston Dynamics' Atlas suffer from limited up-time, a problem that worsens as the number of joints grows. Because these robots try to mimic human motion, they use many actuators, which in turn drain a lot of battery power. This paper presents a new humanoid robot model that does not try to mimic the human bipedal walking gait. Instead, it uses a model built from scratch around a model-free Deep Q-Learning (DQN) algorithm that needs no walking sequence or walking model. The algorithm learns by trial and error: it applies actions to the robot and observes the resulting reward, so that an under-actuated robot learns to balance, walk forward, backward, and sideways, and rotate in place using only 4 actuators (two in each leg). The proposed model uses a Region-based Convolutional Neural Network (R-CNN) to detect the goal and inform the robot of its location. A sensory system consisting of a camera and an Inertial Measurement Unit (IMU) gathers from the robot's environment the inputs required to reach the goal. Thus, instead of treating the robot as a pre-programmed entity that performs a specific task, we treat it as an agent that learns to take actions toward a specific goal, guided by an evaluation function, so as to maximize a specific reward.
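
The trial-and-error loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it uses tabular Q-learning as a simplified stand-in for DQN (which replaces the table with a neural network), and the one-dimensional environment, the reward values, and the names `step`, `train`, and `greedy_policy` are all hypothetical choices made for illustration only.

```python
import random

# Hypothetical toy setup: NOT the paper's robot, environment, or reward.
N_STATES = 5          # positions 0..4 on a line; position 4 is the goal
ACTIONS = (-1, +1)    # step backward or step forward


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else -0.01  # small step cost, bonus at the goal
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a Q-table purely from observed rewards (model-free)."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: mostly exploit the table, sometimes explore.
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
            next_state, reward, done = step(state, ACTIONS[a])
            # Move Q(s, a) toward reward + discounted best next value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Return the highest-valued action for each state."""
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda i: q[s][i])]
            for s in range(N_STATES)]
```

Calling `greedy_policy(train())` gives the learned action for each position. The agent is never told how to reach the goal; it only sees the reward that follows each action, which is the sense in which the approach is model-free.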

Authors and Affiliations

Samer I. Mohamed, Amr Abdelnabi

How To Cite

Samer I. Mohamed, Amr Abdelnabi (2017). Agent-Based Convolution and Reinforcement Learning. BEST: International Journal of Management, Information Technology and Engineering (BEST: IJMITE), 5(12), 17-28. https://europub.co.uk/articles/-A-264556