Automatic Arabic Image Captioning using RNN-LSTM-Based Language Model and CNN

Abstract

The automatic generation of correct syntaxial and semantical image captions is an essential problem in Artificial Intelligence. The existence of large image caption copra such as Flickr and MS COCO have contributed to the advance of image captioning in English. However, it is still behind for Arabic given the scarcity of image caption corpus for the Arabic language. In this work, an Arabic version that is a part of the Flickr and MS COCO caption dataset is built. Moreover, a generative merge model for Arabic image captioning based on a deep RNN-LSTM and CNN model is developed. The results of the experiments are promising and suggest that the merge model can achieve excellent results for Arabic image captioning if a larger corpus is used.

Authors and Affiliations

Huda A. Al-muzaini, Tasniem N. Al-yahya, Hafida Benhidour

Keywords

Related Articles

Design and Implementation of an Interpreter Using Software Engineering Concepts

In this paper, an interpreter design and implementation for a small subset of C Language using software engineering concepts are presented. This paper reinforces an argument for the application of software engineering co...

Database Preservation: The DBPreserve Approach

In many institutions relational databases are used as a tool for managing information related to day to day activities. Institutions may be required to keep the information stored in relational databases accessible becau...

Implementation of Intelligent Automated Gate System with QR Code

This paper is about QR code-based automated gate system. The aim of the research is to develop and implement a type of medium-level security gate system especially for small companies that cannot afford to install high-t...

The Art of Crypto Currencies

Crypto Currencies have recently gained enormous popularity amongst the general public. With each passing day, more and more companies are radically accepting crypto cur-rencies in their payment systems, paving way for an...

Quantization Table Estimation in JPEG Images

Most digital image forgery detection techniques require the doubtful image to be uncompressed and in high quality. However, most image acquisition and editing tools use the JPEG standard for image compression. The histog...

Download PDF file
  • EP ID EP311658
  • DOI 10.14569/IJACSA.2018.090610
  • Views 152
  • Downloads 0

How To Cite

Huda A. Al-muzaini, Tasniem N. Al-yahya, Hafida Benhidour (2018). Automatic Arabic Image Captioning using RNN-LSTM-Based Language Model and CNN. International Journal of Advanced Computer Science & Applications, 9(5), 67-73. https://europub.co.uk/articles/-A-311658