Natural Language Description Generation for Image using Deep Learning Architecture

Abstract

Automatic natural description generation of an image is currently a challenging task. To generate a natural language description of the image, the system is implemented by combining with the techniques of computer vision and natural language processing. This paper presents different deep learning models for generating the natural language description of the image. Moreover, we discussed how the deep learning model, which works for the natural language description of an image, can be implemented. This deep learning model consists of Convolutional Neural Network CNN as well as Recurrent Neural Network RNN . The CNN is used for extracting the features from the image and RNN is used for generating the natural language description. To implement the deep learning model in generating the natural language description of an image, we have applied the Flickr 8K dataset and we have also evaluated the performance of the model using the standard evaluation matrices. These experiments show that the model is frequently giving accurate natural language descriptions for an input image. Phyu Phyu Khaing | Mie Mie Aung | Myint San "Natural Language Description Generation for Image using Deep Learning Architecture" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-5 , August 2019, URL: https://www.ijtsrd.com/papers/ijtsrd26708.pdfPaper URL: https://www.ijtsrd.com/computer-science/other/26708/natural-language-description-generation-for-image-using-deep-learning-architecture/phyu-phyu-khaing

Authors and Affiliations

Keywords

Related Articles

Language Factor in Food Sustainability within Kericho Kenya Rural Set Up

This paper aims to highlight the use of apt language to encourage food sustainability among residence of Kericho County, Kenya. The research aims to show the importance of songs and radio call in sections in sensitizing...

Development of an Equation to Estimate the Monthly Rainfall A Case Study for Catarman, Northern Samar, Philippines

This study aimed to derived an equation to estimate the monthly rainfall for Catarman, Northern Samar.The observed monthly rainfall data for Catarman N. Samar, Catbalogan Samar, Legazpi City and Masbate were obtained fro...

Exploring the Link between Operational Efficiency and Firms' Financial Performance An Empirical Evidence from the Ghana Stock Exchange GSE

The purpose of this study was to explore the link between operational efficiency and the financial performance of non financial firms listed on the Ghana Stock Exchange GSE . Specifically, the study sought to determine t...

A Review on Different Topologies and Control Method of Static Synchronous Compensator

The electrical power appliances which convert source frequency to another frequency level is known as frequency converter. This research proposed a novel design method to achieve the 80 kHz high frequency converter. In t...

Big Data Analytics Issues Based on Challenges in IoT

Big data is an enormous data in size, collection of data that are enormous in size and can grow exponentially with time. It can be in three forms such as structured, unstructured and can be semi structured form. Big data...

Download PDF file
  • EP ID EP629211
  • DOI 10.31142/ijtsrd26708
  • Views 106
  • Downloads 0

How To Cite

(2019). Natural Language Description Generation for Image using Deep Learning Architecture. International Journal of Trend in Scientific Research and Development, 3(5), 1575-1581. https://europub.co.uk/articles/-A-629211