Data Augmentation to Stabilize Image Caption Generation Models in Deep Learning

Abstract

Automatic image caption generation is a challenging AI problem since it requires utilization of several techniques from different computer science domains such as computer vision and natural language processing. Deep learning techniques have demonstrated outstanding results in many different applications. However, data augmentation in deep learning, which replicates the amount and the variety of training data available for learning models without the burden of collecting new data, is a promising field in machine learning. Generating textual description for a given image is a challenging task for computers. Nowadays, deep learning performs a significant role in the manipulation of visual data with the help of Convolutional Neural Networks (CNN). In this study, CNNs are employed to train prediction models which will help in automatic image caption generation. The proposed method utilizes the concept of data augmentation to overcome the fuzziness of well-known image caption generation models. Flickr8k dataset is used in the experimental work of this study and the BLEU score is applied to evaluate the reliability of the proposed method. The results clearly show the stability of the outcomes generated through the proposed method when compared to others.

Authors and Affiliations

Hamza Aldabbas, Muhammad Asad, Mohammad Hashem Ryalat, Kaleem Razzaq Malik, Muhammad Zubair Akbar Qureshi

Keywords

Related Articles

Stable Beneficial Group Activity Formation

Computational models are one of the very powerful tools for expressing everyday situations that are derived from human interactions. In this paper, an investigation of the problem of forming beneficial groups based on th...

Robust Video Content Authentication using Video Binary Pattern and Extreme Learning Machine

Recently, due to easy accessibility of smartphones, digital cameras and other video recording devices, a radical enhancement has been experienced in the field of digital video technology. Digital videos have become very...

Robust Recurrent Cerebellar Model Articulation Controller for Non-Linear MIMO Systems

This research proposes a robust recurrent cerebellar model articulation control system (RRCMACS) for MIMO non-linear systems to achieve the robustness of the system during operation. In this system, the superior properti...

Toward Information Diffusion Model for Viral Marketing in Business

Current obstacles in the study of social media marketing include dealing with massive data and real-time updates have motivated to contribute solutions that can be adopted for viral marketing. Since information diffusion...

A Built-in Criteria Analysis for Best IT Governance Framework

The implementation of IT governance is important to lead and evolve the information system in agreement with stakeholders. This requirement is seriously amplified at the time of the digital area considering all the new t...

Download PDF file
  • EP ID EP665240
  • DOI 10.14569/IJACSA.2019.0101074
  • Views 74
  • Downloads 0

How To Cite

Hamza Aldabbas, Muhammad Asad, Mohammad Hashem Ryalat, Kaleem Razzaq Malik, Muhammad Zubair Akbar Qureshi (2019). Data Augmentation to Stabilize Image Caption Generation Models in Deep Learning. International Journal of Advanced Computer Science & Applications, 10(10), 571-579. https://europub.co.uk/articles/-A-665240