Image Category Recognition using Bag of Visual Words Representation

Journal Title: Transactions on Machine Learning and Artificial Intelligence - Year 2016, Vol 4, Issue 5

Abstract

Image category recognition is a challenging task because images differ in background, illumination, scale, clutter, rotation, and other factors. The Bag-of-Visual-Words (BoVW) model is considered the standard approach to image categorization, and its performance depends mainly on the local features extracted from the images. This paper proposes a novel BoVW representation that uses Compressed Local Retinal Features (CLRF) for image categorization. CLRF takes interest point regions from an image and transforms them to log-polar form. A two-dimensional Discrete Wavelet Transform (2D DWT) is then applied to compress the log-polar form, and the resulting coefficients serve as the features of the interest regions. These features are used to build a visual vocabulary with the k-means clustering algorithm. The visual vocabulary is then used to form a histogram representation of each image, and the images are classified with a Support Vector Machine (SVM) classifier. The performance of the proposed BoVW framework is evaluated on the SIMPLIcity and butterflies datasets. The experimental results show that the proposed BoVW approach using CLRF is competitive with state-of-the-art methods.
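
The feature and histogram steps described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the grid sizes, the nearest-neighbour sampling, the choice of a one-level Haar wavelet, and the function names (log_polar, haar_ll, describe, bovw_histogram) are all assumptions made for the example. The k-means vocabulary construction and the SVM classifier are omitted; the vocabulary is simply passed in as a list of cluster centres.

```python
import math

def log_polar(patch, n_rho=8, n_theta=8):
    """Resample a square grayscale patch (list of rows) onto a log-polar grid
    centred on the patch, using nearest-neighbour lookup (assumed scheme)."""
    size = len(patch)
    c = (size - 1) / 2.0          # patch centre; also the largest radius used
    out = []
    for i in range(n_rho):
        # radii grow geometrically up to the largest radius c
        r = math.exp(math.log(c) * (i + 1) / n_rho)
        row = []
        for j in range(n_theta):
            t = 2.0 * math.pi * j / n_theta
            y = min(size - 1, max(0, int(round(c + r * math.sin(t)))))
            x = min(size - 1, max(0, int(round(c + r * math.cos(t)))))
            row.append(patch[y][x])
        out.append(row)
    return out

def haar_ll(img):
    """One-level 2D Haar DWT, keeping only the approximation (LL) band.
    Each 2x2 block maps to (a + b + c + d) / 2; dimensions must be even."""
    h, w = len(img), len(img[0])
    return [[(img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 2.0
             for x in range(0, w - 1, 2)]
            for y in range(0, h - 1, 2)]

def describe(patch):
    """Log-polar transform, then wavelet compression, flattened to a vector."""
    return [v for row in haar_ll(log_polar(patch)) for v in row]

def bovw_histogram(descriptors, vocabulary):
    """Assign each descriptor to its nearest visual word (squared Euclidean
    distance) and return the L1-normalised word-count histogram."""
    counts = [0] * len(vocabulary)
    for d in descriptors:
        dists = [sum((a - b) ** 2 for a, b in zip(d, w)) for w in vocabulary]
        counts[dists.index(min(dists))] += 1
    total = sum(counts)
    return [n / total for n in counts] if total else [0.0] * len(counts)
```

With these assumed defaults, a 16x16 patch is resampled to an 8x8 log-polar grid and compressed to a 16-dimensional descriptor. In a full pipeline the vocabulary would come from k-means over the training descriptors, and the resulting histograms would be the input to the SVM.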

Authors and Affiliations

Suresh Kannaiyan, Rajkumar Kannan, Gheorghita Ghinea

  • EP ID EP277205
  • DOI 10.14738/tmlai.45.2223

How To Cite

Suresh Kannaiyan, Rajkumar Kannan, Gheorghita Ghinea (2016). Image Category Recognition using Bag of Visual Words Representation. Transactions on Machine Learning and Artificial Intelligence, 4(5), 1-9. https://europub.co.uk/articles/-A-277205