Image Category Recognition using Bag of Visual Words Representation

Journal Title: Transactions on Machine Learning and Artificial Intelligence - Year 2016, Vol 4, Issue 5

Abstract

Image category recognition is a challenging task because of variation in image background, illumination, scale, clutter, and rotation. The Bag-of-Visual-Words (BoVW) model is considered the standard approach to image categorization, and its performance depends mainly on the local features extracted from the images. This paper proposes a novel BoVW representation that uses Compressed Local Retinal Features (CLRF) for image categorization. CLRF takes interest point regions from an image and transforms them to log-polar form. A two-dimensional Discrete Wavelet Transform (2D DWT) is then applied to compress the log-polar form, and the resulting output is used as the feature for each interest region. These features are used to build a visual vocabulary with the k-means clustering algorithm. The visual vocabulary is then used to form a histogram representation of each image, and the images are classified with a Support Vector Machine (SVM) classifier. The performance of the proposed BoVW framework is evaluated on the SIMPLIcity and butterflies datasets. The experimental results show that the proposed BoVW approach using CLRF is very competitive with state-of-the-art methods.
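
The pipeline outlined in the abstract (log-polar sampling, wavelet compression, k-means vocabulary, histogram) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the sampling grid, the wavelet family, the number of decomposition levels, or the vocabulary size, so the 8x8 grid, the single-level Haar approximation band, k = 3, the random test image, and the hand-picked interest points below are all assumptions made for illustration. The function names are hypothetical, and the interest point detector and the SVM stage are omitted.

```python
import numpy as np

def log_polar_patch(image, center, radius, n_rho=8, n_theta=8):
    """Sample an (n_rho x n_theta) log-polar grid around `center` (assumed grid size)."""
    cy, cx = center
    # Radii spaced logarithmically from 1 pixel out to `radius`.
    rho = np.exp(np.linspace(0.0, np.log(radius), n_rho))
    theta = np.linspace(0.0, 2.0 * np.pi, n_theta, endpoint=False)
    ys = np.rint(cy + rho[:, None] * np.sin(theta)[None, :]).astype(int)
    xs = np.rint(cx + rho[:, None] * np.cos(theta)[None, :]).astype(int)
    # Nearest-neighbour sampling, clamped to the image bounds.
    ys = np.clip(ys, 0, image.shape[0] - 1)
    xs = np.clip(xs, 0, image.shape[1] - 1)
    return image[ys, xs].astype(float)

def haar_approximation(patch):
    """One level of a 2D Haar DWT, keeping only the low-pass (LL) band.

    Assumes both patch dimensions are even. The choice of Haar and of a
    single level is an assumption, not taken from the paper.
    """
    return (patch[0::2, 0::2] + patch[0::2, 1::2]
            + patch[1::2, 0::2] + patch[1::2, 1::2]) / 2.0

def assign(features, centers):
    """Index of the nearest center (squared Euclidean distance) per feature."""
    dists = ((features[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)

def kmeans(features, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns the k cluster centers (the visual words)."""
    rng = np.random.default_rng(seed)
    centers = features[rng.choice(len(features), size=k, replace=False)].copy()
    for _ in range(n_iter):
        labels = assign(features, centers)
        for j in range(k):
            members = features[labels == j]
            if len(members):  # leave an empty cluster's center unchanged
                centers[j] = members.mean(axis=0)
    return centers

def bovw_histogram(features, centers):
    """Normalized count of how often each visual word occurs in one image."""
    labels = assign(features, centers)
    hist = np.bincount(labels, minlength=len(centers)).astype(float)
    return hist / hist.sum()

# Toy data: a random "image" and hand-picked interest points (both hypothetical).
image = np.random.default_rng(0).random((64, 64))
points = [(16, 16), (16, 48), (32, 32), (48, 16), (48, 48), (20, 40)]

features = np.array([
    haar_approximation(log_polar_patch(image, p, radius=12)).ravel()
    for p in points
])
vocabulary = kmeans(features, k=3)
histogram = bovw_histogram(features, vocabulary)
```

In a full system the vocabulary would be built from the features of many training images, and the per-image histograms would then be passed to an SVM classifier as the abstract describes.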

Authors and Affiliations

Suresh Kannaiyan, Rajkumar Kannan, Gheorghita Ghinea

  • EP ID EP277205
  • DOI 10.14738/tmlai.45.2223

How To Cite

Suresh Kannaiyan, Rajkumar Kannan, Gheorghita Ghinea (2016). Image Category Recognition using Bag of Visual Words Representation. Transactions on Machine Learning and Artificial Intelligence, 4(5), 1-9. https://europub.co.uk/articles/-A-277205