Image Category Recognition using Bag of Visual Words Representation

Journal Title: Transactions on Machine Learning and Artificial Intelligence - Year 2016, Vol 4, Issue 5

Abstract

Image category recognition is one of the challenging tasks due to difference in image background, illumination, scale, clutter, rotation, etc. Bag-of-Visual-Words (BoVW) model is considered as the standard approach for image categorization. The performance of the BoVW is mainly depend on local features extracted from images. In this paper, a novel BoVW representation approach utilizing Compressed Local Retinal Features (CLRF) for image categorization is proposed. The CLRF uses interest point regions from images and transform them to log polar form. Then two dimensional Discrete Wavelet Transformation (2D DWT) is applied to compress the log polar form and the resultant are considered as features for the interest regions. These features are further used to build a visual vocabulary using k-means clustering algorithm. Then this visual vocabulary is used to form a histogram representation of each image where the images are further classified using Support Vector Machines (SVM) classifier. The performance of the proposed BoVW framework is evaluated using SIMPLIcity and butterflies datasets. The experimental results show that the proposed BoVW approach that uses CLRF is very competitive to the state-of-the-art methods.

Authors and Affiliations

Suresh Kannaiyan, Rajkumar Kannan, Gheorghita Ghinea

Keywords

Related Articles

Performance Evaluation of Some Selected Sorting Algorithms by the Use of Halstead Complexity Metrics

Complexity is developed to demonstrate feasible metrics for process obtaining objectives and quantifiable measurement, which may have numerous valuable applications in schedule and budget planning, cost estimation and op...

3D HMM-based Facial Expression Recognition using Histogram of Oriented Optical Flow

In this paper, we propose a 3D HMM (Three-dimensional Hidden Markov Models) approach to recognizing human facial expressions and associated emotions. Human emotion is usually classified by psychologists into six categori...

An Objective Approach to Schizophrenia Recognition Utilizing an Adaptive Neuro-Fuzzy Inference (ANFIS) Model

Schizophrenia is a brain disorder that distorts the way a person thinks, acts, expresses emotions, perceives reality, and relates to others. A systematic approach and an overview perception has been carried out over the...

Migration of the Temporal RDB into Temporal ORDB including Bitemporal Data : Phases

This paper proposes an approach for migrating existing relational database (TRDB) according to SQL: 2011 standard into temporal object relational database (TORDB) including Bitemporal data. This is done with methods that...

Implementation of Yorùbá Language Multimedia Learning System

The use of multimedia learning system has been widely accepted as a useful and effective tool in the field of human language. Many students and researchers have examined multimedia learning�s effectiveness from a number...

Download PDF file
  • EP ID EP277205
  • DOI 10.14738/tmlai.45.2223
  • Views 57
  • Downloads 0

How To Cite

Suresh Kannaiyan, Rajkumar Kannan, Gheorghita Ghinea (2016). Image Category Recognition using Bag of Visual Words Representation. Transactions on Machine Learning and Artificial Intelligence, 4(5), 1-9. https://europub.co.uk/articles/-A-277205