Dimensionality Reduction for Handwritten Digit Recognition

Journal Title: EAI Endorsed Transactions on Cloud Systems - Year 2018, Vol 4, Issue 13

Abstract

Human perception of dimensions is usually limited to two or three degrees. Any further increase in the number of dimensions usually leads to the difficulty in visual imagination for any person. Hence, machine learning researchers often commonly have to overcome the curse of dimensionality in high dimensional feature sets with dimensionality reduction techniques. In this proposed model, two handwritten digit datasets are used: CVL Single Digit and MNIST, and two popular feature descriptors, Histogram of Oriented Gradients (HOG) and Gabor filters, are used to generate the feature sets. Investigations are carried out on linear and nonlinear transformations of the feature sets using multiple dimensionality reduction techniques such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and Isomap. The lower dimension vectors obtained, are then used to classify the numeric digits using Support Vector Machine (SVM). A conclusion arrived is that using HOG as the feature descriptor and PCA as the dimensionality reduction technique resulted in the experimental model achieving the highest accuracy of 99.29% on the MNIST dataset with the time efficiency comparable to that of a convolutional neural network (CNN). Further, it is concluded that even though the LDA model with HOG as the feature descriptor achieved a lesser accuracy of 98.34%, but it was able to capture maximum information in just 9 components in its lower dimensional subspace with 75% reduction in time efficiency of that of the PCA-HOG model and the CNN model.

Authors and Affiliations

Ankita Das, Tuhin Kundu, Chandran Saravanan

Keywords

Related Articles

Defining an Elasticity Metric for Cloud Computing Environments

Elasticity is a key property of cloud computing environments and one of the features which distinguishes this paradigm from other ones. An elasticity metric could be used to define and to monitor Service Level Agreements...

A Counterfeit Solution for Pharma Supply Chain

This paper provides a detailed overview of the blockchain technology and how it can be used to build a foolproof system in eliminating counterfeit products in pharmaceutical industries. Study by various reports indicate...

Large Scale Cross-media Data Retrieval based on Hadoop

With the rapid development of the Internet and speedy increase of the data size, there are more and more data intensive applications which often involve hundreds of megabytes of data. It is important and necessary to obt...

Indoor Positioning using cellular network and relay node for wearables

One of the key requirements in public safety domain is to know the exact location of user or wearable device. There are a plethora of Internet of Things based wearable devices which are getting used for geo-fencing and f...

A Dynamic Self-adaptive Resource-Load Evaluation Method in Cloud Computing

Cloud resource and its load have dynamic characteristics. To address this challenge, a dynamic self-adaptive evaluation method (termed SDWM) is proposed in this paper. SDWM uses some dynamic evaluation indicators to eval...

Download PDF file
  • EP ID EP45609
  • DOI http://dx.doi.org/10.4108/eai.12-2-2019.156590
  • Views 249
  • Downloads 0

How To Cite

Ankita Das, Tuhin Kundu, Chandran Saravanan (2018). Dimensionality Reduction for Handwritten Digit Recognition. EAI Endorsed Transactions on Cloud Systems, 4(13), -. https://europub.co.uk/articles/-A-45609