Dimensionality Reduction for Handwritten Digit Recognition
Journal Title: EAI Endorsed Transactions on Cloud Systems - Year 2018, Vol 4, Issue 13
Abstract
Human perception is usually limited to two or three dimensions, and data with more dimensions than that is difficult for most people to visualize. Machine learning researchers therefore often have to overcome the curse of dimensionality in high-dimensional feature sets by applying dimensionality reduction techniques. In this study, two handwritten digit datasets are used, CVL Single Digit and MNIST, and two popular feature descriptors, Histogram of Oriented Gradients (HOG) and Gabor filters, are used to generate the feature sets. Linear and nonlinear transformations of the feature sets are investigated using several dimensionality reduction techniques: Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and Isomap. The resulting lower-dimensional vectors are then used to classify the digits with a Support Vector Machine (SVM). The experiments show that using HOG as the feature descriptor and PCA as the dimensionality reduction technique gives the highest accuracy, 99.29% on the MNIST dataset, with time efficiency comparable to that of a convolutional neural network (CNN). The LDA model with HOG as the feature descriptor achieved a lower accuracy of 98.34%, but it captured the maximum information in just 9 components in its lower-dimensional subspace, with a 75% reduction in computation time relative to the PCA-HOG model and the CNN model.
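The reduce-then-classify pipeline described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses scikit-learn's small bundled 8x8 digits dataset rather than MNIST or CVL, feeds raw pixel values instead of HOG or Gabor features, and the choice of 30 PCA components, the RBF kernel and the helper name `evaluate` are all hypothetical. It does show why LDA ends up with 9 components: for 10 digit classes, LDA can produce at most 10 - 1 = 9 discriminant directions.

```python
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC

# Assumption: the bundled 8x8 digits set stands in for MNIST / CVL,
# and raw pixels stand in for the HOG / Gabor feature vectors.
X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)


def evaluate(reducer):
    """Fit reducer + SVM; return (reduced dimension, test accuracy)."""
    model = make_pipeline(reducer, SVC(kernel="rbf"))
    model.fit(X_train, y_train)
    # model[:-1] is the pipeline without the final SVM step.
    reduced_dim = model[:-1].transform(X_test).shape[1]
    return reduced_dim, model.score(X_test, y_test)


# Assumption: 30 PCA components is an arbitrary illustrative choice.
pca_dim, pca_acc = evaluate(PCA(n_components=30))

# LDA is limited to (number of classes - 1) = 9 components for 10 digits.
lda_dim, lda_acc = evaluate(LinearDiscriminantAnalysis(n_components=9))

print(f"PCA: {pca_dim} components, accuracy {pca_acc:.4f}")
print(f"LDA: {lda_dim} components, accuracy {lda_acc:.4f}")
```

The accuracies this sketch prints depend on the toy dataset and settings above and are not comparable to the figures reported in the abstract.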
Authors and Affiliations
Ankita Das, Tuhin Kundu, Chandran Saravanan