Optimization of Naïve Bayes Data Mining Classification Algorithm

Abstract

As a probability-based statistical classification method, the Naïve Bayesian classifier has gained wide popularity; however, the performance of Naive Bayes classification algorithm suffers in the domains (data set) that involve correlated features. [Correlated features are the features which have a mutual relationship or connection with each other. As correlated features are related to each other, they are measuring the same feature only, means they are redundant features]. This paper is focused upon optimization of Naive Bayes classification algorithms to improve the accuracy of generated classification results with reduced time to build the model from training dataset. The aim is to improve the performance of Naive Bayes algorithms by removing the redundant correlated features before giving the dataset to classifier. This paper highlights and discusses the mathematical derivation of Naive Bayes classifier and theoretically proves how the redundant correlated features reduce the accuracy of the classification algorithm. Finally, from the experimental reviews using WEKA data mining software, this paper presents the impressive results with significant improvement into the accuracy and time taken to build the model by Naive Bayes classification algorithm.

Authors and Affiliations

Maneesh Singhal, Ramashankar Sharma

Keywords

Related Articles

Biodiesel Production By Alkaline Transesterification Of Mamey Sapote Oil

Bio- Diesel can also prepared from fruit seed oil to Diesel Engines .Mamey Sapote seeds are easily available and nonedible to all human being. Since most of the biodiesel were derived from edible oils like soybean, sunfl...

Serum Vitamin D in Chronic Periodontitis

Background Vitamin D is a lipid soluble vitamin also called as sunshine vitamin as it is synthesized in skin by exposure to ultraviolet rays. It is mainly required for bone growth, calcium metabolism, cellular growth an...

Improving Trust on Recommendation models using the PCA Recommend based Iterative Analysis against the User trust and Item Rating

The recommendation modelling is challenging issue in the research of recommendation model by integrating the information source with sparsity and high dimensional structure against cold start and curse of dimensionality...

A Review on Progressive Collapse Analysis

Progressive collapse of building is initiated when one or more vertical load carrying members particularly columns are seriously damaged or collapsed during any of the abnormal event. Once a column is failed the buildin...

Survey Paper on Circular Aperture Slot Antenna with Defected Ground Structure for Broad Band

This paper introduces the survey on the circular Aperture Sot Antenna with defected ground structure. A novel system composed of a circular aperture slot antenna and a Common-Mode (CM) noise rejection filter is presente...

Download PDF file
  • EP ID EP18589
  • DOI -
  • Views 942
  • Downloads 25

How To Cite

Maneesh Singhal, Ramashankar Sharma (2014). Optimization of Naïve Bayes Data Mining Classification Algorithm. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 2(8), -. https://europub.co.uk/articles/-A-18589