Optimization of Naïve Bayes Data Mining Classification Algorithm

Abstract

As a probability-based statistical classification method, the Naïve Bayesian classifier has gained wide popularity; however, the performance of Naive Bayes classification algorithm suffers in the domains (data set) that involve correlated features. [Correlated features are the features which have a mutual relationship or connection with each other. As correlated features are related to each other, they are measuring the same feature only, means they are redundant features]. This paper is focused upon optimization of Naive Bayes classification algorithms to improve the accuracy of generated classification results with reduced time to build the model from training dataset. The aim is to improve the performance of Naive Bayes algorithms by removing the redundant correlated features before giving the dataset to classifier. This paper highlights and discusses the mathematical derivation of Naive Bayes classifier and theoretically proves how the redundant correlated features reduce the accuracy of the classification algorithm. Finally, from the experimental reviews using WEKA data mining software, this paper presents the impressive results with significant improvement into the accuracy and time taken to build the model by Naive Bayes classification algorithm.

Authors and Affiliations

Maneesh Singhal, Ramashankar Sharma

Keywords

Related Articles

slugAn Analytical Analysis of Stream Cipher and Block cipher Algorithms

Cryptography is an art or science to provid e security for sharing of information over the internet. Cryptography changes the format of original text into another format that is not easy to understand by unwanted user...

A Study of innovative ways to reward top performers in selective IT companies in India

The use of innovative reward methods can create a positive working environment. The objective of the study is to find out innovative ways – monetary or non monetary to reward top performers in IT industry. Initially and...

An Intelligent Driving Assistive System for Monitoring Driver’s Vigilance

Distracted driving is one of the main causes of vehicle collisions. Passively driver’s activities can be monitored with the help of automobile safety system which can potentially reduce the number of accidents by estima...

slugSurgically Altered Face Image Detection U sing Genetic Algorithm– A Comprehensive Study

In recent years, plastic surgery has become popular worldwide. People take facial plastic surgery to correct feature defects or improve attractiveness and confidence. It has been observed that many face...

Design of Improved Array Multiplier by Carry Select Logic

Multiplier is such an important element from the point of power consumption and speed of operation in the system. Multiplication using deletion scheme provides an efficient method for reducing the power and area as comp...

Download PDF file
  • EP ID EP18589
  • DOI -
  • Views 857
  • Downloads 25

How To Cite

Maneesh Singhal, Ramashankar Sharma (2014). Optimization of Naïve Bayes Data Mining Classification Algorithm. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 2(8), -. https://europub.co.uk/articles/-A-18589