Optimization of Naïve Bayes Data Mining Classification Algorithm

Abstract

As a probability-based statistical classification method, the Naïve Bayesian classifier has gained wide popularity; however, the performance of Naive Bayes classification algorithm suffers in the domains (data set) that involve correlated features. [Correlated features are the features which have a mutual relationship or connection with each other. As correlated features are related to each other, they are measuring the same feature only, means they are redundant features]. This paper is focused upon optimization of Naive Bayes classification algorithms to improve the accuracy of generated classification results with reduced time to build the model from training dataset. The aim is to improve the performance of Naive Bayes algorithms by removing the redundant correlated features before giving the dataset to classifier. This paper highlights and discusses the mathematical derivation of Naive Bayes classifier and theoretically proves how the redundant correlated features reduce the accuracy of the classification algorithm. Finally, from the experimental reviews using WEKA data mining software, this paper presents the impressive results with significant improvement into the accuracy and time taken to build the model by Naive Bayes classification algorithm.

Authors and Affiliations

Maneesh Singhal, Ramashankar Sharma

Keywords

Related Articles

Review of Plastic Waste Management by Pyrolysis Process with Indian perspective

The plastics have found its important role in the day-to-day life of human being and industries. The increasing demands and inefficient disposal methods have resulted in the accumulation of these wastes in the landfills...

Energy Management System in Buildings using Programmable Logic Controller

All Buildings have some form of electrical and mechanical services in order to afford a comfortable living environment for human beings. Energy Management System in Buildings (EMS) is to systematize the usage of electri...

Smart Shopping Cart Using RFID

In today’s technology, many companies are developing products that ensure convenience toward all people. One of the conveniences that involved will be providing with new and easy shopping experience. With a problem of w...

2D Platformer Shooting Game on Unity3D

This review paper describes the working of the shooting game developed using unity3d[1] engine. Unity3d is a game development platform that can be used to develop 2d as well as 3d games.

Energy-Aware Load Balancing and Application Scaling For the Cloud Ecosystem

To introduce an energy-aware operation model used for load balancing and application scaling on a cloud. The basic philosophy of our approach is defining an energy-optimal operation regime and attempting to maximize the...

Download PDF file
  • EP ID EP18589
  • DOI -
  • Views 858
  • Downloads 25

How To Cite

Maneesh Singhal, Ramashankar Sharma (2014). Optimization of Naïve Bayes Data Mining Classification Algorithm. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 2(8), -. https://europub.co.uk/articles/-A-18589