Optimization of Naïve Bayes Data Mining Classification Algorithm
Journal Title: International Journal for Research in Applied Science and Engineering Technology (IJRASET) - Year 2014, Vol 2, Issue 8
Abstract
As a probability-based statistical classification method, the Naïve Bayesian classifier has gained wide popularity; however, the performance of Naive Bayes classification algorithm suffers in the domains (data set) that involve correlated features. [Correlated features are the features which have a mutual relationship or connection with each other. As correlated features are related to each other, they are measuring the same feature only, means they are redundant features]. This paper is focused upon optimization of Naive Bayes classification algorithms to improve the accuracy of generated classification results with reduced time to build the model from training dataset. The aim is to improve the performance of Naive Bayes algorithms by removing the redundant correlated features before giving the dataset to classifier. This paper highlights and discusses the mathematical derivation of Naive Bayes classifier and theoretically proves how the redundant correlated features reduce the accuracy of the classification algorithm. Finally, from the experimental reviews using WEKA data mining software, this paper presents the impressive results with significant improvement into the accuracy and time taken to build the model by Naive Bayes classification algorithm.
Authors and Affiliations
Maneesh Singhal, Ramashankar Sharma
Review of Plastic Waste Management by Pyrolysis Process with Indian perspective
The plastics have found its important role in the day-to-day life of human being and industries. The increasing demands and inefficient disposal methods have resulted in the accumulation of these wastes in the landfills...
Energy Management System in Buildings using Programmable Logic Controller
All Buildings have some form of electrical and mechanical services in order to afford a comfortable living environment for human beings. Energy Management System in Buildings (EMS) is to systematize the usage of electri...
Smart Shopping Cart Using RFID
In today’s technology, many companies are developing products that ensure convenience toward all people. One of the conveniences that involved will be providing with new and easy shopping experience. With a problem of w...
2D Platformer Shooting Game on Unity3D
This review paper describes the working of the shooting game developed using unity3d[1] engine. Unity3d is a game development platform that can be used to develop 2d as well as 3d games.
Energy-Aware Load Balancing and Application Scaling For the Cloud Ecosystem
To introduce an energy-aware operation model used for load balancing and application scaling on a cloud. The basic philosophy of our approach is defining an energy-optimal operation regime and attempting to maximize the...