Optimization of Naïve Bayes Data Mining Classification Algorithm

Abstract

As a probability-based statistical classification method, the Naïve Bayesian classifier has gained wide popularity; however, the performance of Naive Bayes classification algorithm suffers in the domains (data set) that involve correlated features. [Correlated features are the features which have a mutual relationship or connection with each other. As correlated features are related to each other, they are measuring the same feature only, means they are redundant features]. This paper is focused upon optimization of Naive Bayes classification algorithms to improve the accuracy of generated classification results with reduced time to build the model from training dataset. The aim is to improve the performance of Naive Bayes algorithms by removing the redundant correlated features before giving the dataset to classifier. This paper highlights and discusses the mathematical derivation of Naive Bayes classifier and theoretically proves how the redundant correlated features reduce the accuracy of the classification algorithm. Finally, from the experimental reviews using WEKA data mining software, this paper presents the impressive results with significant improvement into the accuracy and time taken to build the model by Naive Bayes classification algorithm.

Authors and Affiliations

Maneesh Singhal, Ramashankar Sharma

Keywords

Related Articles

A Cross Comparative Analysis of Sustainable Alternatives for Traditional Roofing in Kerala

Traditional architecture is not an architectural style. It is an attitude towards the culture of a society. Traditional architecture of Kerala is evolved overtime to meet the needs of the people (inhabitants) who live i...

Anaerobic digestion of Municipal Solid biodegradable wastes for methane production:A Review

The untreated and undisposed municipal solid waste generated through different sources is a major concern of the world now-a-days. There are millions of tonnes of municipal solid waste produced every year and the amount...

Design of FPGA Controlled Closed Loop BiDirectional DC-DC (BDC) Converter for Renewable Energy Storage Applications

This paper describes the development of Bi-Directional DC-DC converter (BDC) for solar application. BDC can be used either as a Buck converter or either as a Boost converter. Designed converter in this paper is controll...

Power Quality Perfection in A Nonlinear Loaded Electrical System by Use of a Three Phase Active Power Filter

Consumer electronics, home appliances and a great assortment of developed applications, namely power electronics based, can cause high disorder in the abounding electricity. In this paper, the power quality distinctiven...

A Review: Forgery Image Detection in Forensics

The proposed system investigate changed spaces, spoke to by picture illuminant maps to propose a techniques for selecting complementary forms of characterizing visual properties for an effective and automated detection...

Download PDF file
  • EP ID EP18589
  • DOI -
  • Views 956
  • Downloads 25

How To Cite

Maneesh Singhal, Ramashankar Sharma (2014). Optimization of Naïve Bayes Data Mining Classification Algorithm. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 2(8), -. https://europub.co.uk/articles/-A-18589