A Survey on Naïve Bayes Algorithm for Diabetes Data Set Problems

Abstract

Diabetes mellitus is a growing and potentially fatal disease worldwide. A classifier that detects diabetes mellitus accurately and at low cost is therefore needed. The current implementation aims to train models in Weka that correctly classify a patient as diabetic. Weka was chosen because such models learn dynamically and can apply what they have learned to future cases. The proposed method uses a Weka implementation of the Naïve Bayes algorithm to design the classifier. Data mining is the process of extracting information from a data set and transforming it into an understandable structure for further use; it also discovers patterns in large data sets. Data mining includes a number of important techniques, such as preprocessing and classification. Classification is one such technique and is based on supervised learning. Diabetes is a life-threatening disease prevalent in several developed as well as developing countries, such as India. The diabetic patient data set used for classification was developed by collecting data from a hospital repository and consists of 1865 instances with different attributes. The instances in the data set fall into two categories: blood tests and urine tests. In this paper we discuss various data mining algorithms that have been used for diabetes prediction. Data mining is a well-known technique used by health organizations for classifying diseases such as diabetes and cancer in bioinformatics research. In the proposed approach we used Weka with 10-fold cross-validation to evaluate the data and compare results. Weka has an extensive collection of machine learning and data mining algorithms.
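The evaluation described in the abstract, a Naïve Bayes classifier scored with 10-fold cross-validation, can be illustrated with a minimal Python sketch. This is not the authors' Weka workflow. It assumes numeric attributes modeled with a Gaussian distribution, it uses plain shuffled folds without stratification, and it runs on synthetic data, because the hospital data set is not reproduced here. The attribute names (`glucose`, `urine`) and the function names are hypothetical.

```python
import math
import random


def fit_gaussian_nb(rows, labels):
    """Estimate the class prior and per-attribute mean/variance for each class."""
    model = {}
    for cls in set(labels):
        members = [r for r, y in zip(rows, labels) if y == cls]
        stats = []
        for column in zip(*members):
            mean = sum(column) / len(column)
            # Small constant avoids division by zero for constant attributes.
            var = sum((v - mean) ** 2 for v in column) / len(column) + 1e-9
            stats.append((mean, var))
        model[cls] = (len(members) / len(rows), stats)
    return model


def predict(model, row):
    """Return the class with the highest log-posterior for one instance."""
    def log_posterior(cls):
        prior, stats = model[cls]
        score = math.log(prior)
        for value, (mean, var) in zip(row, stats):
            score += -0.5 * math.log(2 * math.pi * var)
            score -= (value - mean) ** 2 / (2 * var)
        return score
    return max(model, key=log_posterior)


def cross_validate(rows, labels, k=10, seed=0):
    """Mean accuracy over k folds of shuffled (not stratified) data."""
    indices = list(range(len(rows)))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    accuracies = []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in indices if i not in held_out]
        model = fit_gaussian_nb([rows[i] for i in train],
                                [labels[i] for i in train])
        correct = sum(predict(model, rows[i]) == labels[i] for i in fold)
        accuracies.append(correct / len(fold))
    return sum(accuracies) / k


def make_synthetic_data(n=200, seed=1):
    """Synthetic stand-in for the data set: one blood and one urine attribute."""
    rng = random.Random(seed)
    rows, labels = [], []
    for i in range(n):
        diabetic = i % 2  # hypothetical label: 1 = diabetic, 0 = not diabetic
        glucose = rng.gauss(150 if diabetic else 95, 15)
        urine = rng.gauss(1.5 if diabetic else 0.3, 0.4)
        rows.append((glucose, urine))
        labels.append(diabetic)
    return rows, labels


rows, labels = make_synthetic_data()
accuracy = cross_validate(rows, labels, k=10)
print(f"mean 10-fold accuracy: {accuracy:.3f}")
```

Each fold is held out once while the model is fitted on the remaining nine, and the reported figure is the mean of the ten fold accuracies. The synthetic classes are well separated on purpose, so the score says nothing about performance on real patient data.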

Authors and Affiliations

Nilesh Jagdish Vispute, Dinesh Kumar Sahu, Anil Rajput

How To Cite

Nilesh Jagdish Vispute, Dinesh Kumar Sahu, Anil Rajput (2015). A Survey on Naïve Bayes Algorithm for Diabetes Data Set Problems. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 3(12), -. https://europub.co.uk/articles/-A-21510