A Survey on Naïve Bayes Algorithm for Diabetes Data Set Problems

Abstract

Diabetes Mellitus is one of the growing vitally fatal diseases world-wide. A design of classifier for the detection of Diabetes Mellitus with optimal cost and precise performance is the need of the age. The current project implementation looks further to train self-organizing weka effectively classify a diabetic patient as such. weka are so chosen due to their dynamic nature of learning and future application of knowledge. The proposed method here uses a weka implementation of the Naïve Bayes algorithm for designing of classifier. Data mining is a process of extracting information from a dataset and transform it into understandable structure for further use, also it discovers patterns in large data sets. Data mining has number of important techniques such as preprocessing, classification. Classification is one such technique which is based on supervised learning. Diabetic is a life threatening disease prevalent in several developed as well as developing countries like India. The data classification is diabetic patients data set is developed by collecting data from hospital repository consists of 1865 instances with different attributes. The instances in the dataset are two categories of blood tests, urine tests. In this paper we discuss various algorithm approaches of data mining that have been utilized for diabetic disease prediction. Data mining is a well known technique used by health organizations for classification of diseases such as diabetes and cancer in bioinformatics research. In the proposed approach we have used WEKA with 10 cross validation to evaluate data and compare results. Weka has an extensive collection of different machine learning and data mining algorithms.

Authors and Affiliations

Nilesh Jagdish Vispute, Dinesh Kumar Sahu, Anil Rajput

Keywords

Related Articles

 - Splitting Graphs

Let G(V,E) be a graph. A dominating set is a subset S of V such that every vertex not in S is adjacent to at least one vertex in S. The cardinality of a minimum dominating set is called the domination number, (G). A do...

Combined Beamforming with Space Time Frequency Coding for Mimo-OFDM Systems

Multiple antennas can be used in wireless systems to attain good quality of service and high data rate communication. Recently, efficient space-time-frequency (STF) codes have been developed to improve the diversity gai...

Surveying Techniques and Development of Authentication Key in Smart-Devices

In today's digital era of technology secure authentication is the basic need of every smart system. As the information security is the prime need of today’s systems so the authentication is the basic step of the process...

A Cost Optimization Study of Multi Story Building under Different Indian Zones

Even after designing structures for different level of seismic forces, those may or may not perform to the level that they were designed for. It is very important to have practice of earthquake-resistant design, in orde...

Sign Language Recognition for Deaf Sign User

Sign language recognition is one of the most growing fields of research today and it is the most natural way of communication for the people with hearing problems. A hand gesture recognition system can provide an opport...

Download PDF file
  • EP ID EP21510
  • DOI -
  • Views 248
  • Downloads 3

How To Cite

Nilesh Jagdish Vispute, Dinesh Kumar Sahu, Anil Rajput (2015). A Survey on Naïve Bayes Algorithm for Diabetes Data Set Problems. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 3(12), -. https://europub.co.uk/articles/-A-21510