Efficient Disease Classifier Using Data Mining Techniques: Refinement of Random Forest Termination Criteria
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2013, Vol 14, Issue 5
Abstract
In the biomedical field, classifying diseases with data mining is a critical task, and prediction accuracy plays a vital role on disease data sets. Several data mining classification algorithms, such as decision trees, neural networks, and Bayesian classifiers, are used to diagnose diseases. In the decision-tree-based Random Forest, a forest is initially constructed from ten trees. Its accuracy is measured and compared with the desired accuracy. If the trees with the selected best splits match the desired accuracy, construction terminates; otherwise a new tree is added to the forest and the accuracy is measured again. The fitting criteria of the Random Forest are accuracy and correlation. Accuracy is based on the mean absolute percentage error (MAPE) and the mean absolute relative error (MARE). In the proposed system, the binomial distribution, the multinomial distribution, and the sequential probability ratio test (SPRT) are used to refine the termination criteria of Random Forest. The proposed method stops the Random Forest earlier than the existing Random Forest algorithm. A supervised learning model such as the support vector machine (SVM) takes a set of inputs, analyzes them, and recognizes the desired patterns. The disease data sets are supplied to the SVM and its prediction accuracy is measured. Random Forest and SVM are then compared, and the best class labels are identified for each disease.
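The abstract does not say how the SPRT is wired into forest construction, so the following is only a minimal sketch of Wald's sequential probability ratio test on a stream of correct/incorrect predictions, as one way such a stopping rule could be expressed. The function name `sprt_decision`, the hypothesised accuracies `p0` and `p1`, and the error rates `alpha` and `beta` are illustrative assumptions, not values from the paper.

```python
import math

def sprt_decision(outcomes, p0=0.7, p1=0.9, alpha=0.05, beta=0.05):
    """Wald's SPRT on 0/1 outcomes (1 = correct prediction).

    H0: accuracy is p0; H1: accuracy is p1 (p1 > p0). All defaults are
    assumed values for illustration only.
    Returns (decision, number_of_outcomes_consumed).
    """
    upper = math.log((1 - beta) / alpha)   # crossing this accepts H1
    lower = math.log(beta / (1 - alpha))   # crossing this accepts H0
    step_hit = math.log(p1 / p0)               # log-likelihood ratio for a correct prediction
    step_miss = math.log((1 - p1) / (1 - p0))  # log-likelihood ratio for an incorrect one

    llr = 0.0
    n = 0
    for n, correct in enumerate(outcomes, start=1):
        llr += step_hit if correct else step_miss
        if llr >= upper:
            return "accept_h1", n
        if llr <= lower:
            return "accept_h0", n
    return "continue", n  # evidence so far is inconclusive


if __name__ == "__main__":
    # Hypothetical stream of per-sample correctness flags.
    print(sprt_decision([1, 1, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]))
```

Under this reading, a decision of "accept_h1" would mean the desired accuracy has plausibly been reached and no further trees are needed, while "continue" would mean more evidence (or another tree) is required. The test can stop after only a few observations when the evidence is one-sided, which is the property that would allow earlier termination.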
Authors and Affiliations
K. Kalaiselvi