Efficient Disease Classifier Using Data Mining Techniques: Refinement of Random Forest Termination Criteria

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2013, Vol 14, Issue 5

Abstract

 In biomedical field, the classification of disease using data mining is the critical task. The prediction accuracy plays a vital role in disease data set. More data mining classification algorithms like decision trees, neural networks, Bayesian classifiers are used to diagnosis the diseases. In decision tree Random Forest, Initially a forest is constructed from ten tress. The accuracy is measured and compared with desired accuracy. If the selected best split of trees matched the desired accuracy the construction terminates. Otherwise a new tree is added with random forest and accuracy is measured. The fitting criteria of random forest are accuracy and correlation. The accuracy is based on the mean absolute percentage error (MAPE) and the mean absolute relative error (MARE).In proposed system to refine the termination criteria of Random Forest, Binomial distribution, multinomial distribution and sequential probability ratio test (SPRT) are used. The proposed method stops the random forest earlier compared with existing Random Forest algorithm. The supervised learning model like support vector machine takes a set of inputs and analyze the inputs and recognize the desired patterns. The disease data sets are supplied to SVM and prediction accuracy is measured. The comparison is made between Random Forest and SVM and best class labels are identified based on disease.

Authors and Affiliations

K. Kalaiselvi

Keywords

Related Articles

 A Genetic Algorithm For Scheduling JobsWith Burst Time And Priorities

Abstract: Scheduling play extremely important role in our day-to-day life, same as the performance of system is highly affected by the CPU scheduling. For the better scheduling the performance is depend upon the paramete...

 Token Sequencing Approach to Prevent SQL Injection Attacks

 : Internet, the network of networks represents an insecure channel for exchanging information leading to a high risk of intrusion or fraud. Many web applications remain under the attack of hackers who intentional...

 Semantic Based Amalgamation of Pulverized Components Using Ontology

 Abstract: The problem faced by both software developers and user in development environment and user environment is finding compatible and complementary components in large set of different applications. Different...

 An Efficient Secure Anonymous Communication Protocol in MANET based on Destinations Location

 Abstract: The protocols and cryptographic techniques used in MANET are intended to provide complete security to the data transmitted with low cost. In hostile environments, as a part of providing security to data;...

 Wavelet Based Features for Defect Detection in Fabric using Genetic Algorithm

 Abstract: In this paper a new scheme is proposed for Fabric defect detection in textile industry. For this purpose, wavelet transformer is used as feature extractor of coefficients of fabric. These coefficients c...

Download PDF file
  • EP ID EP93784
  • DOI -
  • Views 115
  • Downloads 0

How To Cite

K. Kalaiselvi (2013).  Efficient Disease Classifier Using Data Mining Techniques: Refinement of Random Forest Termination Criteria. IOSR Journals (IOSR Journal of Computer Engineering), 14(5), 104-111. https://europub.co.uk/articles/-A-93784