A Novel Algorithm for Scaling up the Accuracy of Decision Trees
Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 2
Abstract
Classification is one of the most efficient and widely used data mining techniques. In classification, decision trees can handle high-dimensional data, and their representation is intuitive and generally easy for humans to assimilate. The area under the receiver operating characteristic curve (AUC) is one of the more recently adopted measures of classifier performance. In this paper, we present two novel decision tree algorithms, C4.45 and C4.55, which aim to improve the AUC value over C4.5, a state-of-the-art decision tree algorithm. Empirical experiments conducted on 42 benchmark datasets strongly indicate that C4.45 and C4.55 significantly outperform C4.5 on the AUC value.
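As a rough illustration of the AUC measure mentioned in the abstract, the sketch below computes AUC for a binary classifier as the fraction of positive/negative pairs in which the positive example receives the higher score, with ties counted as half. The function name `auc` and the toy labels and scores are assumptions made for this example; they do not come from the paper and say nothing about how C4.45 or C4.55 work.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney statistic).

    labels: iterable of 0/1 class labels (1 = positive), hypothetical toy input.
    scores: iterable of classifier scores, higher meaning "more likely positive".
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Toy data (made up for illustration only).
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]
example_auc = auc(labels, scores)
print(f"AUC = {example_auc:.3f}")
```

A value of 1.0 means every positive example is ranked above every negative one, while 0.5 corresponds to random ranking. The pairwise loop is quadratic in the number of examples, which is acceptable for a small illustration; a rank-based computation would be preferable on large datasets.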
Authors and Affiliations
Ali Mirza Mahmood, K. Mrutunjaya Rao, Kiran Kumar Reddi