A Comparison Study on Performance Analysis of Data Mining Algorithms in Classification of Local Area News Dataset using WEKA Tool

Abstract

Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), [1] a field at the intersection of computer science and statistics, is the process that attempts to discover patterns in large data sets. It utilizes methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use. It is commonly used in marketing, surveillance, fraud detection, scientific discovery and now gaining wide way in social networking. Anything and everything on the Internet is fair game for extreme data mining practices. Social media covers all aspects of the social side of the internet that allow us to get contact and carve up information with others as well as intermingle with any number of people in any place in the world. This paper uses the dataset “Local News Survey” from Pew Research Center. The focus of the research is towards exploration on impact of the internet on Local News activities using Data Mining Techniques. The original dataset contains 102 attributes which is very large and hence the essential attributes required for the analysis are selected by feature reduction method. The selected attributes were applied to Data Mining Classification Algorithms such as RndTree, ID3, K-NN, C4.5 and CS-MC4. The Error rates of various classification Algorithms were compared to bring out the best and effective Algorithm suitable for this dataset.

Authors and Affiliations

G. Kesavaraj

Keywords

Related Articles

 INDENTATION OF SANDWICHES USING A LAYERWISE MODEL WITH FIXED DEGREES OF FREEDOM

 A zig-zag plate model with variable kinematics and fixed degrees of freedom recently developed by the authors is applied to study indentation of sandwiches with honeycomb/foam core, with the aim of reducing the co...

 IMPLEMENTATION AND DESIGN THREE SOFTWARE USING REUSABLE SOFTWARE CONCEPT “ANALYTICAL STUDY”

 There are two ways for the principle of re-use software and the first way is the indirect method indirect boils down to the use of one or more pieces of software in the production and creation of new programs witho...

 Re-Refining Recovery Methods of Used Lubricating Oil

 Used lubricating oil (ULO) is any petroleum based or synthetic oil that has been used and during operation oil losses effectiveness due to the presence of certain contaminants from air, fuel combustion, oxidation...

Web Crawlers and Search Engines

hypertext links. As the size of the system increases the users must traverse increasingly more links to find what they are looking for, until precise navigation becomes impractical. The WebCrawler is a tool that solves...

 PERFORMANCE ANALYSIS OF DOUBLE SPRING MASS DAMPER SYSTEM FOR VEHICLE SUSPENSION

 The main objective of this paper based on suspension system is to obtain a result and analysis on new model of a suspension system. This model is designed with a normal spring and damper, where it contains two spri...

Download PDF file
  • EP ID EP158949
  • DOI -
  • Views 69
  • Downloads 0

How To Cite

G. Kesavaraj (30). A Comparison Study on Performance Analysis of Data Mining Algorithms in Classification of Local Area News Dataset using WEKA Tool. International Journal of Engineering Sciences & Research Technology, 2(10), 2748-2755. https://europub.co.uk/articles/-A-158949