A Comparison Study on Performance Analysis of Data Mining Algorithms in Classification of Local Area News Dataset using WEKA Tool
Journal Title: International Journal of Engineering Sciences & Research Technology - Year 30, Vol 2, Issue 10
Abstract
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), [1] a field at the intersection of computer science and statistics, is the process that attempts to discover patterns in large data sets. It utilizes methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use. It is commonly used in marketing, surveillance, fraud detection, scientific discovery and now gaining wide way in social networking. Anything and everything on the Internet is fair game for extreme data mining practices. Social media covers all aspects of the social side of the internet that allow us to get contact and carve up information with others as well as intermingle with any number of people in any place in the world. This paper uses the dataset “Local News Survey” from Pew Research Center. The focus of the research is towards exploration on impact of the internet on Local News activities using Data Mining Techniques. The original dataset contains 102 attributes which is very large and hence the essential attributes required for the analysis are selected by feature reduction method. The selected attributes were applied to Data Mining Classification Algorithms such as RndTree, ID3, K-NN, C4.5 and CS-MC4. The Error rates of various classification Algorithms were compared to bring out the best and effective Algorithm suitable for this dataset.
Authors and Affiliations
G. Kesavaraj
A GREAT MATHEMATICAL TRUTH : SQUARE ROOT TWO IS AN INVISIBLE PART & PACEL OF CIRCLE (118th Geometrical construction on Real Pi)
Square root two was introduced by Pythagorean Hippasus of Metapontum representing the diagonal of the square. In March 1998, it was discovered that the same square root two, plays an important role in deciding the...
PERFORMANCE,COMPARISON AND IMPROVEMENT USING MIMO TECHNIQUES OF QAM - OFDM IN DIFFERENT WIRELESS CHANNELS
Orthogonal Frequency Division Multiplexing (OFDM) is predicted to be implemented in future broadcasting and Wireless Local Area Network (WLAN) systems due to its robustness in transmitting a high data rate. With the...
SUPPLIER SELECTION PROCESS IN SUPPLY CHAIN MANAGEMENT
Supplier’s Selection is one among the foremost essential activities of supply chain management. Supplier’s Selection could be an advanced activity involving qualitative and quantitative multi-criteria. A trade-off...
THE EFFECT OF SOIL MOISTURE CONTENT ON THE ENERGY REQUIREMENT AND FUEL CONSUMPTION OF THE MACHINERY UNIT
The field experiment was conducted in one of college of agricultural fields – university of Baghdad – Abu ghraib for 2016 in a silty clay loam soil study power and fuel consumption requirement s , two types of...
A MULTI-LEVEL APPROACH OF ELLIPTIC CURVE CRYPTOSYSTEM FOR ENHANCED SECURITY OF AMAZIGH ALPHABET USING CELLULAR AUTOMATA
Securing data is a challenging issue in today’s era. Encryption is one of the popular methods to achieve secret communication between sender and receiver. Many different encryption techniques have been proposed to...