Implementation and Analysis of Clustering Algorithms in Data Mining
Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2013, Vol 6, Issue 1
Abstract
Data mining plays a very important role in information industry and in society due to the presence of huge amount of data. Organizations in the whole world are already aware about data mining. Data mining is the process which uses various kinds of data analysis tools to obtain patterns which also referred to as knowledge discovery from data. Clustering is called unsupervised learning algorithm as groups are not predefined but defined by the data. There are so many research areas in data mining. This paper is focusing on performance and evaluation of clustering algorithm: K-means, SOM and HAC. Evaluations of these three algorithms are purely based on the survey based analysis. These algorithms are analyzed by applying on the data set of banking which is a very high dimensional data. Performances of these algorithms are also compared with each other. Our results indicate that SOM technique is better than k-means and as good as or better than the hierarchical clustering technique. We have also generated one code in Orange Python which is the enhanced algorithm based on the hybrid approach of SOM, K-means and HAC.
Authors and Affiliations
Prabhjot Kaur, Robin Parkash Mathur
The New Image Encryption and Decryption Using Quasi Group
Multimedia Communication is the new age of communication. Image Communication is one of the most popular types of multimedia communication. This type of communication always faces security challenges. Security breach is...
IMPLEMENTATION OF MOBILE VIRTUAL LABORATORY: CONTRIBUTORY FACTORS IN A DEVELOPING COUNTRY
The growing availability of mobile devices across developing countries and coupled with increase awareness of mobile learning as well as the use of mobile devices for laboratory practical warrant the exploration of its w...
Assessing the Critical Factors for E-Learning Systems Using Fuzzy TOPSIS and Fuzzy Logic
Assessing the success of Information Systems (ISs) has been identified as one of the most critical issues in IS field. Offering more services and the ease of access is considered as a significant factor for today’s aca...
Topology Controlled Energy Proficient Protocol for Wireless Sensor Networks
Random deployment in Wireless sensor networks lead to spatial node redundancy in close knit sensor networks. In this paper, an improved energy proficient PEGASIS based protocol (PEGASIS-TC) has been proposed. PEGASIS-TC...
Simulating Efficient power Wireless Sensor Network over Smart University Campus
Attendance is one of the important factors that determine the students activity in any educational organizations. Taking attendance manually is considered as a huge task, even if, it was done using traditional methods su...