Implementation and Analysis of Clustering Algorithms in Data Mining
Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2013, Vol 6, Issue 1
Abstract
Data mining plays an important role in the information industry and in society because of the huge amounts of data now available, and organizations worldwide are already aware of it. Data mining is the process of applying data analysis tools to discover patterns in data, and is also referred to as knowledge discovery from data. Clustering is an unsupervised learning technique: the groups are not predefined but are determined by the data. Among the many research areas in data mining, this paper focuses on the performance evaluation of three clustering algorithms: K-means, the self-organizing map (SOM), and hierarchical agglomerative clustering (HAC). The evaluation of these three algorithms is based purely on survey-based analysis. The algorithms are applied to a banking data set, which is very high-dimensional, and their performance is compared with each other. Our results indicate that SOM performs better than K-means and as well as or better than hierarchical clustering. We have also written code in Orange Python that implements an enhanced algorithm based on a hybrid of SOM, K-means and HAC.
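To illustrate what it means for groups to be defined by the data rather than predefined, here is a minimal sketch of K-means in plain Python. It is not the paper's Orange Python code or its hybrid algorithm; the function names, the toy points, and the choice to initialize centroids from the first k points are all assumptions made for illustration.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def k_means(points, k, max_iter=100):
    """Basic Lloyd-style K-means.

    Assumption for illustration: the first k points serve as the
    initial centroids (real implementations usually randomize this).
    """
    centroids = list(points[:k])
    labels = [None] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = mean_point(members)
    return labels, centroids


# Hypothetical toy data: two visibly separate groups, with no labels supplied.
toy_points = [(1.0, 1.0), (8.0, 8.0), (1.2, 0.8),
              (8.2, 7.9), (0.9, 1.1), (7.8, 8.1)]
labels, centroids = k_means(toy_points, k=2)
print(labels)  # [0, 1, 0, 1, 0, 1]
```

No group membership is given as input; the two clusters emerge from the distances between the points alone, which is the sense in which clustering is unsupervised.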
Authors and Affiliations
Prabhjot Kaur, Robin Parkash Mathur