Experimental Study: Comparison of clustering algorithms

Abstract

Clustering is one of the most important processes in machine learning. It is an unsupervised process that identifies similar measurements and groups them according to specific measures. Clustering is required in many applications, such as text analysis, data visualization, natural language processing, image processing, computer vision, and gene expression analysis. This work presents a comparative study that analyzes the performance of different clustering algorithms on different datasets. We conduct experiments to evaluate the effectiveness of six clustering algorithms: hard K-means, fuzzy K-means, locality-weighted hard K-means, locality-weighted fuzzy K-means, hierarchical clustering, and DBSCAN. We use synthetic and real datasets in our experiments. We synthesize three datasets to analyze performance: an imbalanced-classes dataset, an outlier dataset, and a moon dataset. Additionally, we perform image segmentation and compression using these clustering algorithms. Finally, we test the algorithms on facial expression clustering, which is one of the most challenging problems in computer vision.
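
As a rough illustration of the first two algorithms named in the abstract, the sketch below runs a hard K-means loop and computes fuzzy memberships on a small synthetic imbalanced dataset. It is not the authors' implementation: the function names (make_imbalanced, hard_kmeans, fuzzy_memberships), the class sizes, the Gaussian spreads, and the fuzzifier m = 2 are all assumptions chosen for the example, and the membership formula is the standard fuzzy c-means one.

```python
import math
import random


def make_imbalanced(n_major=200, n_minor=20, seed=0):
    """Hypothetical 2-D dataset: one large and one small Gaussian blob."""
    rng = random.Random(seed)
    major = [(rng.gauss(0.0, 0.5), rng.gauss(0.0, 0.5)) for _ in range(n_major)]
    minor = [(rng.gauss(5.0, 0.5), rng.gauss(5.0, 0.5)) for _ in range(n_minor)]
    return major + minor


def nearest(point, centers):
    """Index of the center closest to point (Euclidean distance)."""
    return min(range(len(centers)), key=lambda j: math.dist(point, centers[j]))


def mean(members):
    """Coordinate-wise mean of a non-empty list of tuples."""
    return tuple(sum(axis) / len(members) for axis in zip(*members))


def hard_kmeans(points, k, init=None, iters=100, seed=0):
    """Hard K-means: assign each point to one center, then recompute centers."""
    rng = random.Random(seed)
    centers = list(init) if init is not None else rng.sample(points, k)
    for _ in range(iters):
        labels = [nearest(p, centers) for p in points]
        updated = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # An empty cluster keeps its previous center.
            updated.append(mean(members) if members else centers[j])
        if updated == centers:
            break
        centers = updated
    labels = [nearest(p, centers) for p in points]
    return centers, labels


def fuzzy_memberships(points, centers, m=2.0):
    """Fuzzy memberships: each row sums to 1 across the centers."""
    power = 2.0 / (m - 1.0)
    rows = []
    for p in points:
        dists = [math.dist(p, c) for c in centers]
        if any(d == 0.0 for d in dists):
            # A point lying exactly on a center belongs fully to that center.
            weights = [1.0 if d == 0.0 else 0.0 for d in dists]
        else:
            weights = [d ** -power for d in dists]
        total = sum(weights)
        rows.append([w / total for w in weights])
    return rows


if __name__ == "__main__":
    data = make_imbalanced()
    centers, labels = hard_kmeans(data, k=2)
    print("cluster sizes:", [labels.count(j) for j in range(2)])
    print("memberships of first point:", fuzzy_memberships(data[:1], centers)[0])
```

The hard variant gives every point a single label, while the fuzzy variant gives it a degree of membership in each cluster. That difference matters most for points lying between groups or far from all of them, as in an outlier dataset.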

Authors and Affiliations

Mohammed Dawod, Mays Hasan, Amar Daood

  • EP ID EP392042
  • DOI 10.9790/9622-0708042334.

How To Cite

Mohammed Dawod, Mays Hasan, Amar Daood (2017). Experimental Study: Comparison of clustering algorithms. International Journal of Engineering Research and Applications, 7(8), 23-34. https://europub.co.uk/articles/-A-392042