slugTHE PROBLEM OF HIGH DIMENSIONALITY WITH LOW DENSITY IN CLUSTERING

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 2

Abstract

In many real-world applications, there are a number of dimensions having large variations in a dataset. The dimensions of the large variations scatter the cluster and confuse the distance between two samples in a dataset. This degrades the performances of many existing algorithms. This problem can be happened even when the number of dimensions of a dataset is small. Moreover, no existing method can distinguish whether the dataset has the highly repeated problem or low-density‟s problem. The only way to distinguish the problem is by a prior knowledge, which is given by the user. There are many methods to resolve this type of high dimensionality problem. The common way is to prune the non-significant features so that the features having large variations are removed and high-density cluster centers are obtained. Much research work has been carried out based on this criterion. The subspace clustering method is one of the well-known tools. The feature space is first partitioned into a number of equal length grids. Then, the density of each interval is measured. The features having low density are discarded and the clustering is conducted on the high density regions. Although these methods work very well on synthetic datasets, the pruned dimensions can carry useful information and hence, pruning them may increase the classification error rates.

Authors and Affiliations

Prof. T. Sudha and Swapna Sree Reddy. Obili

Keywords

Related Articles

slugIdentification of Paraphrasing in the context of Plagiarism

Paraphrasing is a very important form of processing for natural language processing (NLP). A characteristic property of natural language is that various expressions can exist to express a single concept. The aim of thi...

slugThe steady-state solution of multiple parallel channels in series and non-serial servers with balking & reneging due to long queue and some urgent message

This paper considers the most appropriate & more general queuing model in respect of customers which are allowed to leave the system at any stage with or without getting service. The paper considers the steady-state be...

Hostile Takeovers and Defensive Tactics: A Case study of Arcelor Mittal

Hostile Take-overs become the site of battlefields as it is witnessed in Arcelor Mittal takeover case. Five month long fierce takeover battle occurred between Arcelor and Mittal Steel which brought a lot of excitement...

Conceptual Framework or Information Security Policy

To protect overall security, it’s important to formulate a clear policy that states what rights the employee has and how the employee should responsibly handle company resources. That policy should be signed by each em...

slugCorporate Merger & Acquisition: A Strategic approach in Indian Banking Sector

It is an inherent desire and need of every business to grow vertically and horizontally. Organic growth, that is development from within, is often slow and sometimes difficult. Competition is fierce, and companies must...

Download PDF file
  • EP ID EP18186
  • DOI -
  • Views 314
  • Downloads 12

How To Cite

Prof. T. Sudha and Swapna Sree Reddy. Obili (2012). slugTHE PROBLEM OF HIGH DIMENSIONALITY WITH LOW DENSITY IN CLUSTERING. International Journal of Management, IT and Engineering, 2(2), -. https://europub.co.uk/articles/-A-18186