slugTHE PROBLEM OF HIGH DIMENSIONALITY WITH LOW DENSITY IN CLUSTERING

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 2

Abstract

In many real-world applications, there are a number of dimensions having large variations in a dataset. The dimensions of the large variations scatter the cluster and confuse the distance between two samples in a dataset. This degrades the performances of many existing algorithms. This problem can be happened even when the number of dimensions of a dataset is small. Moreover, no existing method can distinguish whether the dataset has the highly repeated problem or low-density‟s problem. The only way to distinguish the problem is by a prior knowledge, which is given by the user. There are many methods to resolve this type of high dimensionality problem. The common way is to prune the non-significant features so that the features having large variations are removed and high-density cluster centers are obtained. Much research work has been carried out based on this criterion. The subspace clustering method is one of the well-known tools. The feature space is first partitioned into a number of equal length grids. Then, the density of each interval is measured. The features having low density are discarded and the clustering is conducted on the high density regions. Although these methods work very well on synthetic datasets, the pruned dimensions can carry useful information and hence, pruning them may increase the classification error rates.

Authors and Affiliations

Prof. T. Sudha and Swapna Sree Reddy. Obili

Keywords

Related Articles

slugMITIGATION OF CHROMATIC DISPERSION BY ALL PASS FILTER

Dispersion in a single mode fiber is the bottleneck of long haul optical communication systems, which limits the bit rate and repeater-less distance. Chromatic dispersion (CD) of a single mode fiber (SMF) is an importa...

A STUDY ON CUSTOMER ATTITUDE TOWARDS E-BANKING SERVICES OF PRIVATE SECTOR BANKS IN KRISHNAGIRI DISTRICT

E-Banking service is the automated delivery of new and traditional banking products and services directly to customers through electronic, interactive communication channels includes the systems that enable financial i...

slugAPPLICATION AND IMPLEMENTATION OF CRM IN HOTELS OF DEVELOPING CITIES - A CASE STUDY OF RANCHI

Hotel sells room to the guest. It is the main product that Hotel sells and with the sale of this product, other hotel products like food, beverage, laundry services etc. also get sold. Earlier when the numbers of hotel...

Impact of Visual Merchandising on Consumers’ Buying ChoiCe with referenCe to Reliance Fresh

The importance of visual merchandising cannot be ignored in this era where many purchase decisions are influenced by displays and presentations in store. The main objective of this paper is to study the influence of vi...

An ideational model of Organisational culture: With Values of the Leader as the foundation

The researchers were explored the concept to understand how it affects the organization. They have developed many models to explain it to others. One fact all the researchers have found is that values followed in the o...

Download PDF file
  • EP ID EP18186
  • DOI -
  • Views 296
  • Downloads 12

How To Cite

Prof. T. Sudha and Swapna Sree Reddy. Obili (2012). slugTHE PROBLEM OF HIGH DIMENSIONALITY WITH LOW DENSITY IN CLUSTERING. International Journal of Management, IT and Engineering, 2(2), -. https://europub.co.uk/articles/-A-18186