A Preview on Subspace Clustering of High Dimensional Data

Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2013, Vol 6, Issue 3

Abstract

Traditional clustering methods fall short on high dimensional data because they consider every dimension of the dataset when discovering clusters, whereas only some of the dimensions are relevant. As a result, clusters may exist only within subspaces of the dataset. Feature selection can remove irrelevant and redundant dimensions by analyzing the entire dataset. Subspace clustering is the problem of automatically identifying clusters that exist in multiple, possibly overlapping subspaces of high dimensional data, which allows better clustering of the data points. There are two major approaches to subspace clustering, distinguished by search strategy. Top-down algorithms find an initial clustering in the full set of dimensions and evaluate the subspaces of each cluster, iteratively improving the results. Bottom-up approaches start by finding low dimensional dense regions and then use them to form clusters. Based on a survey of subspace clustering, we identify the challenges and issues involved in clustering gene expression data.
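The bottom-up strategy described in the abstract can be illustrated with a small sketch. This is not the authors' method. It is a simplified grid-based search, loosely in the spirit of algorithms such as CLIQUE, and it rests on several assumptions: all values lie in a fixed range, a region counts as "dense" when a grid cell holds at least `min_pts` points, and the function names and parameters (`dense_units`, `bottom_up_dense_subspaces`, `bins`, `min_pts`) are hypothetical. The sketch stops at finding dense cells per subspace and does not merge adjacent cells into clusters.

```python
from collections import Counter
from itertools import combinations


def dense_units(points, dims, bins, lo, hi, min_pts):
    """Return the grid cells of subspace `dims` that hold at least min_pts points."""
    width = (hi - lo) / bins
    counts = Counter()
    for p in points:
        # Map each coordinate to a bin index; clamp the upper edge into the last bin.
        cell = tuple(min(int((p[d] - lo) / width), bins - 1) for d in dims)
        counts[cell] += 1
    return {cell for cell, n in counts.items() if n >= min_pts}


def bottom_up_dense_subspaces(points, bins=4, lo=0.0, hi=1.0, min_pts=3):
    """Grow subspaces one dimension at a time, keeping only those with dense cells.

    Assumption (illustrative only): a k-dimensional subspace is examined only
    if every (k-1)-dimensional subset of it already contained a dense cell.
    """
    n_dims = len(points[0])
    found = {}

    # Level 1: dense intervals in each single dimension.
    for d in range(n_dims):
        units = dense_units(points, (d,), bins, lo, hi, min_pts)
        if units:
            found[(d,)] = units

    current = list(found)
    k = 1
    while current:
        k += 1
        dims_seen = sorted({d for subspace in current for d in subspace})
        candidates = [
            c for c in combinations(dims_seen, k)
            if all(sub in found for sub in combinations(c, k - 1))
        ]
        current = []
        for c in candidates:
            units = dense_units(points, c, bins, lo, hi, min_pts)
            if units:
                found[c] = units
                current.append(c)
    return found


# Toy data: four points are close together in dimensions 0 and 1,
# while dimension 2 is spread out and carries no dense region.
points = [
    (0.10, 0.10, 0.05),
    (0.12, 0.15, 0.35),
    (0.15, 0.12, 0.60),
    (0.20, 0.20, 0.90),
    (0.80, 0.55, 0.30),
    (0.55, 0.90, 0.70),
]
found = bottom_up_dense_subspaces(points, min_pts=3)
for subspace, cells in sorted(found.items()):
    print(subspace, sorted(cells))
```

On this toy data the search keeps dimensions 0 and 1 and their joint subspace, and never examines any subspace containing dimension 2, which is the pruning effect that makes bottom-up search practical in high dimensions.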

Authors and Affiliations

Sajid Nagi, Dhruba Kumar Bhattacharyya, Jugal K. Kalita

  • EP ID EP650083
  • DOI 10.24297/ijct.v6i3.4466

How To Cite

Sajid Nagi, Dhruba Kumar Bhattacharyya, Jugal K. Kalita (2013). A Preview on Subspace Clustering of High Dimensional Data. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 6(3), 441-448. https://europub.co.uk/articles/-A-650083