A Preview on Subspace Clustering of High Dimensional Data

Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2013, Vol 6, Issue 3

Abstract

When clustering high dimensional data, traditional clustering methods are found to be lacking since they consider all of the dimensions of the dataset in discovering clusters whereas only some of the dimensions are relevant. This may give rise to subspaces within the dataset where clusters may be found. Using feature selection, we can remove irrelevant and redundant dimensions by analyzing the entire dataset. The problem of automatically identifying clusters that exist in multiple and maybe overlapping subspaces of high dimensional data, allowing better clustering of the data points, is known as Subspace Clustering. There are two major approaches to subspace clustering based on search strategy. Top-down algorithms find an initial clustering in the full set of dimensions and evaluate the subspaces of each cluster, iteratively improving the results. Bottom-up approaches start from finding low dimensional dense regions, and then use them to form clusters. Based on a survey on subspace clustering, we identify the challenges and issues involved with clustering gene expression data.

Authors and Affiliations

Sajid Nagi, Dhruba Kumar Bhattacharyya, Jugal K. Kalita

Keywords

Related Articles

ONTOLOGY BASED CLASSIFICATION AND CLUSTERING OF RESEARCH PROPOSALS AND EXTERNAL RESEARCH REVIEWERS

With the rapid development of research work in projects, research project selection is a necessary task for the research funding agencies. It is common to group the large number of research proposals, received by the res...

A New Similarity Measure for User-based Collaborative Filtering in Recommender Systems

Collaborative filtering is a popular approach in recommender Systems that helps users in identifying the items they may like in a wagon of items. Finding similarity among users with the available item ratings so as to pr...

Performance Analysis and FPGA Implementation of Digital PID Controller for Speed Control of DC Motor

This paper deals with the performance analysis and implementation of PID(Proportional-Integral-Derivative) Controller on FPGA platform.The hardware implementation has been done on Xilinx Spartan 3E FPGA board.The softwar...

New QoS-based Decision Making Approach for Heterogeneous Networks

Next generation wireless networks will provide seamless high bandwidth connectivity with high quality of service (QoS) support to mobile users, where a mobile will be able to connect to several wireless access networks s...

HEAP: Hybrid Energy-efficient Aggregation Protocol for Large Scale Wireless Sensor Networks

Wireless sensor networks (WSNs) can be meritoriously used in several application areas like agriculture, military surveillance, environmental monitoring, forest fire detection etc. Since they are used to monitor large ge...

Download PDF file
  • EP ID EP650083
  • DOI 10.24297/ijct.v6i3.4466
  • Views 104
  • Downloads 0

How To Cite

Sajid Nagi, Dhruba Kumar Bhattacharyya, Jugal K. Kalita (2013). A Preview on Subspace Clustering of High Dimensional Data. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 6(3), 441-448. https://europub.co.uk/articles/-A-650083