Automatic Database Clustering: Issues and Algorithms

Journal Title: INTERNATIONAL JOURNAL OF COMPUTER TRENDS & TECHNOLOGY - Year 2014, Vol 10, Issue 4

Abstract

Clustering is the process of grouping data, where the groups are established by finding similarities between data items based on their characteristics. Such groups are termed clusters. Clustering is an unsupervised learning problem that groups objects based on distance or similarity. While a lot of work has been published on clustering data on storage media, little has been done about automating this process. There should be an automatic and dynamic database clustering technique that re-clusters a database with little intervention from a database administrator (DBA) and maintains an acceptable query response time at all times. A good physical clustering of data on disk is essential to reducing the number of disk I/Os in response to a query, whether clustering is implemented by itself or coupled with indexing, parallelism, or buffering. In this paper we describe the issues faced when designing an automatic and dynamic database clustering technique for relational databases. A comparative study of clustering algorithms across two different data items is performed here. The performance of the various clustering algorithms is compared based on the time taken to form the estimated clusters. The experimental results are depicted as a graph.
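
The abstract does not name the algorithms that were compared, so the following is only an illustrative sketch of the two ideas it mentions: grouping objects by distance, and measuring the time taken to form the clusters. It assumes k-means (Lloyd's algorithm) as a stand-in algorithm; the function names `k_means` and `time_clustering` and the sample points are hypothetical and do not come from the paper.

```python
import time
from math import dist


def k_means(points, k, iterations=10):
    """Group points into k clusters by Euclidean distance (Lloyd's algorithm)."""
    # Assumption: the first k points act as the initial centroids,
    # which keeps this sketch deterministic.
    centroids = [tuple(p) for p in points[:k]]
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for i, members in enumerate(clusters):
            if members:
                new_centroids.append(
                    tuple(sum(c) / len(members) for c in zip(*members))
                )
            else:
                new_centroids.append(centroids[i])  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break  # converged: assignments can no longer change
        centroids = new_centroids
    return clusters


def time_clustering(algorithm, points, k):
    """Return (clusters, elapsed seconds) for one run of a clustering function."""
    start = time.perf_counter()
    clusters = algorithm(points, k)
    return clusters, time.perf_counter() - start


# Hypothetical sample data: two visually separate groups of 2-D points.
points = [(1.0, 1.0), (9.0, 9.0), (1.5, 2.0), (8.0, 8.5), (1.0, 0.5), (9.5, 8.0)]
clusters, elapsed = time_clustering(k_means, points, k=2)
print(clusters, f"{elapsed:.6f} s")
```

Passing a different clustering function with the same `(points, k)` signature to `time_clustering` would give a timing comparison of the kind the abstract describes, although a meaningful comparison would need far larger inputs and repeated runs.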

Authors and Affiliations

Sakshi Kumar, Mahesh Singh, Sunil Sharma

How To Cite

Sakshi Kumar, Mahesh Singh, Sunil Sharma (2014). Automatic Database Clustering: Issues and Algorithms. INTERNATIONAL JOURNAL OF COMPUTER TRENDS & TECHNOLOGY, 10(4), 208-213. https://europub.co.uk/articles/-A-94232