Optimization of Horizontal Aggregation in SQL by using C4.5 Algorithm and K-Means Clustering

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 5

Abstract

 Abstract: Datasets in the horizontal aggregated layout are preferred by most of data mining algorithms, machine learning algorithm. Major efforts are required to compute data in the horizontal aggregated format. There are many inbuilt aggregation functions in SQL, namely, minimum, maximum, average, sum and count. These aggregation functions are used with a query evaluation method to retrieve data in the horizontal aggregation format. Optimization techniques used for vertical aggregation is not appropriate for horizontal aggregation. Standard aggregations are hard to interpret when there are many result rows, especially when grouping attributes having high cardinalities. That is why we proposed C4.5 classification algorithm and K-means clustering algorithm with query evaluation method and aggregation function for optimizing horizontal aggregation. Horizontal aggregation is a method which generates SQL code to return aggregated columns in the horizontal tabular layout. It returns a set of numbers instead of one number per row. There are various applications where the horizontal aggregation is used such as electrical billing, banks, hospital management system, pharmacy, and online library etc. [6].

Authors and Affiliations

Ms. Priti Phalak , Dr. Rekha Sharma

Keywords

Related Articles

Level of Interleukin 8 and Alpha -1 Antitrypsin in CerebroSpinal Fuid of Iraqi Patient with Bacterial and Viral Meniningitis

Objective: Meningitis refers to inflammation of the leptomeninges, the connective tissue layers in closest proximity to the surface of the brain. Meningitis can be caused by bacteria, viruses, parasites, and fungi, as we...

Secure Group Key Management using Ciphertext in MANETs

Abstract: The modern developments in Mobile AdhocNETworks (MANETs) suffer from efficient and secure broadcasting of messages to a distant system. The main obstacles for the secured information transfer are restricted com...

 A Novel Approach for Semi Supervised Document Clustering with Constraint Score based Feature Supervision

Abstract: Text document clustering provides an effective technique to manage a huge amount of retrieval outcome by grouping documents in a small number of meaningful classes. In unsupervised clustering method the unlabel...

 ‘A Review Study on Future Applicability of Snake Robots in India’

Abstract: In this study we to aim to present an overview of features of snake robots and their application across various fields. Snakes is blessed with a unique feature of moving over or climbing all most all kind of te...

 Feature Selection And Vectorization In Legal Case DocumentsUsing Chi-Square Statistical Analysis And Naïve BayesApproaches

 Abstract : Most machine learning techniques employed in the area of text classification require the features ofthe documents to be effectively selected owing to the large chunk of data encountered in the classifica...

Download PDF file
  • EP ID EP105338
  • DOI -
  • Views 98
  • Downloads 0

How To Cite

Ms. Priti Phalak, Dr. Rekha Sharma (2014).  Optimization of Horizontal Aggregation in SQL by using C4.5 Algorithm and K-Means Clustering. IOSR Journals (IOSR Journal of Computer Engineering), 16(5), 6-13. https://europub.co.uk/articles/-A-105338