Optimization of Horizontal Aggregation in SQL by using C4.5 Algorithm and K-Means Clustering

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 5

Abstract

 Abstract: Datasets in the horizontal aggregated layout are preferred by most of data mining algorithms, machine learning algorithm. Major efforts are required to compute data in the horizontal aggregated format. There are many inbuilt aggregation functions in SQL, namely, minimum, maximum, average, sum and count. These aggregation functions are used with a query evaluation method to retrieve data in the horizontal aggregation format. Optimization techniques used for vertical aggregation is not appropriate for horizontal aggregation. Standard aggregations are hard to interpret when there are many result rows, especially when grouping attributes having high cardinalities. That is why we proposed C4.5 classification algorithm and K-means clustering algorithm with query evaluation method and aggregation function for optimizing horizontal aggregation. Horizontal aggregation is a method which generates SQL code to return aggregated columns in the horizontal tabular layout. It returns a set of numbers instead of one number per row. There are various applications where the horizontal aggregation is used such as electrical billing, banks, hospital management system, pharmacy, and online library etc. [6].

Authors and Affiliations

Ms. Priti Phalak , Dr. Rekha Sharma

Keywords

Related Articles

Soft Computing Techniques for Treating Neural Problem: Dementia Used Throughout the World –Areview

Abstract: Dementia Furthermore it’s A large portion regular manifestation, Alzheimer’s disease, will be an intricate confusion that afflicts fundamentally the elderly, influencing an evaluated from claiming 63 million to...

 Internet Worm Classification and Detection using Data MiningTechniques

 Abstract: Internet worm means separate malware computer programs that repeated itself and in order to spreadone computer to another computer. Malware includes computer viruses, worms, root kits, key loggers, Trojan...

Optimal Seeding And Self-Reproduction From A Mathematical Point of View.

Abstract: P. Kabamba developed generation theory as a tool for studying self-reproducing systems. We provide an alternative definition of a generation system and give a complete solution to the problem of finding op...

Review of Evolutionary Algorithms in Wsn

Abstract: Diverse issues related to wireless sensor networks like energy minimization (optimization), compression schemes, network algorithms which are self-organizing, routing protocols, management of quality of service...

 Search Accelerator

 Abstract: Optimization problem consists of maximizing or minimizing a real function by systematically Choosing input values from within an allowed set and the value of the function can be solved. Whenever the use...

Download PDF file
  • EP ID EP105338
  • DOI -
  • Views 151
  • Downloads 0

How To Cite

Ms. Priti Phalak, Dr. Rekha Sharma (2014).  Optimization of Horizontal Aggregation in SQL by using C4.5 Algorithm and K-Means Clustering. IOSR Journals (IOSR Journal of Computer Engineering), 16(5), 6-13. https://europub.co.uk/articles/-A-105338