Optimization of Horizontal Aggregation in SQL by using C4.5 Algorithm and K-Means Clustering

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 5

Abstract

 Abstract: Datasets in the horizontal aggregated layout are preferred by most of data mining algorithms, machine learning algorithm. Major efforts are required to compute data in the horizontal aggregated format. There are many inbuilt aggregation functions in SQL, namely, minimum, maximum, average, sum and count. These aggregation functions are used with a query evaluation method to retrieve data in the horizontal aggregation format. Optimization techniques used for vertical aggregation is not appropriate for horizontal aggregation. Standard aggregations are hard to interpret when there are many result rows, especially when grouping attributes having high cardinalities. That is why we proposed C4.5 classification algorithm and K-means clustering algorithm with query evaluation method and aggregation function for optimizing horizontal aggregation. Horizontal aggregation is a method which generates SQL code to return aggregated columns in the horizontal tabular layout. It returns a set of numbers instead of one number per row. There are various applications where the horizontal aggregation is used such as electrical billing, banks, hospital management system, pharmacy, and online library etc. [6].

Authors and Affiliations

Ms. Priti Phalak , Dr. Rekha Sharma

Keywords

Related Articles

 Recognition of Human Iris Using Accurate Iris Map

 Personal identification based on Biometrics technology is a trend in future. Iris Recognition is regarded as a high accuracy verification technology when compared to traditional approaches. In real-time Iris Rec...

 Overview of Improving Robustness of MAODV Protocol byCombining Tree and Mesh Structures

 Abstract: Mobile ad hoc networks (MANETs) plays an important role in the communication in the networkmust be set up temporarily and quickly. Since the nodes move randomly routing protocols must bring strong andcons...

 Production of Clean Fuel from Waste Biomass using Combined Dark and Photofermentation

 Sequential dark and photo- fermentation is a rather new approach in biological hydrogen gas production. In the present work, two-stage fermentation process consisting of dark and photo-fermentation periods was ca...

"SL-SKE (Signature Less-Secret Key Encryption) For DataSharing in Clouds"

Abstract: Cloud cоmрutіոg іs tyріcаlly defіոed as а type of cоmрutіոg that relies оո shаrіոg cоmрutіոg resources. The Іոfrаstructure as а Service іո cloud offers the dаtа-ceոter servіces to stоre аոd mаոаge іոfоrmаtіоո,...

A Survey on Different Levels of Risks during Different Phases in Data Warehouse

Abstract: The term Data Warehouse represents huge collection of historical data which are subject-oriented, non-volatile, integrated, and time-variant and such data is required for the business needs [1]. Data warehouses...

Download PDF file
  • EP ID EP105338
  • DOI -
  • Views 134
  • Downloads 0

How To Cite

Ms. Priti Phalak, Dr. Rekha Sharma (2014).  Optimization of Horizontal Aggregation in SQL by using C4.5 Algorithm and K-Means Clustering. IOSR Journals (IOSR Journal of Computer Engineering), 16(5), 6-13. https://europub.co.uk/articles/-A-105338