An Horizontal Aggregation Approach for Preparation of Data Sets in Data Mining

Abstract

 In Data Mining, Preparing a data set for analysis is generally the most time consuming task, it requires many complex SQL queries, joining tables and aggregating columns. Existing SQL aggregations have limitations to prepare data sets because they return one column per aggregated group. In general, a significant manual effort is required to build data sets, where a horizontal l generate SQL code to return aggregated columns in a horizontal tabular layout, returning a set of numbers instead of one number per row. This new class of functions is called horizontal aggrega data sets with a horizontal denormalized layout, which is the standard layout required by most data mining algorithms.

Authors and Affiliations

Mayur N. Agrawal

Keywords

Related Articles

Data Aggregation Using Genetic Algorithm in Wireless Sensor Network

A sensor network consists of one or more “sinks”. The sensors in the network act as “sources” which detect environmental events and push relevant data to the appropriate sinks. Sensors transmit information towards the s...

 APPLICATION OF COMSOL SOFTWARE TO SIMULATE INDUCTION HEATING PROCESS OF THE SEMISOLID STATE OF A356 ALUMINUM ALLOY IN THIXOFORMING PROCESSES

 Thixoforming techniques require metal alloys to be cast when they are partially liquid and partially solid. Before a material flows into a die cavity under pressure, the temperature distribution must be uniform wi...

 DESIGN AND ANALYSIS OF TELESCOPIC HALFSHAFT FOR AN ALL-TERRAINVEHICLE (ATV)

 Torque transmission from differential to the wheels is a prime factor of driveshaft. A half-shaft transmits thedrive from differential to the wheel hub. Telescopic half-shaft is one of the advancement coming up ina...

A Comparative Study Of Fatigue Failure Due To Acceleration Pulse Loading Over

This research paper deals with experimentation of a circular steel rod with specific geometry, for comparative study of fatigue failure test under constant speed and accelerated pulse speed. Two identical specimens are...

 SECURITY FEATURES IN INDIAN 500 RUPEE NOTE

Various security features of Indian 500 rupee note are discussed in this paper. Common man, who is using currency and having lack of knowledge about security features, can be cheated easily by forgers. There are more tha...

Download PDF file
  • EP ID EP153783
  • DOI -
  • Views 81
  • Downloads 0

How To Cite

Mayur N. Agrawal (30).  An Horizontal Aggregation Approach for Preparation of Data Sets in Data Mining. International Journal of Engineering Sciences & Research Technology, 2(4), 854-858. https://europub.co.uk/articles/-A-153783