Mining Frequent Item Sets in Asynchronous Transactional Data Streams over Time Sensitive Sliding Windows Model
Journal Title: Mehran University Research Journal of Engineering and Technology - Year 2016, Vol 35, Issue 4
Abstract
Extracting FPs (Frequent Patterns) from continuous transactional data streams is a challenging and critical task in applications such as web mining, data analysis, retail markets, prediction, network monitoring, and the analysis of stock market exchange data. Many algorithms have previously been developed for mining FPs from a data stream, but new solutions and approaches are still needed to handle data streams precisely. Such techniques must cope with unbounded, ordered, and continuous sequences of data that are generated at high speed. Extracting FPs from fresh or recent data therefore involves high-level analysis of the data stream. We propose an efficient technique for the sliding window model that extracts new and fresh FPs from high-speed data streams. In this study, a CPILT (Compacted Tree Compact Pattern Tree) is developed to capture the latest contents of the stream and to efficiently remove outdated contents. The main concept introduced in this work on the CPILT is the dynamic restructuring of the tree, which produces a compact tree with a frequency-descending structure at runtime. Using the FP-growth mining technique, a complete list of new and fresh FPs is obtained from the CPILT for the current window. The memory usage and time complexity of mining the latest FPs in high-speed data streams can be determined through experimentation and analysis.
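The sliding-window idea described in the abstract can be illustrated with a minimal Python sketch. This is not the CPILT structure or the FP-growth procedure from the paper: the class `TimeSlidingWindow`, its `span` parameter, and the brute-force subset counting are all hypothetical stand-ins chosen for brevity. The sketch assumes transactions arrive with non-decreasing timestamps and only shows how recent transactions are kept, outdated ones evicted, and items ranked in frequency-descending order.

```python
from collections import Counter, deque
from itertools import combinations


class TimeSlidingWindow:
    """Hypothetical helper: keeps transactions from the last `span` time units."""

    def __init__(self, span):
        self.span = span
        self.transactions = deque()   # (timestamp, frozenset of items), oldest first
        self.item_counts = Counter()  # per-item support inside the current window

    def add(self, timestamp, items):
        """Insert a transaction, then evict everything that has aged out."""
        itemset = frozenset(items)
        self.transactions.append((timestamp, itemset))
        self.item_counts.update(itemset)
        self._evict(timestamp)

    def _evict(self, now):
        # A transaction is outdated once its timestamp is <= now - span.
        while self.transactions and self.transactions[0][0] <= now - self.span:
            _, old = self.transactions.popleft()
            self.item_counts.subtract(old)
        # Unary plus drops items whose count fell to zero.
        self.item_counts = +self.item_counts

    def frequency_descending_items(self):
        """Items sorted by descending support; ties broken alphabetically."""
        return sorted(self.item_counts, key=lambda i: (-self.item_counts[i], i))

    def frequent_itemsets(self, min_support, max_size=2):
        """Naive enumeration of item sets up to max_size (not FP-growth)."""
        counts = Counter()
        for _, itemset in self.transactions:
            ordered = sorted(itemset)
            for size in range(1, max_size + 1):
                counts.update(combinations(ordered, size))
        return {k: v for k, v in counts.items() if v >= min_support}


if __name__ == "__main__":
    window = TimeSlidingWindow(span=10)
    window.add(1, ["a", "b"])
    window.add(5, ["a", "c"])
    window.add(12, ["a", "b", "c"])  # the transaction at time 1 is evicted here
    print(window.frequency_descending_items())
    print(window.frequent_itemsets(min_support=2))
```

A tree-based structure such as the one the abstract describes would avoid re-scanning every transaction in the window on each query; the naive enumeration here trades that efficiency for readability.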
Authors and Affiliations
Qaisar Javaid, Farida Memon, Shah Nawaz Talpur, Muhammad Arif, Muhammad Daud Awan