Comparative Analysis of Clustering Algorithms for Outlier Detection in Data Streams
Journal Title: International Journal of Engineering Sciences & Research Technology - Year 30, Vol 2, Issue 10
Abstract
Nowadays, data mining has become one of the most popular research areas in the field of computer science, because data mining techniques are used for extracting the hidden knowledge from the large databases. In data mining, most of the work is emphasized over knowledge discovery and data stream mining is becoming an active research area in this domain. A data stream is a similar to river, it means continuous and massive sequence of data elements are in and out generated at a rapid rate and the analysis of data stream has been recently attracted attention over in data mining research community. When the amount of data is very huge, it leads to a numerous computational and mining challenges due to shortage of hardware and software limitations. Data mining techniques are newly proposed for data streams they are highly helpful to mine are data stream clustering, data stream classification, frequent pattern technique, sliding window techniques and so on. For outlier detection data stream clustering algorithm is highly needed. This main objective of this research work is to perform the clustering process in data streams and detecting the outliers in data streams. In this research work, two clustering algorithms namely BIRCH with CLARANS and CURE with CLARANS are used for finding the outliers in data streams. Different types, sizes of data sets and two performance factors such as clustering accuracy and outlier detection accuracy are used for analysis. By analyzing the experimental results, it is observed that the CURE with CLARANS clustering algorithm performance is more accurate than the BIRCH with CLARANS.
Authors and Affiliations
Dr. S. Vijayarani
DESIGN OF A HIGH-SPEED WALLACE TREE MULTIPLIER
Multiplication is one of the most common arithmetic operations employed in digital systems, but multipliers are the most time, area, and power consuming circuits. Improvement in any of these parameters can be advantageo...
SUSTAINABILITY APPROCH FOR CONCRETE PAVER BLOCK USING GLASS INDUSTRIAL WASTES
As now a day’s disposal of solid waste is becoming a major problem and therefore some percentage of the waste in the form of fly ash and glass powder is used to reduce the pollution caused by these elements. The ai...
HYBRID APPROACH OF BOOSTED TREE FOR CHURN PREDICTION IN MATLAB
Organization puts much attempt to hold the churn clients in the company by identifying them as clients are beneficial persons to the growth of a company. Hybrid approach of Boosted tree is one of advance algorithm...
DETECTION OF COMPUTER VIRUSES USING WELM_BFO
Computer viruses are big threat for our society .The expansion of various new viruses of varying forms make the prevention quite tuff. Here we proposed WELM_BFO to detect computer viruses. The proposed method efficient...
A REVIEW ON PERFORMANCE AND EMISSION CHARACTERISCS OF C.I. ENGINE WITH OXYGENATED FUEL ADDITIVES
In our daily life the combustion products from the C.I. engine are one of the most pollution factor .These factor increasesgreenhouse effect, acid rain and tends to destroy the ozone layer .The chemical compositio...