Comparative Analysis of Clustering Algorithms for Outlier Detection in Data Streams

Abstract

Data mining has become one of the most popular research areas in computer science because its techniques extract hidden knowledge from large databases. Much of the work in data mining focuses on knowledge discovery, and data stream mining is becoming an active research area within this domain. A data stream is similar to a river: a continuous, massive sequence of data elements is generated at a rapid rate, and the analysis of data streams has recently attracted attention in the data mining research community. When the amount of data is very large, it creates numerous computational and mining challenges because of hardware and software limitations. Data mining techniques recently proposed for data streams include data stream clustering, data stream classification, frequent pattern techniques, and sliding window techniques. Data stream clustering is particularly needed for outlier detection. The main objective of this research work is to cluster data streams and to detect the outliers in them. Two clustering algorithms, BIRCH with CLARANS and CURE with CLARANS, are used for finding the outliers in data streams. Data sets of different types and sizes are analyzed using two performance factors: clustering accuracy and outlier detection accuracy. The experimental results show that CURE with CLARANS is more accurate than BIRCH with CLARANS.
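The general idea of clustering-based outlier detection can be illustrated with a minimal sketch. It is not the authors' implementation: it omits the BIRCH and CURE stages entirely, uses a simplified CLARANS-style randomized medoid search on a single window of a stream, and flags points that lie far from every medoid. The function names, the parameters `num_local` and `max_neighbor`, the distance threshold, and the toy data are all assumptions made for illustration.

```python
import random
from math import dist  # Euclidean distance, Python 3.8+


def total_cost(points, medoids):
    """Sum of distances from each point to its nearest medoid."""
    return sum(min(dist(p, m) for m in medoids) for p in points)


def clarans(points, k, num_local=3, max_neighbor=50, seed=0):
    """Simplified CLARANS-style search (illustrative, not the paper's code).

    Assumes k is smaller than the number of distinct points.
    """
    rng = random.Random(seed)
    best, best_cost = None, float("inf")
    for _ in range(num_local):  # independent random restarts
        current = rng.sample(points, k)
        cost = total_cost(points, current)
        tries = 0
        while tries < max_neighbor:
            # A neighbor differs from the current solution in one medoid.
            candidate = rng.choice([p for p in points if p not in current])
            neighbor = list(current)
            neighbor[rng.randrange(k)] = candidate
            new_cost = total_cost(points, neighbor)
            if new_cost < cost:
                current, cost, tries = neighbor, new_cost, 0
            else:
                tries += 1
        if cost < best_cost:
            best, best_cost = current, cost
    return best


def flag_outliers(points, medoids, threshold):
    """Points farther than `threshold` from every medoid (assumed rule)."""
    return [p for p in points if min(dist(p, m) for m in medoids) > threshold]


# Hypothetical window of a stream: two tight groups and one distant point.
window = [
    (0.0, 0.0), (0.1, 0.2), (0.2, 0.1),
    (10.0, 10.0), (10.1, 9.9), (9.9, 10.2),
    (5.0, 15.0),
]
medoids = clarans(window, k=2)
outliers = flag_outliers(window, medoids, threshold=3.0)
print(outliers)
```

On this toy window the search is expected to place one medoid in each tight group, leaving the distant point `(5.0, 15.0)` as the only one beyond the threshold. In a streaming setting the same two steps would be repeated on each new window; how the threshold is chosen is left open here.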

Authors and Affiliations

Dr. S. Vijayarani

How To Cite

Dr. S. Vijayarani (30). Comparative Analysis of Clustering Algorithms for Outlier Detection in Data Streams. International Journal of Engineering Sciences & Research Technology, 2(10), 2885-2893. https://europub.co.uk/articles/-A-117322