Comparative Analysis of Clustering Algorithms for Outlier Detection in Data Streams

Abstract

Nowadays, data mining has become one of the most popular research areas in the field of computer science, because data mining techniques are used for extracting the hidden knowledge from the large databases. In data mining, most of the work is emphasized over knowledge discovery and data stream mining is becoming an active research area in this domain. A data stream is a similar to river, it means continuous and massive sequence of data elements are in and out generated at a rapid rate and the analysis of data stream has been recently attracted attention over in data mining research community. When the amount of data is very huge, it leads to a numerous computational and mining challenges due to shortage of hardware and software limitations. Data mining techniques are newly proposed for data streams they are highly helpful to mine are data stream clustering, data stream classification, frequent pattern technique, sliding window techniques and so on. For outlier detection data stream clustering algorithm is highly needed. This main objective of this research work is to perform the clustering process in data streams and detecting the outliers in data streams. In this research work, two clustering algorithms namely BIRCH with CLARANS and CURE with CLARANS are used for finding the outliers in data streams. Different types, sizes of data sets and two performance factors such as clustering accuracy and outlier detection accuracy are used for analysis. By analyzing the experimental results, it is observed that the CURE with CLARANS clustering algorithm performance is more accurate than the BIRCH with CLARANS.

Authors and Affiliations

Dr. S. Vijayarani

Keywords

Related Articles

Rule-Based Decision Tree to Identify Malicious Traffic

Intrusion Detection Systems (IDSs) provide an important layer of security for computer systems and networks. An IDS’s task is to detect suspicious or unacceptable system and network activity and to alert a systems admi...

 A SURVEY ON ATTRIBUTE BASED ENCRYPTION TECHNIQUES IN CLOUD COMPUTING

 Cloud computing is an emerging computing paradigm, enabling users to store their data remotely in a server and to provide services on-demand. In cloud computing, cloud users and cloud service providers are almost...

DESIGN AND DEVELOPMENT OF MOTHER MOULD OF HYDRAULIC PRESS

Accurate die fixing is an important stage of the hydraulic press with the precised value of clearance. At present the hydraulic press depend on the fixing of die inside the moul d cavity with help of mechanical wedges...

 A Long-Run Relationship Investigation of Energy Consumption and Air Pollution in Togo

 The world is facing the challenge of global warming and climate change issues. Energy use is crucial to human survival and development. Improvements in lifestyles have historically been associated with increases i...

http://www.ijesrt.com/issues%20pdf%20file/Archives%202013/may-2013/11.pdf

Wind form many type of turbine are used they have different type of fault occurred on system. Fault occurred on the system LG, LLG, LLLG or other, system start to reduce the capacity & stability. Wind turbine gener...

Download PDF file
  • EP ID EP117322
  • DOI -
  • Views 84
  • Downloads 0

How To Cite

Dr. S. Vijayarani (30). Comparative Analysis of Clustering Algorithms for Outlier Detection in Data Streams. International Journal of Engineering Sciences & Research Technology, 2(10), 2885-2893. https://europub.co.uk/articles/-A-117322