A New Approach for Detecting Outliers in Data Streams

Abstract

 In modern years, data streams have become an increasingly important research area, where as data stream refers to continuous flow of data and it is a process of extracting knowledge structure from continuous, rapid data records and it can be considered as a subfield of data mining. Data Stream can be classified into two types they are offline and online streams. Online data stream used in an amount of real world appliances, including network traffic monitoring, intrusion detection, credit card and fraud detection and offline data stream are used in reports based on web log streams. Data size is extremely huge and potentially infinite and it’s not possible to store all the data, so it leads to a mining challenge where shortage of limitations occurs in hardware and software. Data mining techniques are newly proposed for data streams they are highly helpful to mine the data are data stream clustering, data stream classification, frequent pattern technique, sliding window techniques and so on. For outlier detection data stream clustering technique is highly desirable one. The main objective of this research work is to perform the clustering process in data streams and detecting the outliers in data streams. Two types of clustering algorithms namely FUZZY C-MEANS and CLARANS are used for finding the outliers in data streams. The two performance factors such as clustering accuracy and outlier detection accuracy are used for analysis. By analyzing the experimental results, it is observed that the CLARANS clustering algorithm performance is more accurate than the FUZZY CMEANS.

Authors and Affiliations

Dr. S. Vijayarani*

Keywords

Related Articles

 Series Connection Effect on the Current Shunt Measuring Technique for Large Area Multicrystalline Silicon Solar Cells

 The series connection effect on the electrical performance of a large area multicrystalline Silicon solar cell (21 cm × 21 cm) with back contact technology has been considered in a desert area. Short circuit curre...

  PI Control Based DC Drive Speed Controller Responses for Small Load Torque

 The separately excited Direct current (DC) motors with conventional Proportional controller is generally used in industry. This can be easily implemented and are found to be highly effective if the load changes a...

 HOMOMORPHIC ENCRYPTION AND RE-ENCRYPTION APPLIED TO VOTING DATA SECURITY

 Homomorphic Encryption is a good basis to enhance the security measures of untrusted systems/applications that stores and manipulates sensitive data. This strong protection of data results from the capability, all...

 A REVIEW ON EVOLUTION OF HYBRID AND ELECTRIC VEHICLE

 Vehicles have been around for more than 100 years. They have changed a lot in that time. Today’s cars are faster and more reliable than those of long ago. They are also safer and more comfortable. One thing has no...

 CFD SIMULATION AND EXPERIMENTAL VERIFICATION OF AIR FLOW THROUGH HEATED PIPE

 The aim of this work is to validate the Dittus-Boelter equation by experimental, correlation and Simulation method. It used to find the value of heat transfer coefficient ‘h’ for turbulent flow in many fluid trans...

Download PDF file
  • EP ID EP164047
  • DOI -
  • Views 98
  • Downloads 0

How To Cite

Dr. S. Vijayarani* (30).  A New Approach for Detecting Outliers in Data Streams. International Journal of Engineering Sciences & Research Technology, 2(11), 3128-3133. https://europub.co.uk/articles/-A-164047