A New Approach for Detecting Outliers in Data Streams
Journal Title: International Journal of Engineering Sciences & Research Technology - Year 30, Vol 2, Issue 11
Abstract
In modern years, data streams have become an increasingly important research area, where as data stream refers to continuous flow of data and it is a process of extracting knowledge structure from continuous, rapid data records and it can be considered as a subfield of data mining. Data Stream can be classified into two types they are offline and online streams. Online data stream used in an amount of real world appliances, including network traffic monitoring, intrusion detection, credit card and fraud detection and offline data stream are used in reports based on web log streams. Data size is extremely huge and potentially infinite and it’s not possible to store all the data, so it leads to a mining challenge where shortage of limitations occurs in hardware and software. Data mining techniques are newly proposed for data streams they are highly helpful to mine the data are data stream clustering, data stream classification, frequent pattern technique, sliding window techniques and so on. For outlier detection data stream clustering technique is highly desirable one. The main objective of this research work is to perform the clustering process in data streams and detecting the outliers in data streams. Two types of clustering algorithms namely FUZZY C-MEANS and CLARANS are used for finding the outliers in data streams. The two performance factors such as clustering accuracy and outlier detection accuracy are used for analysis. By analyzing the experimental results, it is observed that the CLARANS clustering algorithm performance is more accurate than the FUZZY CMEANS.
Authors and Affiliations
Dr. S. Vijayarani*
Series Connection Effect on the Current Shunt Measuring Technique for Large Area Multicrystalline Silicon Solar Cells
The series connection effect on the electrical performance of a large area multicrystalline Silicon solar cell (21 cm × 21 cm) with back contact technology has been considered in a desert area. Short circuit curre...
PI Control Based DC Drive Speed Controller Responses for Small Load Torque
The separately excited Direct current (DC) motors with conventional Proportional controller is generally used in industry. This can be easily implemented and are found to be highly effective if the load changes a...
HOMOMORPHIC ENCRYPTION AND RE-ENCRYPTION APPLIED TO VOTING DATA SECURITY
Homomorphic Encryption is a good basis to enhance the security measures of untrusted systems/applications that stores and manipulates sensitive data. This strong protection of data results from the capability, all...
A REVIEW ON EVOLUTION OF HYBRID AND ELECTRIC VEHICLE
Vehicles have been around for more than 100 years. They have changed a lot in that time. Today’s cars are faster and more reliable than those of long ago. They are also safer and more comfortable. One thing has no...
CFD SIMULATION AND EXPERIMENTAL VERIFICATION OF AIR FLOW THROUGH HEATED PIPE
The aim of this work is to validate the Dittus-Boelter equation by experimental, correlation and Simulation method. It used to find the value of heat transfer coefficient ‘h’ for turbulent flow in many fluid trans...