Performance Evaluation of Density-Based Outlier Detection on High Dimensional Data
Journal Title: International Journal on Computer Science and Engineering - Year 2013, Vol 5, Issue 2
Abstract
Outlier detection is a task that finds objects that are considerably dissimilar, exceptional or inconsistent with respect to the remaining data. Outlier detection has wide applications which include data analysis, financial fraud detection, network intrusion detection and clinical diagnosis of diseases. In data analysis applications, outliers are often considered as error or noise and are removed once detected. Approaches to detect and remove outliers have been studied by several researchers. Density based approaches have been proved to be effective in detecting outliers successfully, but usually requires huge amount of computations. In this paper, two approaches that enhance the traditional density based method for removing outliers are analyzed. The first method uses data partitioning method and use speed up strategies to avoid large computations. The second method presents a unified clustering and outlier detection using Neighbourhood based Local Density Factor (NLDF). The aim of both the models is to improve the performance of outlier detection, clustering and to speed up the whole process. In this paper, the working of these two papers is studied and a performance evaluation based on clustering efficiency and outlier detection efficiency is presented.
Authors and Affiliations
P. Murugavel , Dr. M. Punithavalli
Analysis of Dependencies of Checkpoint Cost and Checkpoint Interval of Fault Tolerant MPI Applications
In this paper, we have analysed i) the relationship between the checkpoint cost and the optimal checkpoint interval and ii) the relationship between the checkpoint cost and the number of processors (processes) and we hav...
Fractals Based Clustering for CBIR
Fractal based CBIR is based on the self similarity fundamentals of fractals. Mathematical and natural fractals are the shapes whose roughness and fragmentation neither tend to vanish, nor fluctuate, but remain essentiall...
Content Based Image Retrieval using Density Distribution and Mean of Binary Patterns of Walsh Transformed Color Images
This paper introduces a novel idea of Binary Pattern observation of column wise and Row wise Walsh transformed color images for feature vector generation. The density distribution of Sal, Cal components of Binary Pattern...
Global Chaos Synchronization of Four-Scroll and Four-Wing Attractors by Active Nonlinear Control
This paper investigates the global chaos synchronization of identical four-scroll attractors (Liu and Chen, 2004), identical four-wing attractors (Liu, 2009) and non-identical four-scroll and four-wing attractors by acti...
Estimation of Solar Radiation at a Particular Place: Comparative study between Soft Computing and Statistical Approach
This study focuses on the development of connectionist model such as neural network based method to efficiently predict solar radiation of a particular place. Here a comparative study is given between a conventional appr...