Performance Evaluation of Density-Based Outlier Detection on High Dimensional Data

Journal Title: International Journal on Computer Science and Engineering - Year 2013, Vol 5, Issue 2

Abstract

Outlier detection is a task that finds objects that are considerably dissimilar, exceptional or inconsistent with respect to the remaining data. Outlier detection has wide applications which include data analysis, financial fraud detection, network intrusion detection and clinical diagnosis of diseases. In data analysis applications, outliers are often considered as error or noise and are removed once detected. Approaches to detect and remove outliers have been studied by several researchers. Density based approaches have been proved to be effective in detecting outliers successfully, but usually requires huge amount of computations. In this paper, two approaches that enhance the traditional density based method for removing outliers are analyzed. The first method uses data partitioning method and use speed up strategies to avoid large computations. The second method presents a unified clustering and outlier detection using Neighbourhood based Local Density Factor (NLDF). The aim of both the models is to improve the performance of outlier detection, clustering and to speed up the whole process. In this paper, the working of these two papers is studied and a performance evaluation based on clustering efficiency and outlier detection efficiency is presented.

Authors and Affiliations

P. Murugavel , Dr. M. Punithavalli

Keywords

Related Articles

An Exact Algorithm for Multi – Product Bulk Transportation Problem

The paper investigates an NP-Hard nature Problem, where several commodities are produced in several plant sites with capacity constraints, and distributed to several destination sites according to demands and transportat...

TRAFFIC ANALYSIS OF DSR, AODV AND OLSR USING TCP AND UDP

Mobile Ad-hoc Network (MANET) is a collection of wireless mobile nodes dynamically forming a temporary network without the aid of any established infrastructure or centralized administration. Routing protocols in MANET h...

Associated Sensor Patterns Mining of Data Stream from WSN Dataset

Data mining is the process to discover probably beneficial definite information from the large transactional databases. Association rule mining is most common technique of data mining. It aims at discovering associations...

Text Analytics to Data Warehousing

Information hidden or stored in unstructured data can play a critical role in making decisions, understanding and conducting other business functions. Integrating data stored in both structured and unstructured formats c...

AN EFFICIENT CLASSIFICATION OF GENOMES BASED ON CLASSES AND SUBCLASSES

The grass family has been the subject of intense research over the past. Reliable and fast classification / sub-classification of large sequences which are rapidly gaining importance due to genome sequencing projects all...

Download PDF file
  • EP ID EP87237
  • DOI -
  • Views 143
  • Downloads 0

How To Cite

P. Murugavel, Dr. M. Punithavalli (2013). Performance Evaluation of Density-Based Outlier Detection on High Dimensional Data. International Journal on Computer Science and Engineering, 5(2), 62-67. https://europub.co.uk/articles/-A-87237