Improved Hybrid Clustering and Distance-based Technique for Outlier Removal

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 1

Abstract

Outliers detection is a task that finds objects that are dissimilar or inconsistent with respect to the remaining data. It has many uses in applications like fraud detection, network intrusion detection and clinical diagnosis of diseases. Using clustering algorithms for outlier detection is a technique that is frequently used. The clustering algorithms consider outlier detection only to the point they do not interfere with the clustering process. In these algorithms, outliers are only by-products of clustering algorithms and they cannot rank the priority of outliers. In this paper, three partition-based algorithms, PAM, CLARA and CLARANs are combined with k-medoid distance based outlier detection to improve the outlier detection and removal process. The experimental results prove that CLARANS clustering algorithm when combined with medoid distance based outlier detection improves the accuracy of detection and increases the time efficiency.

Authors and Affiliations

P. Murugavel, , Dr. M. Punithavalli

Keywords

Related Articles

A Comprehensive Assessment of Object-Oriented Software Systems Using Metrics Approach

Demand for efficient software is increasing day by day and bject-oriented design technique became able to fulfill this demand because it is the most powerful mechanism to develop efficient software systems. It can not o...

An Approach to Active Queue Management in Computer Network

Active queue management is a key technique for reducing the packet drop rate in the internet. This packet dropping mechanism is used in a router to minimize congestion when the packets are dropped before queue gets full...

Speech Quality Requirements over DSL Networks

Abstract— Quality of service (QoS) has been a feature of voice communication networks almost since their inception. The extension of traditional voice QoS methods to data communication networks and the Internet has been...

SUPPORT VECTOR MACHINE BASED GUJARATI NUMERAL RECOGNITION

In this paper we propose the Support Vector Machine (SVM) based recognition scheme towards the recognition of Gujarati handwritten numerals. The preprocessing is done considering morphological operations. For computing t...

Improving the Performance of K-Means Clustering For High Dimensional Data Set

Clustering high dimensional data is the cluster analysis of data with anywhere from a few dozen to many thousands of dimensions. Multiple dimensions are hard to think in, impossible to visualize, and, due to the exponent...

Download PDF file
  • EP ID EP155326
  • DOI -
  • Views 102
  • Downloads 0

How To Cite

P. Murugavel, , Dr. M. Punithavalli (2011). Improved Hybrid Clustering and Distance-based Technique for Outlier Removal. International Journal on Computer Science and Engineering, 3(1), 333-339. https://europub.co.uk/articles/-A-155326