slugTHE PROBLEM OF OUTLIERS IN CLUSTERING

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 2

Abstract

Clustering has been widely used in many applications including data mining, pattern recognition and machine learning. Noise is a major problem in cluster analysis, which degrades the performance of many existing methods. This paper is aimed at solving noise problems in data clustering. Many existing clustering algorithms are sensitive to the presence of outliers. In this paper, a new robust operator is developed to attack this problem, namely the modified l2 norm. There are many merits in using this new measure. No sensitiveuser-defined parameter is needed for this measure and it automatically assigns a small weight to the sample, which is far away from the cluster center. It is robust to outliers and has a theoretical 50% breakdown point. It can be solved without using an exhaustive search and can be extended to more general prototype, for example curve. We have tested this method with four synthetic and three real world datasets. Experiment results show that the method yields better results than other clustering algorithms.

Authors and Affiliations

Prof. Thatimakula Sudha and Swapna Sree Reddy. Obili

Keywords

Related Articles

slugTHE STRATEGY OF DE-INTERNATIONALIZATION OF THE SMES OF THE FOOTWEAR IN THE AREA METROPOLITANA DE GUADALAJARA

The aim of this paper is to analyze the exogenous and endogenous factors that determine the strategy of de-internationalization of SMEs in the sector of the footwear in the Metropolitan Zone of Guadalajara (ZMG). The p...

Agricultural Marketing in Growth of Rural India

The marketing of agro products is a multifarious process. Agriculture sector is facing several challenges in terms of exploring and searching new markets for the increased production. But unfortunately, Farmers are not...

slugRole of Ontology in NLP Grammar Construction for Semantic based Search Implementation in Product Data Management Systems

In this research paper we address the importance of Product Data Management (PDM) with respect to its contributions in industry. We also present PDM Systems in brief and highlight some of major challenges to the PDM co...

MANAGEMENT INFORMATION SYSTEM

The business application of Management Information System has expanded significantly over the years. Technology advances have increased both the availability and volume of information for managers and the decision make...

Analysis of Self Tuning Fuzzy PID Internal Model Control

In this paper internal model control and fuzzy self-tuning PID controller is combined into a whole controller which make up a new controller fuzzy self-tuning PID internal model controller. First the internal model con...

Download PDF file
  • EP ID EP18183
  • DOI -
  • Views 314
  • Downloads 10

How To Cite

Prof. Thatimakula Sudha and Swapna Sree Reddy. Obili (2012). slugTHE PROBLEM OF OUTLIERS IN CLUSTERING. International Journal of Management, IT and Engineering, 2(2), -. https://europub.co.uk/articles/-A-18183