slugTHE PROBLEM OF OUTLIERS IN CLUSTERING

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 2

Abstract

Clustering has been widely used in many applications including data mining, pattern recognition and machine learning. Noise is a major problem in cluster analysis, which degrades the performance of many existing methods. This paper is aimed at solving noise problems in data clustering. Many existing clustering algorithms are sensitive to the presence of outliers. In this paper, a new robust operator is developed to attack this problem, namely the modified l2 norm. There are many merits in using this new measure. No sensitiveuser-defined parameter is needed for this measure and it automatically assigns a small weight to the sample, which is far away from the cluster center. It is robust to outliers and has a theoretical 50% breakdown point. It can be solved without using an exhaustive search and can be extended to more general prototype, for example curve. We have tested this method with four synthetic and three real world datasets. Experiment results show that the method yields better results than other clustering algorithms.

Authors and Affiliations

Prof. Thatimakula Sudha and Swapna Sree Reddy. Obili

Keywords

Related Articles

WR DISPATCHING APPROACHES IN DISTRIBUTED WEB SERVER SYSTEMS

Once the web site becomes a popular then single web server may not be able to handle high volume of incoming traffic. In order to achieve web server scalability, more servers need to be added to distribute the load amo...

A Case Study on Customer Review based Sentiment Analysis

Abstract – Today large number of companies are shifting their businesses online due to growing trend among customer to shop online. There arises a need for effective visual analysis of online customer opinions. It has...

Gender Disparities in Wage rates and Employment in India

Different wage rates for men and women are still observed in many economies especially under developed and developing, though there are many legal frameworks for equality for both sexes. The discrimination and biases a...

Comparative Study of Green & Traditional Non Green Cleaning Products with reference to Pune Hotels

Cleaning products are the first priority of the hospitality industry. It helps to maintain the hygiene and cleanliness in the hotel to provide the better atmosphere and enhancing the standard of the hotel. Varieties of...

Purchasing Efficiency Impact on Inventory Valuation and Company’s performance

Inventory constitutes a significant asset on the balance sheet of trading & manufacturing companies. Purchasing strategy and function affect the level of inventory and business performance of company to great extent. E...

Download PDF file
  • EP ID EP18183
  • DOI -
  • Views 324
  • Downloads 10

How To Cite

Prof. Thatimakula Sudha and Swapna Sree Reddy. Obili (2012). slugTHE PROBLEM OF OUTLIERS IN CLUSTERING. International Journal of Management, IT and Engineering, 2(2), -. https://europub.co.uk/articles/-A-18183