REVIEW OF CLUSTERING UNCERTAIN DATA

Abstract

 Clustering on uncertain data, one of the essential tasks in mining uncertain data, posts significant challenges on both modeling similarity between uncertain objects and developing efficient computational methods. The previous methods extend traditional partitioning clustering methods like k-means and density-based clustering methods like DBSCAN to uncertain data, thus rely on geometric distances between objects. Such methods cannot handle uncertain objects that are geometrically indistinguishable, such as products with the same mean but very different variances in customer ratings. Surprisingly, probability distributions, which are essential characteristics of uncertain objects, have not been considered in measuring similarity between uncertain objects. In this project, we systematically model uncertain objects in both continuous and discrete domains, where an uncertain object is modeled as a continuous and discrete random variable, respectively. We use the well-known Kullback-Leibler divergence to measure similarity between uncertain objects in both the continuous and discrete cases, and integrate it into partitioning and density-based clustering methods to cluster uncertain objects.

Authors and Affiliations

Ms. Nikhatparvin Ahamad*

Keywords

Related Articles

 Trusted Cloud for Data Sharing

 Cloud computing is a general term for anything that involves delivering hosted services over the internet. These services are broadly divided into three categories: Infrastructure-as-a-Service (IaaS), Platform-as-...

 ANDROID BASED BIOMEDICAL SIGNAL MONITORING SYSTEM

 Telemedicine has reduced the human effort by replacing wired infrastructure with wireless infrastructure. This paper based on body area network (BAN). sensors are attached to the human body and connected to microc...

 INVERTERS CURRENT FREQUENCY DEVIATION BASED ISLANDING DETECTION IN GRID CONNECTED PHOTOVOLTAIC SYSTEM

 The protection schemes of distribution systems are usually designed under the assumption that power flows from the substations to the end users.The utility system contains both load and generation in which part of...

Evolution and Development of Crankshaft (Pratt & Whitney)-A Study

One of the things that made the original Pratt & Whitney “Wasp” so successful in 1926 when it first passed its type test was the ability to make its power at a higher RPM and a lighter weight than its competition. K...

PRIVATE BANKS’ ATMS EFFICIENCY AT GROUND ZERO: A CASE STUDY OF ALLAHABAD

The technological investment done by Private Banks especially in installing cash dispensing machines, i.e. ATMs in India, had also forced public sector banks to update themselves, as their competitors; gone are the days...

Download PDF file
  • EP ID EP90900
  • DOI 10.5281/zenodo.61474
  • Views 95
  • Downloads 0

How To Cite

Ms. Nikhatparvin Ahamad* (30).  REVIEW OF CLUSTERING UNCERTAIN DATA. International Journal of Engineering Sciences & Research Technology, 5(9), 119-121. https://europub.co.uk/articles/-A-90900