REVIEW OF CLUSTERING UNCERTAIN DATA
Journal Title: International Journal of Engineering Sciences & Research Technology - Year 30, Vol 5, Issue 9
Abstract
Clustering uncertain data, one of the essential tasks in mining uncertain data, poses significant challenges both in modeling similarity between uncertain objects and in developing efficient computational methods. Previous methods extend traditional partitioning clustering methods such as k-means and density-based clustering methods such as DBSCAN to uncertain data, and thus rely on geometric distances between objects. Such methods cannot handle uncertain objects that are geometrically indistinguishable, such as products with the same mean but very different variances in customer ratings. Surprisingly, probability distributions, which are essential characteristics of uncertain objects, have not been considered in measuring similarity between uncertain objects. In this project, we systematically model uncertain objects in both continuous and discrete domains, where an uncertain object is modeled as a continuous or discrete random variable, respectively. We use the well-known Kullback-Leibler divergence to measure similarity between uncertain objects in both the continuous and discrete cases, and integrate it into partitioning and density-based clustering methods to cluster uncertain objects.
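The customer-rating example can be made concrete with a small sketch. It assumes the standard discrete form of the Kullback-Leibler divergence, D(p || q) = sum over x of p(x) log(p(x) / q(x)), and uses two hypothetical rating distributions invented here for illustration. The names kl_divergence, expected_rating, product_a and product_b are assumptions and do not come from the paper.

```python
import math

RATINGS = [1, 2, 3, 4, 5]  # hypothetical 5-star rating scale


def expected_rating(dist):
    """Mean rating of a discrete distribution over RATINGS."""
    return sum(r * p for r, p in zip(RATINGS, dist))


def kl_divergence(p, q):
    """Discrete KL divergence D(p || q), in nats, over a shared support."""
    if len(p) != len(q):
        raise ValueError("distributions must have the same support")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0.0:
            continue  # 0 * log(0 / q) is taken as 0 by convention
        if qi == 0.0:
            return math.inf  # p puts mass where q has none
        total += pi * math.log(pi / qi)
    return total


# Two made-up products with the same mean rating but different spread.
product_a = [0.05, 0.10, 0.70, 0.10, 0.05]  # ratings concentrated at 3
product_b = [0.40, 0.05, 0.10, 0.05, 0.40]  # ratings polarized at 1 and 5

if __name__ == "__main__":
    print("mean A:", expected_rating(product_a))
    print("mean B:", expected_rating(product_b))
    print("KL(A || B):", kl_divergence(product_a, product_b))
    print("KL(B || A):", kl_divergence(product_b, product_a))
```

Both products have the same expected rating, so a distance computed on the means alone cannot separate them, while the divergence between their distributions is clearly positive. The divergence is also asymmetric, which any clustering method built on it has to take into account.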
Authors and Affiliations
Ms. Nikhatparvin Ahamad*