REVIEW OF CLUSTERING UNCERTAIN DATA

Abstract

 Clustering on uncertain data, one of the essential tasks in mining uncertain data, posts significant challenges on both modeling similarity between uncertain objects and developing efficient computational methods. The previous methods extend traditional partitioning clustering methods like k-means and density-based clustering methods like DBSCAN to uncertain data, thus rely on geometric distances between objects. Such methods cannot handle uncertain objects that are geometrically indistinguishable, such as products with the same mean but very different variances in customer ratings. Surprisingly, probability distributions, which are essential characteristics of uncertain objects, have not been considered in measuring similarity between uncertain objects. In this project, we systematically model uncertain objects in both continuous and discrete domains, where an uncertain object is modeled as a continuous and discrete random variable, respectively. We use the well-known Kullback-Leibler divergence to measure similarity between uncertain objects in both the continuous and discrete cases, and integrate it into partitioning and density-based clustering methods to cluster uncertain objects.

Authors and Affiliations

Ms. Nikhatparvin Ahamad*

Keywords

Related Articles

ERROR DETECTION USIN G BINARY BCH (255, 2 15, 5) CODES

Error - correction codes are the codes used to correct the errors occurred during the transmission of the data in the unreliable communication medi ums. Error detection is the detection of errors caused by noise or...

CURRENT HARMONICS REDUCTION IN DISTRIBUTED GENERATION SYSTEM BY USING SHUNT HYBRID ACTIVE POWER FILTER STRATEGY

The enormous growth of non - linear load is connected to the power system will create unbalancing and inject harmonics current to the source.This unbalancing load and harmonic injection has produce mismatching of p...

 IMPLEMENTATION AND DESIGN THREE SOFTWARE USING REUSABLE SOFTWARE CONCEPT “ANALYTICAL STUDY”

 There are two ways for the principle of re-use software and the first way is the indirect method indirect boils down to the use of one or more pieces of software in the production and creation of new programs witho...

BIOLEACHING OF ZINC: ECO FRIENDLY MINING

The research work presented in this paper is on a Biomining The estimated annual demand for zinc in India is approximately 2.41 lakh tones; against this, the present installed capacity in the country for zinc ingots is...

 Electronic Patch Wireless Reflectance Pulse Oximetry for Remote Health Monitoring

 This project describes the development of a wireless electronic patch for wearable health monitoring by reflectance pulse oximetry. The Electronic Patch is the health monitoring system which incorporates the biome...

Download PDF file
  • EP ID EP90900
  • DOI 10.5281/zenodo.61474
  • Views 110
  • Downloads 0

How To Cite

Ms. Nikhatparvin Ahamad* (30).  REVIEW OF CLUSTERING UNCERTAIN DATA. International Journal of Engineering Sciences & Research Technology, 5(9), 119-121. https://europub.co.uk/articles/-A-90900