REVIEW OF CLUSTERING UNCERTAIN DATA

Abstract

 Clustering on uncertain data, one of the essential tasks in mining uncertain data, posts significant challenges on both modeling similarity between uncertain objects and developing efficient computational methods. The previous methods extend traditional partitioning clustering methods like k-means and density-based clustering methods like DBSCAN to uncertain data, thus rely on geometric distances between objects. Such methods cannot handle uncertain objects that are geometrically indistinguishable, such as products with the same mean but very different variances in customer ratings. Surprisingly, probability distributions, which are essential characteristics of uncertain objects, have not been considered in measuring similarity between uncertain objects. In this project, we systematically model uncertain objects in both continuous and discrete domains, where an uncertain object is modeled as a continuous and discrete random variable, respectively. We use the well-known Kullback-Leibler divergence to measure similarity between uncertain objects in both the continuous and discrete cases, and integrate it into partitioning and density-based clustering methods to cluster uncertain objects.

Authors and Affiliations

Ms. Nikhatparvin Ahamad*

Keywords

Related Articles

 Decoding Techniques of Error Control Codes called LDPC

 This paper deals with the design and decoding of an extremely powerful and flexible family of errorcontrol codes called low-density parity-check (LDPC) codes. LDPC codes can be designed to perform close to the ca...

 AUTOMATIC TEXT SUMMARIZATION AND DEADWOOD REMOVAL FOR PUNJABI LANGUAGE

 Due to large information over the internet it becomes very difficult and laborious task to get the useful information. Automatic text summarization is one of the techniques which give the shorthand information whi...

DETECTION OF COMPUTER VIRUSES USING WELM_ABC

Computer viruses are big threat for our society .The expansion of various new viruses of varying forms make the prevention quite tuff. Here we proposed WELM_ABC to detect computer viruses. The proposed method efficient...

 INTERACTION OF FOODS AND DRUGS INVOLVING NUTRIENTS AND ENZIMES

 Healthcare providers, such as physicians, pharmacists, nurses, and dietitians, have to be aware of important fooddrug interactions in order to optimize the therapeutic efficacy of prescribed and over-the-counter d...

 Throughput Analysis of Unlicensed Mobile Access using WIFI

 Making voice calls for faster information exchange is showing its increasing demands day by day. IP telephony is sensible in the view to provide such platform. IP telephony in heterogeneous network environments ca...

Download PDF file
  • EP ID EP90900
  • DOI 10.5281/zenodo.61474
  • Views 90
  • Downloads 0

How To Cite

Ms. Nikhatparvin Ahamad* (30).  REVIEW OF CLUSTERING UNCERTAIN DATA. International Journal of Engineering Sciences & Research Technology, 5(9), 119-121. https://europub.co.uk/articles/-A-90900