To Investigate the Problem of Similarity Search on Dimension Incomplete Data

Journal Title: UNKNOWN - Year 2015, Vol 4, Issue 3

Abstract

Similarity querying in multidimensional databases is a fundamental research problem with numerous applications in databases, data mining, and information retrieval. Existing work on querying incomplete data addresses the case where the data values on certain dimensions are unknown. When the dimension information itself is missing, the computational challenge is much greater, since all possible combinations of missing dimensions must be examined when evaluating the similarity between the query and the data objects. We develop lower and upper bounds on the probability that a data object is similar to the query. These bounds enable efficient filtering of irrelevant data objects without explicitly examining every combination of missing dimensions. A probability triangle inequality is also employed to further prune the search space and speed up query processing. The proposed probabilistic framework and techniques apply to both whole-sequence and subsequence queries. Extensive experiments on real-life data sets demonstrate the effectiveness and efficiency of our approach.
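
To make the combinatorial cost concrete, the following is a minimal sketch of the brute-force baseline that such bounds are meant to avoid. It is not the method of the paper. It assumes Euclidean distance and that the observed values keep their original relative order, neither of which the abstract states; the names `partial_distances` and `can_prune` are hypothetical. Because missing dimensions can only add to a Euclidean distance, the smallest distance over the observed dimensions is a simple lower bound, and an object whose lower bound exceeds the query radius can be discarded.

```python
from itertools import combinations
from math import comb, sqrt


def partial_distances(query, observed):
    """Yield (positions, distance) for every way of placing the observed
    values onto the query's dimensions, keeping their relative order
    (an assumption made for this illustration)."""
    m, n = len(query), len(observed)
    for positions in combinations(range(m), n):
        # Distance restricted to the dimensions assumed to be present.
        d = sqrt(sum((query[p] - v) ** 2 for p, v in zip(positions, observed)))
        yield positions, d


def can_prune(query, observed, radius):
    """True if no alignment can bring the object within `radius`.

    The smallest partial distance is a lower bound on the true distance,
    since each missing dimension contributes a non-negative term.
    """
    return min(d for _, d in partial_distances(query, observed)) > radius


query = [1.0, 2.0, 3.0, 4.0]
observed = [1.0, 4.0]  # two of the four dimensions were lost

print(comb(len(query), len(observed)))     # alignments to examine
print(can_prune(query, observed, 0.5))     # a perfect alignment exists
print(can_prune(query, [9.0, 9.0], 0.5))   # every alignment is too far
```

The number of alignments grows as the binomial coefficient C(m, n) for a query with m dimensions and an object with n observed values, which is why bounds that avoid the full enumeration matter.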

How To Cite

(2015). To Investigate the Problem of Similarity Search on Dimension Incomplete Data. UNKNOWN, 4(3), -. https://europub.co.uk/articles/-A-356889