To Investigate the Problem of Similarity Search on Dimension Incomplete Data

Journal Title: UNKNOWN - Year 2015, Vol 4, Issue 3

Abstract

Similarity query in multidimensional database is a fundamental research problem with numerous applications in the areas of database, data mining, and information retrieval. The existing work on querying incomplete data addresses the problem where the data values on certain dimensions are unknown. Missing dimension information poses great computational challenge, since all possible combinations of missing dimensions need to be examined when evaluating the similarity between the query and the data objects. We develop the lower and upper bounds of the probability that a data object is similar to the query. These bounds enable efficient filtering of irrelevant data objects without explicitly examining all missing dimension combinations. A probability triangle inequality is also employed to further prune the search space and speed up the query process. The proposed probabilistic framework and techniques can be applied to both whole and subsequence queries. Extensive experimental results on real-life data sets demonstrate the effectiveness and efficiency of our approach.

Authors and Affiliations

Keywords

Related Articles

Effect of Academic Services in Establishing Quality Teaching and Learning in State Polytechnic of Bandung

Effect of Academic Services in Establishing Quality Teaching and Learning in State Polytechnic of Bandung

A Review of Friction Stir Welding Process and its Variables

A Review of Friction Stir Welding Process and its Variables

Analysis Of Supplier Selection Process On Product Quality

Analysis Of Supplier Selection Process On Product Quality

Effectiveness of Balloon Therapy on Respiratory Status of Patients with Lower Respiratory Tract Disorders

Breathing is the bridge between mind and body, the connection between consciousness and unconsciousness. Chronic respiratory disease is found to be one of the most distressful conditions, badly affecting the quality of h...

Factors Influencing Adoption of Woodfuel Energy Saving Technologies in Nakuru County, Kenya

There have been efforts to promote use of woodfuel conservation technologies. These technologies include the improved charcoal stoves, the improved fuelwood stoves and the fireless cookers that can save woodfuel of upto...

Download PDF file
  • EP ID EP356889
  • DOI -
  • Views 107
  • Downloads 0

How To Cite

(2015). To Investigate the Problem of Similarity Search on Dimension Incomplete Data. UNKNOWN, 4(3), -. https://europub.co.uk/articles/-A-356889