To Investigate the Problem of Similarity Search on Dimension Incomplete Data

Journal Title: International Journal of Science and Research (IJSR) - Year 2015, Vol 4, Issue 3

Abstract

Similarity query in multidimensional database is a fundamental research problem with numerous applications in the areas of database, data mining, and information retrieval. The existing work on querying incomplete data addresses the problem where the data values on certain dimensions are unknown. Missing dimension information poses great computational challenge, since all possible combinations of missing dimensions need to be examined when evaluating the similarity between the query and the data objects. We develop the lower and upper bounds of the probability that a data object is similar to the query. These bounds enable efficient filtering of irrelevant data objects without explicitly examining all missing dimension combinations. A probability triangle inequality is also employed to further prune the search space and speed up the query process. The proposed probabilistic framework and techniques can be applied to both whole and subsequence queries. Extensive experimental results on real-life data sets demonstrate the effectiveness and efficiency of our approach.

Authors and Affiliations

Keywords

Related Articles

Energy Loss Reduction in Distribution System

This research study energy loss reduction in distribution system. This study carries out in the distribution system by using PSS/adept program as tool for simulation. The techniques considered for the reduction of techni...

Awareness, Attitude and Practice of Smoking among Medical Sciences& Non-Medical Sciences Students at Taif University: Comparative Study

Awareness, Attitude and Practice of Smoking among Medical Sciences& Non-Medical Sciences Students at Taif University: Comparative Study

A Survey Paper on Twitter Opinion Mining

"Million people have primary focus on Social media platforms to share their own thoughts and opinions in regards to their day to day life, business, celebrity entertainments, polities etc. Opinion Mining defined as an In...

The Analysis of Effect of Economic Value Added (EVA) and Market Value Added (MVA) on Share Price of Subsector Companies of Property Incorporated in LQ45 Indonesia Stock Exchange in Period of 2009-2013

"After the global crisis in 2008, the Indonesian economy grew higher with maintained stability. These numbers eventually push up asset prices, including property. The increase in property prices in turn increases the dem...

Performance Modeling of Automotive Sensors and Sensor Interface Systems using Simulink

Performance Modeling of Automotive Sensors and Sensor Interface Systems using Simulink

Download PDF file
  • EP ID EP356889
  • DOI -
  • Views 103
  • Downloads 0

How To Cite

(2015). To Investigate the Problem of Similarity Search on Dimension Incomplete Data. International Journal of Science and Research (IJSR), 4(3), -. https://europub.co.uk/articles/-A-356889