A Novel Information Retrieval Approach using Query Expansion and Spectral-based

Abstract

Most information retrieval (IR) models rank documents by computing a score from only the literal query terms or the frequency of the query terms in the document. These models are limited because they do not consider term proximity in the document, term mismatch, or both. Term proximity is an important factor in determining how related a document is to the query. The ranking functions of the Spectral-Based Information Retrieval Model (SBIRM) consider both the frequency and the proximity of the query terms in the document by comparing the signals of the query terms in the spectral domain, rather than the spatial domain, using the Discrete Wavelet Transform (DWT). Query expansion (QE) approaches overcome the word-mismatch problem by adding terms to the query that are related in meaning to it. The QE approaches are divided into a statistical approach, Kullback-Leibler divergence (KLD), and a semantic approach, P-WNET, which uses WordNet. These approaches enhance retrieval performance. Based on these considerations, the objective of this research is to build an efficient QESBIRM that combines QE with the proximity-aware SBIRM by implementing the SBIRM using the DWT together with KLD or P-WNET. Experiments were conducted to test and evaluate the QESBIRM on a Text Retrieval Conference (TREC) dataset. The results show that the SBIRM with KLD or P-WNET outperforms the SBIRM alone in precision (P@), R-precision, Geometric Mean Average Precision (GMAP) and Mean Average Precision (MAP).
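As a rough illustration of the two ingredients named in the abstract, the following sketch builds a per-term positional signal, applies a Haar wavelet decomposition to it, and scores candidate expansion terms with a KLD-style weight. It is not the authors' implementation: the abstract does not say which wavelet, bin count, or ranking function is used, so the Haar wavelet, the `bins` parameter, and all function names here are assumptions chosen for simplicity. The KLD weight follows the commonly used form p_R(t) * log(p_R(t) / p_C(t)), where p_R is estimated from pseudo-relevant (feedback) text and p_C from the whole collection.

```python
import math
from collections import Counter


def term_signal(tokens, term, bins=8):
    """Count occurrences of `term` in each of `bins` equal-width document segments.

    `bins` is an illustrative assumption; it must be a power of two for haar_dwt.
    """
    signal = [0.0] * bins
    n = len(tokens)
    for pos, tok in enumerate(tokens):
        if tok == term:
            signal[pos * bins // n] += 1.0
    return signal


def haar_dwt(signal):
    """Full orthonormal Haar decomposition of a power-of-two-length signal.

    Returns [approximation, coarsest detail, ..., finest details].
    """
    out = list(signal)
    length = len(out)
    while length > 1:
        half = length // 2
        approx = [(out[2 * i] + out[2 * i + 1]) / math.sqrt(2) for i in range(half)]
        detail = [(out[2 * i] - out[2 * i + 1]) / math.sqrt(2) for i in range(half)]
        out[:length] = approx + detail
        length = half
    return out


def kld_scores(feedback_tokens, collection_tokens):
    """Score each feedback term by p_R(t) * log(p_R(t) / p_C(t)).

    Terms absent from the collection sample are skipped to avoid division by zero.
    """
    fb = Counter(feedback_tokens)
    coll = Counter(collection_tokens)
    n_fb, n_coll = len(feedback_tokens), len(collection_tokens)
    scores = {}
    for term, count in fb.items():
        p_r = count / n_fb
        p_c = coll[term] / n_coll
        if p_c > 0:
            scores[term] = p_r * math.log(p_r / p_c)
    return scores


if __name__ == "__main__":
    doc = ["a", "b", "a", "c", "d", "e", "f", "a"]
    sig = term_signal(doc, "a", bins=4)
    print(sig)
    print(haar_dwt(sig))
    print(kld_scores(["wavelet", "wavelet", "query", "the"],
                     ["wavelet", "query", "the", "the", "the", "the", "the", "the"]))
```

In a spectral model of this kind, the wavelet coefficients of the different query terms would then be compared component by component, so that terms occurring in the same regions of a document reinforce each other. The highest-scoring KLD terms would be appended to the query before that comparison. Both steps are left out here because the abstract does not specify them.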

Authors and Affiliations

Sara Alnofaie, Mohammed Dahab, Mahmoud Kamal

  • EP ID EP107290
  • DOI 10.14569/IJACSA.2016.070950

How To Cite

Sara Alnofaie, Mohammed Dahab, Mahmoud Kamal (2016). A Novel Information Retrieval Approach using Query Expansion and Spectral-based. International Journal of Advanced Computer Science & Applications, 7(9), 364-373. https://europub.co.uk/articles/-A-107290