A Novel Information Retrieval Approach using Query Expansion and Spectral-based

Abstract

Most information retrieval (IR) models rank documents by computing a score using only lexical matching of the query terms or their frequency in the document. These models are limited because they do not consider term proximity in the document, the term-mismatch problem, or both. Term proximity is an important factor in determining how related a document is to the query. The ranking functions of the Spectral-Based Information Retrieval Model (SBIRM) consider both the frequency and the proximity of the query terms in the document by comparing the signals of the query terms in the spectral domain, rather than the spatial domain, using the Discrete Wavelet Transform (DWT). Query expansion (QE) approaches overcome the term-mismatch problem by adding terms to the query that are related in meaning to the original query terms. The QE approaches considered here are a statistical approach, Kullback-Leibler divergence (KLD), and a semantic approach, P-WNET, which uses WordNet. Both approaches improve retrieval performance. Based on these considerations, the objective of this research is to build an efficient query-expansion SBIRM (QESBIRM) that combines QE with the proximity-aware SBIRM by implementing the SBIRM using the DWT together with KLD or P-WNET. Experiments were conducted to test and evaluate the QESBIRM on a Text Retrieval Conference (TREC) dataset. The results show that the SBIRM with KLD or P-WNET outperforms the plain SBIRM in precision (P@), R-precision, Geometric Mean Average Precision (GMAP) and Mean Average Precision (MAP).
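To make the statistical QE step concrete, the following is a minimal sketch of KLD-based term selection as it is commonly formulated: each candidate term t from the top-ranked (pseudo-relevant) documents is scored by p_R(t) * log(p_R(t) / p_C(t)), where p_R and p_C are the term's relative frequencies in the feedback documents and in the whole collection. The function name `kld_expansion_terms`, the toy documents, and the cutoff `k` are illustrative assumptions; the abstract does not give the authors' exact formulation or parameters.

```python
import math
from collections import Counter


def kld_expansion_terms(feedback_docs, collection_docs, query_terms, k=2):
    """Return the k candidate terms with the highest KLD contribution.

    Assumes feedback_docs is a subset of collection_docs, so every
    feedback term has a non-zero collection frequency.
    """
    fb = Counter(t for doc in feedback_docs for t in doc)
    coll = Counter(t for doc in collection_docs for t in doc)
    fb_total = sum(fb.values())
    coll_total = sum(coll.values())

    scores = {}
    for term, freq in fb.items():
        if term in query_terms:
            continue  # original query terms are not expansion candidates
        p_fb = freq / fb_total            # p_R(t): frequency in feedback docs
        p_coll = coll[term] / coll_total  # p_C(t): frequency in the collection
        scores[term] = p_fb * math.log(p_fb / p_coll)

    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Toy tokenized collection (hypothetical data, for illustration only).
collection = [
    ["wavelet", "transform", "signal"],
    ["wavelet", "signal", "retrieval"],
    ["cooking", "recipe", "pasta"],
    ["pasta", "sauce", "recipe"],
]
feedback = collection[:2]  # pretend these were the top-ranked documents
query = ["wavelet"]

expanded = kld_expansion_terms(feedback, collection, query, k=2)
print(query + expanded)
```

In this sketch, terms that are frequent in the feedback documents but comparatively rare in the collection score highest, so they are the ones appended to the query before the expanded query is passed to the ranking model.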

Authors and Affiliations

Sara Alnofaie, Mohammed Dahab, Mahmoud Kamal

  • EP ID EP107290
  • DOI 10.14569/IJACSA.2016.070950

How To Cite

Sara Alnofaie, Mohammed Dahab, Mahmoud Kamal (2016). A Novel Information Retrieval Approach using Query Expansion and Spectral-based. International Journal of Advanced Computer Science & Applications, 7(9), 364-373. https://europub.co.uk/articles/-A-107290