Investigate the use of Anchor-Text and of Query-Document Similarity Scores to Predict the Performance of Search Engine
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2017, Vol 8, Issue 11
Abstract
Query difficulty prediction aims to estimate, in advance, whether the answers returned by a search engine in response to a query are likely to be useful. This paper proposes new predictors based on the similarity between the query and the answer documents, as calculated by three different models. It also examines anchor text-based document surrogates and how their similarity to queries can be used to estimate query difficulty. The predictors are evaluated on the WT10g collection of web data by the correlation between retrieval effectiveness, measured as the average precision (AP) and the precision at 10 (P@10) of the full-text retrieved results, and the similarity scores of the anchor text and of the full text. The experimental evaluation shows that five of the proposed predictors perform reliably and consistently across a variety of retrieval models.
Authors and Affiliations
Abdulmohsen Almalawi, Rayed AlGhamdi, Adel Fahad