Investigate the use of Anchor-Text and of Query-Document Similarity Scores to Predict the Performance of Search Engine

Abstract

Query difficulty prediction aims to estimate, in advance, whether the answers returned by a search engine in response to a query are likely to be useful. This paper proposes new predictors based on the similarity between the query and the answer documents, as calculated by three different models. It examines the use of anchor-text-based document surrogates and how their similarity to queries can be used to estimate query difficulty. The predictors are evaluated on the WT10g collection of web data, based on the correlation between 1) the average precision (AP), 2) the precision at 10 (P@10) of the full-text retrieved results, 3) a similarity score of anchor text, and 4) a similarity score of full text. The experimental evaluation shows that five of the proposed predictors perform reliably and consistently across a variety of retrieval models.
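The evaluation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the "mean top-k similarity" predictor, and the choice of Pearson correlation are all assumptions made for illustration, since the abstract does not specify the predictors or the correlation coefficient. The sketch computes AP and P@10 from binary relevance judgments of a ranked list and correlates a per-query predictor value with per-query AP.

```python
from math import sqrt


def precision_at_k(ranked_relevance, k=10):
    """Fraction of the top-k ranked results that are relevant (1 = relevant, 0 = not)."""
    return sum(ranked_relevance[:k]) / k


def average_precision(ranked_relevance, total_relevant):
    """Sum of precision at each rank holding a relevant document,
    divided by the total number of relevant documents for the query."""
    if total_relevant == 0:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, rel in enumerate(ranked_relevance, start=1):
        if rel:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / total_relevant


def mean_top_k_score(similarity_scores, k=10):
    """Hypothetical predictor (an assumption, not taken from the paper):
    the mean query-document similarity score of the top-k results."""
    top = similarity_scores[:k]
    return sum(top) / len(top) if top else 0.0


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either input is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if sd_x == 0 or sd_y == 0:
        return 0.0
    return cov / (sd_x * sd_y)


def evaluate_predictor(predictor_values, ap_values):
    """Correlate one predictor value per query with that query's AP."""
    return pearson(predictor_values, ap_values)
```

Under these assumptions, a predictor is useful when its per-query values correlate strongly with per-query AP (or P@10). A rank correlation could be substituted for `pearson` without changing the rest of the sketch.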

Authors and Affiliations

Abdulmohsen Almalawi, Rayed AlGhamdi, Adel Fahad

  • EP ID EP240767
  • DOI 10.14569/IJACSA.2017.081140

How To Cite

Abdulmohsen Almalawi, Rayed AlGhamdi, Adel Fahad (2017). Investigate the use of Anchor-Text and of Query-Document Similarity Scores to Predict the Performance of Search Engine. International Journal of Advanced Computer Science & Applications, 8(11), 320-332. https://europub.co.uk/articles/-A-240767