Increase of Precision on the Top of the List of Retrieved Web Documents Using Global and Local Link Analysis
Journal Title: Webology - Year 2007, Vol 4, Issue 3
Abstract
At present, information derived from the cross-references among pages is used to improve the results of Web-based information retrieval systems, as constantly occur in bibliometric techniques. The references are local when only the links related to the set of documents returned as answers to a user query are treated, as done by the HITS algorithm. If all the links of the documents in the collection are taken into account, we speak of global references. This is the case with the PageRank algorithm, which takes advantage of the whole Web structure. Using the WBR99 reference collection, the article shows the results of the implementation of the HITS and PageRank algorithms and emphasizes the gains in precision on the top of the list compared with the results of the space vector model algorithm (SVM), which is grounded only on the textual analysis of the pages. It was noticed that the use of local links produces higher average precision. However, the use of global links is justified whenever high precision at low recall is important and query processing efficiency is essential, such as in Web search engines.
Authors and Affiliations
Luiz Fernando de Barros Campos
Feeling alone among friends: Adolescence, social networks and loneliness
Adolescents are particularly susceptible to feelings of loneliness and social relationships are therefore an important part of their development. The aim of the present study is to explore the patterns of adolescents' us...
Texting Tolerance: Computer-Mediated Interfaith Dialogue
As religious unrest and tension rise throughout the world, facilitating interfaith dialogue has become more important than ever. Many religious organizations have begun to include interfaith discourse into their genera...
Have digital repositories come of age? The views of library directors
This survey of approximately 150 repositories assessed the achievements, impact, and success of digital repositories. Results show that while the size and use of repositories has been relatively modest, almost half of al...
A Comparative Theoretical and Empirical Analysis of Machine Learning Algorithms
With the explosion of data in recent times, Machine learning has emerged as one of the most important methodical approaches to observe significant insights from the vast amount of data. Particularly, it is witnessed that...
Citation analysis of Journal of Documentation
Citation analysis of all the journal articles published in the Journal of Documentation from 1996-2010 is carried out. 487 articles are published in the journal during 15 years. Highest numbers (44) of articles are publi...