Increase of Precision on the Top of the List of Retrieved Web Documents Using Global and Local Link Analysis

Journal Title: Webology - Year 2007, Vol 4, Issue 3

Abstract

Information derived from the cross-references among pages is currently used to improve the results of Web-based information retrieval systems, as is common in bibliometric techniques. The references are local when only the links related to the set of documents returned in answer to a user query are considered, as in the HITS algorithm. When all the links of the documents in the collection are taken into account, the references are global. This is the case with the PageRank algorithm, which takes advantage of the whole Web structure. Using the WBR99 reference collection, the article reports the results of implementing the HITS and PageRank algorithms and highlights the gains in precision at the top of the list compared with the results of the vector space model algorithm (SVM), which is based only on the textual analysis of the pages. The use of local links was found to produce higher average precision. However, the use of global links is justified whenever high precision at low recall is important and query processing efficiency is essential, as in Web search engines.
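The distinction between global and local link analysis can be illustrated with a minimal Python sketch. It is not the article's implementation: the toy graph `toy_web`, the damping factor of 0.85, the fixed iteration count, and the function names are all assumptions chosen for illustration. The HITS variant is also simplified, restricting the graph to the answer set itself rather than an expanded neighborhood of it.

```python
from typing import Dict, List, Set, Tuple

Graph = Dict[str, List[str]]  # page -> pages it links to (every target is also a key)


def pagerank(graph: Graph, damping: float = 0.85, iterations: int = 50) -> Dict[str, float]:
    """Global link analysis: uses every link in the collection, independent of any query."""
    pages = list(graph)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without out-links is spread evenly over all pages.
        dangling = sum(rank[p] for p in pages if not graph[p])
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for p in pages:
            for q in graph[p]:
                new_rank[q] += damping * rank[p] / len(graph[p])
        rank = new_rank
    return rank


def hits(graph: Graph, answer_set: Set[str], iterations: int = 50) -> Tuple[Dict[str, float], Dict[str, float]]:
    """Local link analysis: uses only links among the pages returned for a query."""
    local = {p: [q for q in graph[p] if q in answer_set] for p in answer_set}
    hub = {p: 1.0 for p in local}
    auth = {p: 1.0 for p in local}
    for _ in range(iterations):
        # A good authority is pointed to by good hubs; a good hub points to good authorities.
        auth = {p: sum(hub[q] for q in local if p in local[q]) for p in local}
        hub = {p: sum(auth[q] for q in local[p]) for p in local}
        for scores in (auth, hub):
            norm = sum(v * v for v in scores.values()) ** 0.5 or 1.0
            for p in scores:
                scores[p] /= norm
    return hub, auth


# Hypothetical four-page collection; pages "a", "b", "c" stand in for a query's answer set.
toy_web: Graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}

global_scores = pagerank(toy_web)
hub_scores, authority_scores = hits(toy_web, {"a", "b", "c"})
print(sorted(global_scores, key=global_scores.get, reverse=True))
print(sorted(authority_scores, key=authority_scores.get, reverse=True))
```

The PageRank scores can be computed once for the whole collection before any query arrives, whereas the HITS scores must be recomputed for each answer set. This is one way to read the abstract's remark that global links suit settings where query processing efficiency is essential.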

Authors and Affiliations

Luiz Fernando de Barros Campos

How To Cite

Luiz Fernando de Barros Campos (2007). Increase of Precision on the Top of the List of Retrieved Web Documents Using Global and Local Link Analysis. Webology, 4(3), -. https://europub.co.uk/articles/-A-687526