Effective Performance of Information Retrieval by using Domain Based Crawler
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2013, Vol 4, Issue 7
Abstract
World Wide Web continuously introduces new capabilities and attracts many people[1]. It consists of more than 60 billion pages online. Due to this explosion in size, the information retrieval system or Search Engines are being upgraded day by day and it can be used to access the information effectively and efficiently. In this paper, we have addressed Domain Based Information Retrieval (DBIR) System. In this system we crawl the information from the web and added all links to the data base which are related to a specific domain. It simply ignores which are not related to that domain. Because of that we can save the Storage Space (SS) and Searching Time (ST) and as a result it improves the performance of the system. It is an extension of Effective Performance of Web Crawler (EPOW) System [2], in which it has two Crawler modules. The first one is Basic Crawler. It consists of multiple downloaders to achieve parallelization policy . The second one is Master Crawler, which is used to filter the URLs send by the Basic Crawler based on the Domain and sends back to the Basic Crawler to extract the related links. All these related links are collectively stored into the database under a unique domain name.
Authors and Affiliations
Sk. Nabi, Dr. Premchand
UHF RFID Reader Antenna using Novel Planar Metamaterial Structure for RFID System
An Ultra High Frequency (UHF) half-loop antenna used in Radio Frequency Identification (RFID) systems is proposed with a planar patterned metamaterial structure of compact size. The size of the planar patterned metamater...
An Incident Management System for Debt Collection in Virtual Banking
An astonishing peak volume of bad loans in most countries, including Iran, is one of the latest manifestations of deep disorders which inhibited banking system from performing its main duty to promote development plans o...
A New Uncertainty Measure in Belief Entropy Framework
Belief entropy, which represents the uncertainty measure between several pieces of evidence in the Dempster-Shafer framework, is attracting increasing interest in research. It has been used in many applications and is ma...
System Autonomy Modeling During Early Concept Definition
The current rapid systems engineering design methods, such as AGILE, significantly reduce the development time. This results in the early availability of incremental capabilities, increases the importance of accelerating...
Autonomic Computing for Business Applications
Autonomic computing, a new deployment technology introduced by IBM a decade ago, to manage the ever increasing complexity of IT systems, has become a part of many large scale deployments today. A lot of inroads have been...