Effective Performance of Information Retrieval by using Domain Based Crawler

Abstract

The World Wide Web continuously introduces new capabilities and attracts many people [1]. It consists of more than 60 billion pages online. Because of this explosion in size, information retrieval systems, or search engines, are upgraded day by day so that information can be accessed effectively and efficiently. In this paper, we address a Domain Based Information Retrieval (DBIR) system. The system crawls information from the web and adds to the database only those links that are related to a specific domain; it ignores links that are not related to that domain. This saves Storage Space (SS) and Searching Time (ST) and, as a result, improves the performance of the system. DBIR is an extension of the Effective Performance of Web Crawler (EPOW) system [2] and has two crawler modules. The first is the Basic Crawler, which consists of multiple downloaders to implement a parallelization policy. The second is the Master Crawler, which filters the URLs sent by the Basic Crawler according to the domain and sends them back to the Basic Crawler so that it can extract the related links. All of these related links are stored together in the database under a unique domain name.
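The two-module flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the keyword-matching rule in `master_filter`, the `fetch` callable, the in-memory `FAKE_WEB` pages, and all function and parameter names are assumptions made for illustration, and the abstract does not say how domain relevance is actually decided.

```python
from concurrent.futures import ThreadPoolExecutor
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects (absolute URL, anchor text) pairs from one HTML page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def master_filter(links, keywords):
    """Master Crawler step (assumed rule): keep a link if its URL or
    anchor text contains any of the domain keywords."""
    kept = []
    for url, text in links:
        haystack = (url + " " + text).lower()
        if any(keyword in haystack for keyword in keywords):
            kept.append(url)
    return kept


def crawl(seeds, fetch, domain, keywords, max_pages=50, workers=4):
    """Basic Crawler step: download pages in parallel, pass the extracted
    links to the filter, and follow only the links that come back."""
    database = {domain: []}  # related links stored under the domain name
    seen = set(seeds)
    frontier = list(seeds)
    fetched = 0
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while frontier and fetched < max_pages:
            batch = frontier[: max_pages - fetched]
            frontier = frontier[len(batch):]
            fetched += len(batch)
            pages = list(pool.map(fetch, batch))  # multiple downloaders
            for url, html in zip(batch, pages):
                parser = LinkExtractor(url)
                parser.feed(html)
                for link in master_filter(parser.links, keywords):
                    if link not in seen:
                        seen.add(link)
                        database[domain].append(link)
                        frontier.append(link)
    return database


# Hypothetical in-memory pages standing in for real downloads.
FAKE_WEB = {
    "http://example.org/": '<a href="/cricket/scores">Cricket scores</a>'
                           '<a href="/recipes">Recipes</a>',
    "http://example.org/cricket/scores": '<a href="/cricket/teams">Teams</a>'
                                         '<a href="/weather">Weather</a>',
}


def fake_fetch(url):
    return FAKE_WEB.get(url, "")


result = crawl(["http://example.org/"], fake_fetch, "sports", ["cricket"])
print(result)
```

In this sketch the unrelated links (`/recipes`, `/weather`) are never stored or followed, which is where the savings in storage space and searching time claimed in the abstract would come from. A real crawler would replace `fake_fetch` with an HTTP client and would also need politeness controls such as robots.txt handling and rate limiting.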

Authors and Affiliations

Sk. Nabi, Dr. Premchand

  • EP ID EP115027
  • DOI 10.14569/IJACSA.2013.040713

How To Cite

Sk. Nabi, Dr. Premchand (2013). Effective Performance of Information Retrieval by using Domain Based Crawler. International Journal of Advanced Computer Science & Applications, 4(7), 88-92. https://europub.co.uk/articles/-A-115027