Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2015, Vol 17, Issue 6
Abstract
Abstract: The internet is a vast collection of billions of web pages containing terabytes of information arranged in thousands of servers using HTML. The size of this collection itself is a formidable obstacle in retrieving necessary and relevant information. This made search engines an important part of our lives. Search engines strive to retrieve information as relevant as possible. One of the building blocks of search engines is the Web Crawler. We tend to propose a two - stage framework, specifically two smart Crawler, for efficientgathering deep net interfaces. Within the first stage, smart Crawler, performs site-based sorting out centre pages with the assistance of search engines, avoiding visiting an oversized variety of pages. To realize additional correct results for a targeted crawl, smart Crawler, ranks websites to order extremely relevant ones for a given topic. Within the second stage, smart Crawler, achieves quick in – site looking by excavating most relevant links with associate degree accommodative link -ranking.
Authors and Affiliations
Ajit T. Raut , Ajit N. Ogale , Subhash A. Kaigude , Uday D. Chikane
VRIT: An Innovative Approach of Industrial Training through Virtual Reality
The emerging global competition and increasing costs are a great challenge to industries. New cost effective training methods are explored to cope with this demand. In-depth knowledge of the functions in a fact...
Study of P2P Botnet
Abstract: Today, centralized botnets are still widely used. In a centralized botnet, bots are connected to several servers (called C&C servers) to obtain commands. This architecture is easy to construct and eff...
A Secure Data Transmission by Embedding Marked Encrypted Image on Cloak Image
Abstract: A mobile WSN is considered as a collection of wireless mobile nodes and a base station forming an ad-hoc network. This type of network is used in various areas; such as underwater and underground. Each no...
Development of a D.C Circuit Analysis Software Using MicrosoftVisual C#.Net
Abstract: In this paper, the development of D.C circuit simulation software, using Microsoft visual C#.net, hasbeen achieved. This paper aims at (i) analysing a purely resistive planar circuit, (ii) displaying curr...
Privacy Protection in Personalized Web Search Via TaxonomyStructure
Abstract: Web search engine has long become the most important portal for ordinary people looking foruseful information on the web. User might experience failure when search engine return irrelevanceinformation due to en...