Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2015, Vol 17, Issue 6
Abstract
Abstract: The internet is a vast collection of billions of web pages containing terabytes of information arranged in thousands of servers using HTML. The size of this collection itself is a formidable obstacle in retrieving necessary and relevant information. This made search engines an important part of our lives. Search engines strive to retrieve information as relevant as possible. One of the building blocks of search engines is the Web Crawler. We tend to propose a two - stage framework, specifically two smart Crawler, for efficientgathering deep net interfaces. Within the first stage, smart Crawler, performs site-based sorting out centre pages with the assistance of search engines, avoiding visiting an oversized variety of pages. To realize additional correct results for a targeted crawl, smart Crawler, ranks websites to order extremely relevant ones for a given topic. Within the second stage, smart Crawler, achieves quick in – site looking by excavating most relevant links with associate degree accommodative link -ranking.
Authors and Affiliations
Ajit T. Raut , Ajit N. Ogale , Subhash A. Kaigude , Uday D. Chikane
Using Artificial Intelligence Techniques For Epilepsy Treatment
Abstract: Epilepsy is a combination of neurological disorders that causes people to have seizure. Immediate seizures occurring might cause injuries of the patients or other. Recent studies of epilepsy are based on two ap...
Pattern of Epistaxis of Patients Attending in A Tertiary Care Hospital of Tripura, Northeastern Region of India
Epistaxis is one of the most frequently encountered emergencies reported to occur in up to 60% of the general population. It has a bimodal age presentation with incidence peaks in below 25 years and above 50 years of age...
Dynamic Passwords Using Graphics
Abstract:Textual passwords are most common method used for authentication. But textual passwords are vulnerable to eves dropping, dictionary attacks, social engineering and shoulder surfing. Graphical passwords are intro...
Empirical Study of 2-bit Fast Adder using Simon 2.0
Abstract : The present context of post CMOS era demands highly sophisticated low power consuming high speed novel integrated chips in nanometer region. SET (Single Electron Transistor) is eventually the highest priority...
Performance Evaluation of IPv4 Vs Ipv6 and TunnellingTechniques Using Optimized Network Engineering Tools(OPNET)
Abstract: Internet Protocol version 6 (IPv6) is the latest version of the Internet Protocol (IP). IPv6 is intendedto replace IPv4, which is still widely used, in order to deal with the problem of IPv4 address exhau...