Scuttling Web Opportunities By Application Cramming

Abstract

The web holds a vast amount of data spread across innumerable websites, which are traversed by a program known as a crawler. This paper focuses on web forum crawling: it gives an overview of web crawling and web forums, and discusses the techniques used by web forum crawlers and the challenges they face. The internet is growing exponentially, and retrieving relevant information from it has become difficult. This rapid growth poses unprecedented scaling challenges for general-purpose crawlers and search engines. We present Forum Crawler under Supervision (FoCUS), a supervised, internet-scale forum crawler. The aim of FoCUS is to crawl relevant forum content with minimal overhead. Rather than collecting and indexing every accessible web document so that it can answer all possible ad hoc queries, the crawler selectively seeks out pages that are pertinent to a predefined set of topics. FoCUS crawls the internet continuously, detecting pages that have been added and pages that have been removed. Because the internet is so large and changes so actively, navigating and managing all of the URLs found in web documents has become increasingly challenging. The crawler takes one seed URL and a keyword as input, and fetches the pages in which that keyword is found.
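The last step of the abstract (one seed URL, one keyword, return the pages that contain the keyword) can be illustrated with a minimal Python sketch. This is not the FoCUS implementation, which the abstract does not specify. It is a plain breadth-first crawl under stated assumptions: the names `keyword_crawl`, `LinkParser`, `fetch`, `max_pages` and the `forum.example` pages are hypothetical, and page fetching is passed in as a function so the example runs without network access.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkParser(HTMLParser):
    """Collects href targets and text nodes from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        self.text.append(data)


def keyword_crawl(seed_url, keyword, fetch, max_pages=50):
    """Breadth-first crawl from seed_url; return URLs whose text contains keyword.

    fetch(url) is a caller-supplied function (an assumption of this sketch)
    that returns the page HTML as a string, or None if the page is unavailable.
    """
    queue = deque([seed_url])
    seen = {seed_url}          # URLs already queued, to avoid revisiting
    matches = []
    fetched = 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        html = fetch(url)
        fetched += 1
        if html is None:
            continue
        parser = LinkParser()
        parser.feed(html)
        # Case-insensitive keyword match against the page text.
        if keyword.lower() in " ".join(parser.text).lower():
            matches.append(url)
        for href in parser.links:
            # Resolve relative links and drop #fragments before deduplicating.
            link = urldefrag(urljoin(url, href)).url
            if link.startswith(("http://", "https://")) and link not in seen:
                seen.add(link)
                queue.append(link)
    return matches


# Hypothetical in-memory "forum" standing in for real HTTP requests.
PAGES = {
    "http://forum.example/": '<a href="/t/1">Thread 1</a> <a href="/t/2">Thread 2</a>',
    "http://forum.example/t/1": '<p>Tips on web crawler design</p> <a href="/">Home</a>',
    "http://forum.example/t/2": '<p>Off-topic chat</p> <a href="/t/1#reply">see thread 1</a>',
}

result = keyword_crawl("http://forum.example/", "crawler", PAGES.get)
print(result)
```

In real use, `fetch` would wrap an HTTP client and should respect each site's robots.txt and rate limits. A focused crawler of the kind the abstract describes would also decide which links to follow based on relevance, which this sketch does not attempt.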

Authors and Affiliations

Dhulipalla Vijaya Sree | M.Tech Student, Department of Computer Science Engineering, G.V.R&S College of Engineering & Technology, Guntur

Alahari Hanumat Prasad | Department of Computer Science Engineering, G.V.R&S College of Engineering & Technology, Guntur

How To Cite

Dhulipalla Vijaya Sree, Alahari Hanumat Prasad (2014). Scuttling Web Opportunities By Application Cramming. International Journal of Science Engineering and Advance Technology, 2(10), 495-498. https://europub.co.uk/articles/-A-16332