A Review on Identifying the Main Content From Web Pages
Journal Title: UNKNOWN - Year 2015, Vol 4, Issue 4
Abstract
A web page is a web document that holds a large amount of information. With the rapid growth of the World Wide Web, users can easily access web pages from any place through the Internet. A web page contains both the main content and noisy information such as menus, footers, unnecessary links, and logos. Most users are interested only in the main content. Because a web page mixes noisy and main content, the noise degrades the performance of applications such as web summarization, question answering systems, and information retrieval. We therefore propose an extraction process for identifying the main content of web pages. The extraction process consists of automatic extraction techniques and hand-crafted rules. In the automatic extraction techniques, the first step segments the web page into blocks, and the second step differentiates the main content from irrelevant or noisy content. The hand-crafted rule process extracts the main content from web pages using rules that have already been generated.
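The two automatic steps described above (segment the page into blocks, then separate main content from noise) can be illustrated with a minimal sketch. It is not the method of any particular paper covered by this review: it assumes that blocks are delimited by common block-level HTML tags and that a noisy block is one that is short or made up mostly of link text. The names `BlockSegmenter` and `extract_main_content`, the tag set, and the thresholds `min_words` and `max_link_density` are hypothetical choices made for illustration.

```python
from html.parser import HTMLParser

# Assumed block boundaries; a real segmenter might use the DOM tree or visual layout.
BLOCK_TAGS = {"div", "p", "td", "li", "ul", "section", "article",
              "nav", "header", "footer"}
SKIP_TAGS = {"script", "style"}


class BlockSegmenter(HTMLParser):
    """Step 1: split a page into text blocks at block-level tag boundaries."""

    def __init__(self):
        super().__init__()
        self.blocks = []        # finished blocks as (text, link_chars)
        self._pieces = []       # text pieces of the block being built
        self._link_chars = 0    # characters of the current block inside <a>
        self._in_link = 0
        self._skip = 0

    def flush(self):
        """Close the current block and start a new one."""
        text = " ".join(self._pieces)
        if text:
            self.blocks.append((text, self._link_chars))
        self._pieces, self._link_chars = [], 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip += 1
        elif tag == "a":
            self._in_link += 1
        elif tag in BLOCK_TAGS:
            self.flush()

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS:
            self._skip = max(0, self._skip - 1)
        elif tag == "a":
            self._in_link = max(0, self._in_link - 1)
        elif tag in BLOCK_TAGS:
            self.flush()

    def handle_data(self, data):
        if self._skip:
            return
        piece = " ".join(data.split())  # normalize whitespace
        if not piece:
            return
        self._pieces.append(piece)
        if self._in_link:
            self._link_chars += len(piece)


def extract_main_content(html, min_words=10, max_link_density=0.3):
    """Step 2: keep blocks that are long enough and not dominated by links."""
    segmenter = BlockSegmenter()
    segmenter.feed(html)
    segmenter.close()
    segmenter.flush()  # emit any trailing block
    main_blocks = []
    for text, link_chars in segmenter.blocks:
        link_density = link_chars / len(text)
        if len(text.split()) >= min_words and link_density <= max_link_density:
            main_blocks.append(text)
    return main_blocks
```

Calling `extract_main_content(page_html)` returns the list of blocks judged to be main content. Menus and footers tend to be filtered out because they are short and consist almost entirely of anchor text; the thresholds would need tuning for any real collection of pages.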