Extraction of Core Contents from Web Pages
Journal Title: INTERNATIONAL JOURNAL OF ENGINEERING TRENDS AND TECHNOLOGY - Year 2014, Vol 8, Issue 9
Abstract
The information available on web pages mostly contains semi-structured text documents which are represented either in XML, or HTML, or XHTML format that lacks formatted document structure. The document does not discriminate between the text and the schema that represent the text. Also the amount of structure used to represent the text depends on the purpose and size of text document. No semantic is applied to semi-structured documents. This requires extracting core contents of text document to analyse words or sentences to generate useful knowledge. This paper discusses several techniques and approaches useful for extracting core content from semi-structured text documents and their merits and demerits
Authors and Affiliations
Sandeep Sirsat
Caching in Wireless Sensor Networks: A Survey
Wireless Sensor Networks are exploited in multiple applications. There are some issues like important data loss, many times during transmission and reception process by the small tiny sensor nodes presents in the WSNs. F...
A Wavelet Based Denoising of Speech Signal
In this Paper we introduce an enhancement terminology in speech processing.Speech enhancement involves processing speech signals for human listening or as preparation for further processing before listening. The enhancem...
Comparison Of Compressive Strength Of Medium Strength Self Compacted Concrete By Different Curing Techniques
In this paper variation in compressive strength of medium strength, self-compacted concrete with 3 different curing techniques is discussed. Initially several trials were carried out for mix design of medium streng...
The Evaluation of Forecasting Methods for Sales of Sterilized Flavoured Milk in Chhattisgarh
In recent years, there has been a great deal of discussion on applications of various forecasting models and their performance in forecasting business activities. This paper discussed few of forecasting models and their...
Fingerprint Identification Using Minutiae Matching
In the modern computerized world, it has become more and more important to authenticate people in a secure way. Modern applications like online banking or online shopping use techniques that depend on personal iden...