Hierarchical classification of web content using Naïve Bayes approach
Journal Title: International Journal on Computer Science and Engineering - Year 2013, Vol 5, Issue 5
Abstract
This paper explores the use of hierarchical structure to classify a heterogeneous collection of web pages. In hierarchical classification, a model learns to distinguish a second-level category from the other categories within the same top-level category. In flat, non-hierarchical classification, a model distinguishes a second-level category from all existing second-level categories. We use the Naïve Bayes classifier, which has been shown to be effective for web content classification but has not previously been explored for hierarchical classification. This paper analyses the feasibility of a web page classifier that exploits the hierarchical structure of categories, and studies its recall, precision and F-measure scores.
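The difference between the two setups can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the features, smoothing, or data set, so the multinomial Naïve Bayes with add-one smoothing, the whitespace tokenizer, the toy `TRAIN` data, and all function names below are assumptions made for illustration only.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (an assumed variant)."""

    def fit(self, docs, labels):
        self.word_counts = defaultdict(Counter)  # label -> token counts
        self.doc_counts = Counter(labels)        # label -> number of documents
        self.vocab = set()
        for text, label in zip(docs, labels):
            tokens = text.lower().split()
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        tokens = text.lower().split()
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        # Sorted iteration keeps tie-breaking deterministic.
        for label in sorted(self.doc_counts):
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(self.doc_counts[label] / total_docs)
            score += sum(math.log((counts[t] + 1) / denom) for t in tokens)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy data: (top-level category, second-level category, page text).
TRAIN = [
    ("sports", "football", "goal striker penalty league match"),
    ("sports", "tennis", "serve racket court set match"),
    ("science", "physics", "quantum particle energy field experiment"),
    ("science", "biology", "cell gene protein organism experiment"),
]


def train_flat(data):
    """Flat setup: one model over all second-level categories."""
    return NaiveBayes().fit([t for _, _, t in data], [s for _, s, _ in data])


def train_hierarchical(data):
    """Hierarchical setup: a top-level model plus one model per top-level
    category that only separates that category's own second-level children."""
    top = NaiveBayes().fit([t for _, _, t in data], [c for c, _, _ in data])
    second = {}
    for parent in {c for c, _, _ in data}:
        subset = [(s, t) for c, s, t in data if c == parent]
        second[parent] = NaiveBayes().fit(
            [t for _, t in subset], [s for s, _ in subset]
        )
    return top, second


def predict_hierarchical(model, text):
    """Pick the top-level category first, then a child within it."""
    top, second = model
    parent = top.predict(text)
    return parent, second[parent].predict(text)


flat_model = train_flat(TRAIN)
hier_model = train_hierarchical(TRAIN)
print(flat_model.predict("racket serve court"))
print(predict_hierarchical(hier_model, "racket serve court"))
```

In this sketch each second-level model in the hierarchical setup sees only the documents of its own top-level category, whereas the flat model must separate every second-level category from all the others at once.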
Authors and Affiliations
Neetu