Hierarchical classification of web content using Naïve Bayes approach

Journal Title: International Journal on Computer Science and Engineering - Year 2013, Vol 5, Issue 5

Abstract

This paper explores the use of hierarchical structure to classify a heterogeneous collection of web pages. In the hierarchical classification, a model learns to distinguish a second level category from all other categories that are within the same top level. In the flat non hierarchical classification, a model distinguishes a second level category from all existing second level categories. We use Naïve Bayes classifier which has been proved to be effective for web content classification, but has not been previously explored in the case of hierarchical classification. This paper analyses the feasibility of a web page classifier which exploits the hierarchical structure of categories and studies their recall, precision and Fmeasure scores.

Authors and Affiliations

Neetu

Keywords

Related Articles

Rebroadcasting for Routing Reduction based upon Neighbor coverage in Ad Hoc Networks

Cause of nodes high mobility in mobile ad hoc networks (MANETs), there are frequent link breakages exist which escort to frequent route discoveries and path failures. The route discovery procedure cannot be ignored. In a...

Sliding window approach based Text Binarisation from Complex Textual images

Abstract— Text binarisation process classifies individual pixels as text or background in the textual images. Binarization is necessary to bridge the gap between localization and recognition by OCR. This paper presents...

Recognizing faces with single sample per subject using fusion of transforms

Face recognition has attracted attention of the researchers. Face recognition becomes challenging if various factors are considered such as varying illumination, pose, facial expression and somewhat occlusion. The face r...

IMPROVED ROUND ROBIN POLICY A MATHEMATICAL APPROACH

This work attempts to mathematically formulize the computation of waiting time of any process in a static -process, CPU-bound round robin scheme. That in effect, can calculate other performance measures also. An improv...

An Evident Theoretic Feature Selection Approach for Text Categorization

With the exponential growth of textual documents available in unstructured form on the Internet, feature selection approaches are increasingly significant for the preprocessing of textual documents for automatic text cat...

Download PDF file
  • EP ID EP161758
  • DOI -
  • Views 113
  • Downloads 0

How To Cite

Neetu (2013). Hierarchical classification of web content using Naïve Bayes approach. International Journal on Computer Science and Engineering, 5(5), 402-408. https://europub.co.uk/articles/-A-161758