A Novel Approach for Text Categorization of Unorganized data based with Information Extraction

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 7

Abstract

Internet has made a profound change in the lives of many enthusiastic innovators and researchers. The information available on the web has knocked the doors of Knowledge Discovery leading to a new Information era. Unfortunately, most Search Engines provide web content which is irrelevant to the information intended to the browser. Many Text Categorization techniques for web content have been developed, to recognize the given document’s category but failed to make trust worthy results. This paper primarily focuses on web content categorization based on classic summarization technique by enabling the classification at word level. The web document is preprocessed first which involves filtering the content with classical techniques and then is converted into organized data. The organized data is then treated with predefined hierarchical categorical set to identify the exact category.

Authors and Affiliations

Suneetha Manne , Dr. S. sameen Fatima

Keywords

Related Articles

A Novel Algorithm for Scaling up the Accuracy of Decision Trees

Classification is one of the most efficient and widely used data mining technique. In classification, Decision trees can handle high dimensional data, and their representation is intuitive and generally easy to assimilat...

Various Schemes to Speed up the PC during Virus Scan

The current threat landscape is changing and we have seen a large volume of new viruses captured by security vendors each day. Customers always complain that anti-virus software slow down their computers by consuming muc...

PREPROCESSING OF WEB LOGS

Today’s real world databases are highly susceptible to noisy, missing and inconsistent data due to their typically huge size data and their origin from multiple, heterogeneous sources. Hence, pre-processing of data is ne...

A Mid – Point based k-mean Clustering Algorithm for Data mining

In k-means clustering algorithm, the number of centroids is equal to the number of the clusters in which data has to be partitioned which in turn is taken as an input parameter. The initial centroids in original k-means...

Quantum Computation and Consciousness in Cyclic and Mythological Models of Universe

Cyclic models such as Steinhardt-Turok model, Baum-Frampton model, and CCC models have been proposed for the universe. It has been postulated that the value of the physical constants in different aeons may possibly be di...

Download PDF file
  • EP ID EP124382
  • DOI -
  • Views 128
  • Downloads 0

How To Cite

Suneetha Manne, Dr. S. sameen Fatima (2011). A Novel Approach for Text Categorization of Unorganized data based with Information Extraction. International Journal on Computer Science and Engineering, 3(7), 2846-2854. https://europub.co.uk/articles/-A-124382