A Novel Approach for Text Categorization of Unorganized data based with Information Extraction

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 7

Abstract

The Internet has profoundly changed the lives of many enthusiastic innovators and researchers. The information available on the web has opened the door to knowledge discovery, leading to a new information era. Unfortunately, most search engines return web content that is irrelevant to what the user is looking for. Many text categorization techniques for web content have been developed to recognize a given document's category, but they have failed to produce trustworthy results. This paper focuses primarily on web content categorization based on a classic summarization technique that enables classification at the word level. The web document is first preprocessed, which involves filtering the content with classical techniques, and is then converted into organized data. The organized data is then matched against a predefined hierarchical set of categories to identify the exact category.
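The pipeline outlined in the abstract (preprocess, filter, organize, then match against a category hierarchy at the word level) might look roughly like the Python sketch below. This is an illustration under stated assumptions, not the authors' method: the stop-word list, the two-level `CATEGORIES` hierarchy, its keywords, and the function names `preprocess` and `categorize` are all hypothetical, and simple keyword counting stands in for whatever summarization and matching technique the paper actually uses.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the abstract only mentions "classical techniques".
STOP_WORDS = {"the", "a", "an", "and", "of", "in", "to", "is", "for", "on", "with"}

# Hypothetical two-level category hierarchy with illustrative keywords per leaf.
CATEGORIES = {
    "Sports": {
        "Cricket": {"wicket", "batsman", "innings"},
        "Football": {"goal", "striker", "penalty"},
    },
    "Science": {
        "Physics": {"quantum", "particle", "energy"},
        "Biology": {"cell", "gene", "protein"},
    },
}


def preprocess(html):
    """Strip markup, lowercase, tokenize, and drop stop words.

    Returns word counts, used here as the "organized data".
    """
    text = re.sub(r"<[^>]+>", " ", html)        # remove HTML tags
    tokens = re.findall(r"[a-z]+", text.lower())  # keep alphabetic words only
    return Counter(t for t in tokens if t not in STOP_WORDS)


def categorize(word_counts):
    """Return the (parent, child) path whose keywords occur most often,
    or None when no keyword matches at all."""
    best_path, best_score = None, 0
    for parent, children in CATEGORIES.items():
        for child, keywords in children.items():
            score = sum(word_counts[w] for w in keywords)
            if score > best_score:
                best_path, best_score = (parent, child), score
    return best_path
```

For example, `categorize(preprocess("<p>The striker scored a goal after the penalty.</p>"))` walks both levels of the hypothetical hierarchy and selects the leaf with the most keyword hits. A real system would need a far larger vocabulary and some form of tie-breaking.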

Authors and Affiliations

Suneetha Manne, Dr. S. Sameen Fatima

How To Cite

Suneetha Manne, Dr. S. Sameen Fatima (2011). A Novel Approach for Text Categorization of Unorganized data based with Information Extraction. International Journal on Computer Science and Engineering, 3(7), 2846-2854. https://europub.co.uk/articles/-A-124382