AN EFFICIENT APPROACH FOR TEXT MINING USING SIDE INFORMATION

Abstract

In many text mining applications, information is available in the form of text documents. Such documents often carry side information (metadata) alongside the text itself: document provenance, the document title, links within the document, user-access behavior from web logs, and other non-textual attributes. These attributes can contain a large amount of information that is useful for clustering. However, it is difficult to estimate the importance of this side information when some of it is noisy. In such cases, a principled way to perform the mining is needed, one that maximizes the benefit of the side information without lowering the quality of the mining process. Accordingly, this paper presents an approach that uses side information for clustering with a hierarchical algorithm and then extends it to the classification problem on real data sets.
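The idea of clustering on text plus side information can be illustrated with a minimal sketch. The abstract does not give the paper's algorithm, so everything below is an assumption made for illustration: the weight `alpha`, the use of cosine similarity on term counts for text, Jaccard similarity on side attributes, average-linkage merging, and the toy documents and attribute names are all hypothetical and are not the author's method.

```python
import math
from collections import Counter


def tf_vector(text):
    """Term-count vector for lowercased, whitespace-tokenized text."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def jaccard(a, b):
    """Jaccard similarity between two sets of side attributes."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def combined_similarity(doc_a, doc_b, alpha=0.7):
    """Weighted mix of text and side-information similarity.

    alpha is an assumed tuning knob: lowering it trusts the side
    information more, which is risky when that information is noisy.
    """
    text_sim = cosine(tf_vector(doc_a["text"]), tf_vector(doc_b["text"]))
    side_sim = jaccard(set(doc_a["side"]), set(doc_b["side"]))
    return alpha * text_sim + (1 - alpha) * side_sim


def agglomerative(docs, k, alpha=0.7):
    """Average-linkage agglomerative clustering down to k clusters."""
    n = len(docs)
    sim = [[combined_similarity(docs[i], docs[j], alpha) for j in range(n)]
           for i in range(n)]
    clusters = [[i] for i in range(n)]
    while len(clusters) > k:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                pairs = [sim[i][j] for i in clusters[x] for j in clusters[y]]
                score = sum(pairs) / len(pairs)
                if best is None or score > best[0]:
                    best = (score, x, y)
        _, x, y = best
        # Merge the most similar pair of clusters.
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(c) for c in clusters]


# Hypothetical toy documents; "side" holds non-textual attributes.
docs = [
    {"text": "stock market prices fall", "side": ["finance.example", "author:a"]},
    {"text": "market shares and stock trading", "side": ["finance.example"]},
    {"text": "football match final score", "side": ["sports.example"]},
    {"text": "tennis match score update", "side": ["sports.example", "author:b"]},
]
clusters = agglomerative(docs, k=2)
print(clusters)
```

In this sketch a single fixed weight stands in for the harder problem the abstract raises, namely deciding how much to trust side information that may be noisy.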

Authors and Affiliations

Kiran V. Gaidhane

  • EP ID EP112504
  • DOI 10.5281/zenodo.58632

How To Cite

Kiran V. Gaidhane (30). AN EFFICIENT APPROACH FOR TEXT MINING USING SIDE INFORMATION. International Journal of Engineering Sciences & Research Technology, 5(7), 1137-1148. https://europub.co.uk/articles/-A-112504