AN EFFICIENT APPROACH FOR TEXT MINING USING SIDE INFORMATION

Abstract

 Nowadays, in many text mining applications, information is present in the form of text documents. Text document contains various types of information such as side information or metadata. The different types of information such as document provenance information, title of the document, links in the document, user-access behavior from web logs, or other non-textual attributes treated as side information contained into the text document. Such attributes contains a large amount of information for clustering purposes. It is difficult to estimate the importance of this sideinformation when text document contains some of the information is noisy. In such cases, to avoid the low quality of mining process we need a principled way to perform the text mining, to maximize the advantages from using this side information. Conformation to that, this paper represents solution to the use of side information for clustering by hierarchical algorithm which then extends to the classification problem on real data sets.

Authors and Affiliations

Kiran V. Gaidhane

Keywords

Related Articles

A REVIEW PAPER ON AN EMBEDDED EXTENDED VISUAL CRYPTOGRAPHY SCHEME FOR COLOR IMAGE USING LPG WITH PCA

A method for creating digital image copyright protection is proposed in this paper. The proposed method in this paper is based on visual cryptography using LPG with PCA. The proposed method is working on selection of ran...

Study on the use of Recycled Aggregate in concrete

Recycling is the act of processing the used material for use in creating new product. The usage of natural aggregate is getting more and more intense with the advanced development in infrastructu of natural aggregate,...

 SOLAR VEHICLES

 A solar vehicle is an electric vehicle powered completely or significantly by direct solar energy. Usually, photovoltaic (PV) cells contained in solar panels convert the sun’s energy directly into electric energy....

 THE OPTIMIZATION ALGORITHM OF SEPARATION PROCESS IN ION CHROMATOGRAPHY

 In the article were elaborated evaluation and optimization of separation processes of the mixtures of complex composition by ion chromatography using a semi-empirical approach based on the use of the size and the...

MECHANICAL BEHAVIOR OF FLY ASH IMPREGNATED NATURAL FIBRE REINFORCED POLYMER COMPOSITE

A composite material is the combination of two or more materials, which are having different phases and the properties superior to the base material. The effect of the coir fiber and 75μm flyash particles on mechanical...

Download PDF file
  • EP ID EP112504
  • DOI 10.5281/zenodo.58632
  • Views 90
  • Downloads 0

How To Cite

Kiran V. Gaidhane (30).  AN EFFICIENT APPROACH FOR TEXT MINING USING SIDE INFORMATION. International Journal of Engineering Sciences & Research Technology, 5(7), 1137-1148. https://europub.co.uk/articles/-A-112504