AN EFFICIENT APPROACH FOR TEXT MINING USING SIDE INFORMATION

Abstract

 In many text mining applications, information is available in the form of text documents. Besides the text itself, such documents often carry side information, or metadata: document provenance, the document title, links in the document, user-access behavior from web logs, or other non-textual attributes. These attributes can contain a large amount of information useful for clustering. However, the importance of this side information is difficult to estimate when some of it is noisy. In such cases, a principled way to perform the mining is needed, one that maximizes the advantage gained from the side information without degrading the quality of the mining process. Accordingly, this paper presents a solution that uses side information for clustering with a hierarchical algorithm, and then extends it to the classification problem on real data sets.
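The idea of clustering on text plus side information can be illustrated with a minimal sketch. This is not the paper's algorithm: the bag-of-words representation, the average-linkage merge rule, the `side_weight` parameter, and the toy documents are all assumptions made for illustration. The sketch blends a cosine distance on the text with the fraction of side attributes on which two documents disagree, then merges clusters bottom-up.

```python
import math
from collections import Counter


def cosine_distance(a, b):
    """1 - cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
        math.sqrt(sum(v * v for v in b.values()))
    return 1.0 - dot / norm if norm else 1.0


def side_distance(s, t):
    """Fraction of side attributes on which two documents disagree."""
    keys = set(s) | set(t)
    if not keys:
        return 0.0
    return sum(s.get(k) != t.get(k) for k in keys) / len(keys)


def combined_distance(d1, d2, side_weight):
    """Weighted blend of text distance and side-information distance."""
    return ((1.0 - side_weight) * cosine_distance(d1["bow"], d2["bow"])
            + side_weight * side_distance(d1["side"], d2["side"]))


def cluster(docs, k, side_weight=0.3):
    """Average-linkage agglomerative clustering down to k clusters.

    docs is a list of (text, side_attributes_dict) pairs; the result is a
    list of clusters, each a list of document indices.
    """
    items = [{"bow": Counter(text.lower().split()), "side": side}
             for text, side in docs]
    clusters = [[i] for i in range(len(items))]

    def linkage(c1, c2):
        total = sum(combined_distance(items[i], items[j], side_weight)
                    for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > k:
        # Find the closest pair of clusters and merge them.
        a, b = min(((x, y) for x in range(len(clusters))
                    for y in range(x + 1, len(clusters))),
                   key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters


# Hypothetical toy corpus: text plus a single "source" side attribute.
docs = [
    ("stock market prices rise", {"source": "finance"}),
    ("market shares and stock trading", {"source": "finance"}),
    ("football match final score", {"source": "sports"}),
    ("team wins football cup", {"source": "sports"}),
]
result = cluster(docs, 2)
print(result)
```

Setting `side_weight` to 0 reduces this to text-only clustering, while a larger value lets the side attributes dominate; when the side information is noisy, a smaller weight limits the damage it can do.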

Authors and Affiliations

Kiran V. Gaidhane

Download PDF file
  • EP ID EP112504
  • DOI 10.5281/zenodo.58632

How To Cite

Kiran V. Gaidhane (30).  AN EFFICIENT APPROACH FOR TEXT MINING USING SIDE INFORMATION. International Journal of Engineering Sciences & Research Technology, 5(7), 1137-1148. https://europub.co.uk/articles/-A-112504