Semantic Based Cluster Content Discovery in Description First Clustering Algorithm

Abstract

In the field of data analytics grouping of like documents in textual data is a serious problem. A lot of work has been done in this field and many algorithms have purposed. One of them is a category of algorithms which firstly group the documents on the basis of similarity and then assign the meaningful labels to those groups. Description first clustering algorithm belong to the category in which the meaningful description is deduced first and then relevant documents are assigned to that description. LINGO (Label Induction Grouping Algorithm) is the algorithm of description first clustering category which is used for the automatic grouping of documents obtained from search results. It uses LSI (Latent Semantic Indexing); an IR (Information Retrieval) technique for induction of meaningful labels for clusters and VSM (Vector Space Model) for cluster content discovery. In this paper we present the LINGO while it is using LSI during cluster label induction and cluster content discovery phase. Finally, we compare results obtained from the said algorithm while it uses VSM and Latent semantic analysis during cluster content discovery phase.

Authors and Affiliations

Muhammad Waseem Khan, Hafiz Muhammad Shahzad Asif, Yasir Saleem

Keywords

Related Articles

Sensor-Fusion Based Navigation for Mobile Robot in Outdoor Environment

Autonomous navigation of the vehicles or robots is very challenging and useful task used by many scientists and researchers these days. By keeping this fact in mind, an algorithm for autonomous navigation of mobile robot...

Effect of Compaction on Compressive Strength of Unfired Clay Blocks

This study investigates the possible use of unfired compacted clay blocks as a substitute of CSEB (Compressed Stabilized Earth Blocks) for the construction of economical houses. Cubes of 150 mm size were cut from the cla...

Change Detection Algorithms for Surveillance in Visual IoT: A Comparative Study

The VIoT (Visual Internet of Things) connects virtual information world with real world objects using sensors and pervasive computing. For video surveillance in VIoT, ChD (Change Detection) is a critical component. ChD a...

Solving Real-Life Problems: Future Mobile Technology Sophistication

Almost all the human being real life concerned domains are taking advantage of latest technologies for enhancing their process, procedures and operations. This integration of technological innovations provides ease of ac...

Calibration and Validation of an Experimental Setup for the Measurement of the Cylindrical Body Shapes and Curvatures of the Objects and Subjects through the Techniques of Rasterstereography

The intent of study is to establish a criterion for the experimental setup of rasterstereography, one that is more efficient, simple, accurate and precise to examine and analyse the curvature of the object or the subject...

Download PDF file
  • EP ID EP194479
  • DOI 10.22581/muet1982.1701.01
  • Views 110
  • Downloads 0

How To Cite

Muhammad Waseem Khan, Hafiz Muhammad Shahzad Asif, Yasir Saleem (2017). Semantic Based Cluster Content Discovery in Description First Clustering Algorithm. Mehran University Research Journal of Engineering and Technology, 36(1), 1-6. https://europub.co.uk/articles/-A-194479