A Hybrid Algorithm for Document Clustrering Using Concept Factorization

Abstract

 Massive amount of assorted information is available on the web. Clustering is one of the techniques to deal with huge amount of information. Clustering partitions a data set into groups where data objects in each group should exhibit large measure of resemblance. Objects with high resemblance measure should be placed in a cluster (intra cluster). Resemblance between the objects of different clusters should be less (inter cluster). The most commonly used partitioning-based clustering algorithm, the K-means algorithm, is more suitable for bulky datasets. K-means algorithm is simple, straightforward, easy to implement and works well in many applications. K means algorithm has the limitation of generating local optimal solution. Harmony Search Method (HSM) is a new meta- heuristic optimization method which imitates the music improvisation process. HSM has been a successful technique in a wide variety of optimization problem. Better results can be obtained by hybridizing K-means with HSM. In conventional clustering methods, Term Frequency and Inverse Document Frequency(TF-IDF) of a feature can be calculated and the documents are clustered. In, the projected work an effort has been made to apply the concept factorization method for document clustering problem, to find optimal clusters in sufficient amount of time.

Authors and Affiliations

Siamala Devi S *

Keywords

Related Articles

EFFECT OF CARBON AND NITROGEN SOURCES ON THE PRODUCTION OF KERATINASE FROM STAPHYLOCOCCUS AUREUS

Keratinase enzymes are mainly used in dehairing process in leather industry instead of sodium sulphides and these are also used as detergent to remove stains on cloth. The microorganisms producing keratinase were isola...

 Microstructure and Mechanical Properties of Al6061-Sicp Casted Composites

 Stir casting process is used for producing discontinuous particle reinforced metal matrix composites for decades.To obtain sufficient wetting of particle by liquid metal and to get a homogenous dispersion of the c...

Wireless Automation of an Electrical Drive using Bluetooth

Industrial automation in the present day requires effective feedback-oriented mechanisms which can control as well as monitor electrical, electronic and mechanical systems. Traditionally, feedback based automation syst...

 IDENTIFYING OPTIMAL NUMBER OF ORTHONORMALISATION IN LEARNING ALGORITHM USING WEATHER FORECASTING

 Artificial neural networks are more powerful than any other traditional expert system in the classification of patterns, which are non linear and in performing pattern classification tasks because they learn f...

 Critical Event Monitoring in WSNS using Level-By-Level Offset Based Wake up Pattern

 This paper proposed to monitor a critical event in wireless sensor networks. Whenever a critical event occurs, the critical event is detected by the nearby sensor nodes. Immediately these sensor nodes should broad...

Download PDF file
  • EP ID EP137858
  • DOI -
  • Views 72
  • Downloads 0

How To Cite

Siamala Devi S * (30).  A Hybrid Algorithm for Document Clustrering Using Concept Factorization. International Journal of Engineering Sciences & Research Technology, 3(7), 269-275. https://europub.co.uk/articles/-A-137858