A Hybrid Algorithm for Document Clustrering Using Concept Factorization

Abstract

 Massive amount of assorted information is available on the web. Clustering is one of the techniques to deal with huge amount of information. Clustering partitions a data set into groups where data objects in each group should exhibit large measure of resemblance. Objects with high resemblance measure should be placed in a cluster (intra cluster). Resemblance between the objects of different clusters should be less (inter cluster). The most commonly used partitioning-based clustering algorithm, the K-means algorithm, is more suitable for bulky datasets. K-means algorithm is simple, straightforward, easy to implement and works well in many applications. K means algorithm has the limitation of generating local optimal solution. Harmony Search Method (HSM) is a new meta- heuristic optimization method which imitates the music improvisation process. HSM has been a successful technique in a wide variety of optimization problem. Better results can be obtained by hybridizing K-means with HSM. In conventional clustering methods, Term Frequency and Inverse Document Frequency(TF-IDF) of a feature can be calculated and the documents are clustered. In, the projected work an effort has been made to apply the concept factorization method for document clustering problem, to find optimal clusters in sufficient amount of time.

Authors and Affiliations

Siamala Devi S *

Keywords

Related Articles

 Principles of Ubiquitous Computing Systems

 This paper provides a concise summary of pervasive computing and also the challenges faced in computer systems research posed by the emerging field of pervasive computing. This papper probes the relationship of th...

 Classifying Energy Feature for Video Segmentation

 Video segmentation is a major role in digital image processing. This paper provides methodology for detecting moving object presented in the given input video. First step is converting input video into frames. Sec...

STUDY ON DURABILITY PROPERTIES OF RECYCLED AGGREGATE CONCRETE INCORPORATED WITH SILICA FUME AND MINERAL QUARTZ

Disposal of construction waste is now new challenge for the construction industry in this era. This is peak time to use Construction waste as recycled aggregate (RA) in concrete manufacturing for sustainable development...

 Fine Grain Dynamically Reconfigurable Architecture for CMOS Sram

 Cell stability and area are among the major concerns in SRAM cell designs. This paper compares the performance of three SRAM cell topologies which include the conventional 6T-cell,8T-cell and 10T-cell. The cmos d...

 A Novel Approach for Cloud as a Forensic Computing Perspective

 Cloud computing may well become one of the most transformative technologies in the history of computing. The benefits of ‘cloud computing’ increase challenges in maintaining data security and data privacy have al...

Download PDF file
  • EP ID EP137858
  • DOI -
  • Views 78
  • Downloads 0

How To Cite

Siamala Devi S * (30).  A Hybrid Algorithm for Document Clustrering Using Concept Factorization. International Journal of Engineering Sciences & Research Technology, 3(7), 269-275. https://europub.co.uk/articles/-A-137858