A Hybrid Algorithm for Document Clustrering Using Concept Factorization

Abstract

 Massive amount of assorted information is available on the web. Clustering is one of the techniques to deal with huge amount of information. Clustering partitions a data set into groups where data objects in each group should exhibit large measure of resemblance. Objects with high resemblance measure should be placed in a cluster (intra cluster). Resemblance between the objects of different clusters should be less (inter cluster). The most commonly used partitioning-based clustering algorithm, the K-means algorithm, is more suitable for bulky datasets. K-means algorithm is simple, straightforward, easy to implement and works well in many applications. K means algorithm has the limitation of generating local optimal solution. Harmony Search Method (HSM) is a new meta- heuristic optimization method which imitates the music improvisation process. HSM has been a successful technique in a wide variety of optimization problem. Better results can be obtained by hybridizing K-means with HSM. In conventional clustering methods, Term Frequency and Inverse Document Frequency(TF-IDF) of a feature can be calculated and the documents are clustered. In, the projected work an effort has been made to apply the concept factorization method for document clustering problem, to find optimal clusters in sufficient amount of time.

Authors and Affiliations

Siamala Devi S *

Keywords

Related Articles

BIOREMEDIATION OF LOW GRADE ORES

The research work presented in this paper is on a Bioremediation for the recovery of zinc from mining waste i.e. Low grade ore of Hindustan Zinc Limited. They are waste product for the mines, as the recovery process is...

 RESEARCH ON OPTIMIZATION MODEL OF THE LINKAGE TOU TARIFF UNDER THE MODE OF CHINA ELECTRIC POWER SYSTEM

 Under China's current power system mode, the current TOU tariff is mostly considered and formulated from the sale side or the generation side, TOU tariff of the linkage in generation side and the sale side has not...

 An Efficient Pricing Mechanism for Mobile Video Streaming in Cloud Computing

 Among the most popular consumer devices, the mobile phones are the one whose usage has increased in a rapid manner and along with the development of 3G networks and smart phones, enabled users to use them in an e...

ASSESSMENT OF EMERGENCY ESCAPE ROUTES FOR A BUILDING USING PATHFINDER - A CASE STUDY

Evacuation planning is critical for important applications to evacuate affected populations to safer places in the event of natural disasters, fire, industrial and constructional accidents. Currently, evacuation plan...

A Study of Various Techniques of Preparing Magnetic Abrasives

With the development in the industries like Aeronautics, Optical Electronics, Medical instruments and Nuclear Reactors the need of part surface finish and geometric precision has increased drastically. The Magnetically...

Download PDF file
  • EP ID EP137858
  • DOI -
  • Views 86
  • Downloads 0

How To Cite

Siamala Devi S * (30).  A Hybrid Algorithm for Document Clustrering Using Concept Factorization. International Journal of Engineering Sciences & Research Technology, 3(7), 269-275. https://europub.co.uk/articles/-A-137858