A Hybrid Algorithm for Document Clustrering Using Concept Factorization
Journal Title: International Journal of Engineering Sciences & Research Technology - Year 30, Vol 3, Issue 7
Abstract
Massive amount of assorted information is available on the web. Clustering is one of the techniques to deal with huge amount of information. Clustering partitions a data set into groups where data objects in each group should exhibit large measure of resemblance. Objects with high resemblance measure should be placed in a cluster (intra cluster). Resemblance between the objects of different clusters should be less (inter cluster). The most commonly used partitioning-based clustering algorithm, the K-means algorithm, is more suitable for bulky datasets. K-means algorithm is simple, straightforward, easy to implement and works well in many applications. K means algorithm has the limitation of generating local optimal solution. Harmony Search Method (HSM) is a new meta- heuristic optimization method which imitates the music improvisation process. HSM has been a successful technique in a wide variety of optimization problem. Better results can be obtained by hybridizing K-means with HSM. In conventional clustering methods, Term Frequency and Inverse Document Frequency(TF-IDF) of a feature can be calculated and the documents are clustered. In, the projected work an effort has been made to apply the concept factorization method for document clustering problem, to find optimal clusters in sufficient amount of time.
Authors and Affiliations
Siamala Devi S *
A SURVEY ON SUPERVISED METHOD FOR DETECTION OF MALWARE
The Number of Android mobile devices has been increased in recent year. There are so many approaches for detection of android malware has been proposed by using permission or source code analysis or dynamic analysis. In...
STUDY OF GASIFICATION ON THE DIFFERENT FUELS AND FUEL FEED RATE IN FLUIDIZED BED GASIFIER
An Optimum Fuel feed rate is Established in a Fluidized bed Gasifier for generating producer gas at a steady state Screw feeder system is installed that was used to feed fuel particles of different size. Steady state...
Enhanced Privacy Protection in Personalized Web Search for Sequential Background
Personalized Web Search has established to improve the quality of various search services on the Internet. Due to the tremendous data opportunities in the internet the privacy protection is very important to prese...
Electricity Generation in Double Chamber Microbial Fuel Cell with different Salts Concentration
Microbial fuel cell ( MFC ) represents a new method for electricity generation. Microbial fuel cells are devices that can use bacterial metabolism to produce an electric potential from a wide range organic substra...
Optimization of Radio Resource Allocation
This paper addresses optimization of radio resource allocation for downlink Multiple Output-Orthogonal Division Multiple Access) Systems. It has an objective of maximizing the total system capacity and proportional fai...