Obtaining the Name aliases from the Web, using them to Cluster Text Documents with Cuckoo Algorithm and Comparing Results with K-Means Algorithm
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 3
Abstract
Abstract: There is an increase in the searching where name aliases are concerned. Approximately 30 percent of searches are based on aliases; hence it becomes important to obtain correct aliases. Lexical pattern based method is used to obtain the aliases of any personal or place from the web The aliases obtained are ranked and filtered based on the co-occurrence frequency and web dice methods These final aliases are then used to cluster the text documents present in a huge database. To get the best cluster cuckoo method of clustering is used. This method is based on the reproduction system of the cuckoo bird. According to the studies this clustering method when used with levy flight concept gives the best results when huge data is concern and also outperforms particle swarm optimization algorithm and genetic algorithm. The result will be compared with the result of k-means clustering method.
Authors and Affiliations
Pornima Deshpande , Smita Chaudhari
Energy-Balanced Dispatch of Mobile Sensors in Hybrid Wireless Sensor Network with Obstacles
We consider a hybrid wireless sensor network with static and mobile nodes. Static sensors monitor the environment and report events occurring in the sensing field. Mobile sensors are then dispatched to visit these even...
Feature Based Semantic Polarity Analysis Through Ontology
Abstract: Opinion mining, a trending research area where customers feels that opinions of others are alwaysimportant for making decisions while purchasing the products. Here the problem is to collect those opinions...
Novel Malware Clustering System Based on Kernel Data Structure
Abstract : An operating system kernel is the prime of system software, responsible for the integrity and conventional computer system’s operations. Traditional malware detection approaches have based on the codecentricas...
‘A Review Study on Future Applicability of Snake Robots in India’
Abstract: In this study we to aim to present an overview of features of snake robots and their application across various fields. Snakes is blessed with a unique feature of moving over or climbing all most all kind of te...
A Survey Of Sign Based Image Copy Detection Methods
Abstract: The world wide web is filled with billions of images and redundant copies of images can frequently be found on many websites. These duplicates can be exact copies or differ slightly in their visual conten...