Max-Min Ant System Based Web Crawler

Abstract

A focused crawler is a Web crawler that traverses the Web to collect only information related to a particular topic of interest. This study aims to find the webpages of Indian academicians on foreign university websites by selecting features of each webpage and determining its relevance on an unknown dataset. To this end, a feature selection algorithm based on the Max-Min Ant System (MMAS) is presented to improve the accuracy of the focused crawler and of the classification process. Weights are assigned to features using cosine similarity to determine the relevance of webpages. MMAS searches for the best solution and selects the best-fitting URLs from a large pool of URLs. The classification results of the proposed methodology are compared with manual classification results for the Lancaster University, Stanford University and Harvard University datasets. Performance is measured using recall.
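The abstract names three ingredients: cosine similarity for relevance, MMAS for selection, and recall for evaluation. The following is a minimal Python sketch of each, not the authors' implementation. All function names, the whitespace tokenizer, and the parameter values (`rho`, `tau_min`, `tau_max`, the deposit amount) are assumptions made for illustration; the abstract does not specify them.

```python
import math
from collections import Counter


def term_vector(text):
    """Bag-of-words term-frequency vector (hypothetical whitespace tokenizer)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(weight * b.get(term, 0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def mmas_update(pheromone, best_solution, deposit,
                rho=0.1, tau_min=0.01, tau_max=1.0):
    """One Max-Min Ant System pheromone update.

    Every trail evaporates by the factor (1 - rho); only components of the
    best solution receive a deposit; each trail is then clamped to
    [tau_min, tau_max], which is the defining feature of MMAS.
    The deposit amount and the bounds are assumed values.
    """
    updated = {}
    for component, tau in pheromone.items():
        tau = (1.0 - rho) * tau
        if component in best_solution:
            tau += deposit
        updated[component] = min(tau_max, max(tau_min, tau))
    return updated


def recall(predicted, relevant):
    """Fraction of truly relevant items that were retrieved."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    return len(set(predicted) & relevant) / len(relevant)


if __name__ == "__main__":
    topic = term_vector("indian professor faculty research")
    page = term_vector("faculty profile professor of computer science research")
    print("relevance:", cosine_similarity(topic, page))

    trails = {"url_a": 1.0, "url_b": 0.005}
    print("trails:", mmas_update(trails, {"url_a"}, deposit=0.5))

    print("recall:", recall(["url_a", "url_c"], ["url_a", "url_b"]))
```

In this sketch a URL (or feature) is a pheromone "component", and the cosine score would serve as the heuristic that guides which components the ants pick. How the original work maps features and URLs onto the ant graph is not stated in the abstract.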

Authors and Affiliations

Komal Upadhyay, Er. Suveg Moudgil

How To Cite

Komal Upadhyay, Er. Suveg Moudgil (2017). Max-Min Ant System Based Web Crawler. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 5(6), -. https://europub.co.uk/articles/-A-24652