Pattern Recognition for Finding Similarity of Webpages

Abstract

We proposed a functional technique for identifying similar Web pages that is based on measuring tree similarity. In this paper we introduce an experiment with two methods for evaluating the similarity of web pages. The results of these methods can be used in different ways for the reordering and clustering a web page set. Both of these methods belong to the field web content mining. The first method is purely focused on the similarity of web pages. This method segments web pages and compares their layouts based on the image processing and graph matching. The second is based on detecting of objects that result from the user point of view on the web page. The similarity of web page is measured as an object match on the analyzed web pages. The key idea behind the method is to transform each Web page into a compressed, normalized tree that effectively represents its visual structure.

Authors and Affiliations

N. Pughazendi , G . Pattusamy

Keywords

Related Articles

FPGA based implementation of Interoperability of Wireless mesh Network and Wi-Fi

Wireless Mesh Networks (WMNs) is a key technology for next generation wireless networks, showing rapid progress and many new inspiring applications. IEEE 802.11s is the standard defined for WLAN mesh networks.One importa...

Implication of Cell Phone Usage on Study Patterns of Teens

Cell phone usage has become worldwide commodity for every person regardless of their ages. Over the years, teenagers within the age bracket of thirteen to nineteen are more vulnerable towards the use of the technology in...

 Detecting and Alerting Tcp –Ip Packets againt TCP SYN attacks

 Transmission Control Protocol Synchronized ( TCP SYN) Flood has become a problem to the network management to maintain the network server from being attacked by the malicious attackers. Possibly one of the problems...

 DAEMON Decisional Access in Emission Mechanism Of Networks

 The Decisional access method of mining is a process of retrieving data with some assumption decision from the data bases which are inter connected with databases Many of the organizations are providing design of th...

 Implementation and Analysis of Modified Double Precision Interval Arithmetic Array Multiplication

 This paper presents the design of a 64 bit array multiplier that performs interval multiplication. This multiplier requires carry save adders instead of full adders that reduces the delay i n r e s p e c t o f co...

Download PDF file
  • EP ID EP99137
  • DOI -
  • Views 155
  • Downloads 0

How To Cite

N. Pughazendi, G . Pattusamy (2013). Pattern Recognition for Finding Similarity of Webpages. International Journal of Computer & organization Trends(IJCOT), 3(4), 134-140. https://europub.co.uk/articles/-A-99137