Using Fuzzy Clustering Powered by Weighted Feature Matrix to Establish Hidden Semantics in Web Documents

Abstract

Digital Data is growing exponentially exploding on the 'World Wide Web'. The orthodox clustering algorithms obligate various challenges to tackle, of which the most often faced challenge is the uncertainty. Web documents have become heterogeneous and very complex. There exist multiple relations between one web document and others in the form of entrenched links. This can be imagined as a one to many (1-M) relationships, for example, a particular web document may fit in many cross domains viz. politics, sports, utilities, technology, music, weather forecasting, linked to ecommerce products, etc. Therefore, there is a necessity for efficient, effective and constructive context driven clustering methods. Orthodox or the already well-established clustering algorithms adhere to classify the given data sets as exclusive clusters. Signifies that we can clearly state whether to which cluster an object belongs to. But such a partition is not sufficient for representing in the real time. So, a fuzzy clustering method is presented to build clusters with indeterminate limits and allows that one object belongs to overlying clusters with some membership degree. In supplementary words, the crux of fuzzy clustering is to contemplate the fitting status to the clusters, as well as to cogitate to what degree the object belongs to the cluster. The aim of this study is to device a fuzzy clustering algorithm which along with the help of feature weighted matrix, increases the probability of multi-domain overlapping of web documents. Over-lapping in the sense that one document may fall into multiple domains. The use of features gives an option or a filter on the basis of which the data would be extracted through the document. Matrix allows us to compute a threshold value which in turn helps to calculate the clustering result.

Authors and Affiliations

Pramod D Patil, Parag Kulkarni

Keywords

Related Articles

An Automatic Multiple Sclerosis Lesion Segmentation Approach based on Cellular Learning Automata

Multiple Sclerosis (MS) is a demyelinating nerve disease which for an unknown reason assumes that the immune system of the body is affected, and the immune cells begin to destroy the myelin sheath of nerve cells. In Path...

AODV Robust (AODVR): An Analytic Approach to Shield Ad-hoc Networks from Black Holes 

Mobile ad-hoc networks are vulnerable to several types of malicious routing attacks, black hole is one of those, where a malicious node advertise to have the shortest path to all other nodes in the network by the means o...

Security Issues in Cloud Computing and their Solutions: A Review

Cloud computing is an internet-based, emerging technology, tends to be prevailing in our environment especially computer science and information technology fields which require network computing on large scale. Cloud com...

Semantic Similarity Calculation of Chinese Word

This paper puts forward a two layers computing method to calculate semantic similarity of Chinese word. Firstly, using Latent Dirichlet Allocation (LDA) subject model to generate subject spatial domain. Then mapping word...

Real Time Analysis of Crowd Behaviour for Automatic and Accurate Surveillance

Surveillance in this modern era is a necessity. Creating an alert in case of emergencies and disturbances is of very much importance. As the number of simultaneous camera feeds increase, burden on human supervisor also i...

Download PDF file
  • EP ID EP376518
  • DOI 10.14569/IJACSA.2018.090864
  • Views 104
  • Downloads 0

How To Cite

Pramod D Patil, Parag Kulkarni (2018). Using Fuzzy Clustering Powered by Weighted Feature Matrix to Establish Hidden Semantics in Web Documents. International Journal of Advanced Computer Science & Applications, 9(8), 503-514. https://europub.co.uk/articles/-A-376518