A Novel Clustering Method for Similarity Measuring in Text Documents

Journal Title: International Journal of Modern Engineering Research (IJMER) - Year 2013, Vol 3, Issue 5

Abstract

 Clustering is the process of grouping data into subsets in such a manner that identical instances are collected together, while different instances belong to different groups. The instances are thereby arranged into an efficient depiction that characterizes the populace that is being sampled. A general move towards the clustering process is to treat it as an optimization process. A best partition is found by optimizing an exacting function of similarity, or distance, among data. Basically, there is a hidden assumption that the true inherent structure of data could be correctly describe by using the similarity formula defined and fixed in the clustering decisive factor. In this paper, we introduce clustering with multi- view points based on different similarity measures. The multi- view point approach to learning is one in which we have ‘views’ of the data (sometimes in a rather abstract sense) and the goal is to use the relationship between these views to alleviate the difficulty of a learning problem of interest.

Authors and Affiliations

Preethi Priyanka Thella

Keywords

Related Articles

 Development of a Integrated Air Cushioned Vehicle (Hovercraft)

 The design and development of a hovercraft prototype with full hovercraft basic functions is reported by taking into consideration, size, material and component availability and intermediate fabrication skill....

 Optimal Converge cast Methods for Tree- Based WSNs

 A tree- based wireless sensor network (WSN) is a collection of sensors nodes, such as sink is the root of tree and leaves are the nodes. Data in such a topology flows from sensor nodes (leaves) to the sink (root) n...

 Voltage Support and Reactive Power Control in Micro-grid using DG

  Distribution Generators(DGs) are the renewable energy resource which can be connected to the grid. When it is connected to the grid it should be operated with controlled voltage and reactive power control. And in...

 Anomaly Detection Using Generic Machine Learning Approach With a Case Study of Awareness

 Abstract: Security of computer systems and information in flow is essential to acceptance for every network user utilities Now the standalone computer and internets are exposed to an increasing number of security t...

 An Overview of Distributed Generation

 The Power Generated in Karnataka(INDIA) is 7445.91MW and Demand is 8500MWwhich causes the problem of Load shedding, many states face this problem and are forced to buy the power from other states which leads to t...

Download PDF file
  • EP ID EP87814
  • DOI -
  • Views 124
  • Downloads 0

How To Cite

Preethi Priyanka Thella (2013).  A Novel Clustering Method for Similarity Measuring in Text Documents. International Journal of Modern Engineering Research (IJMER), 3(5), 2823-2826. https://europub.co.uk/articles/-A-87814