An Emperical Study of Clustering Algorithms to extract Knowledge from PubMed Articles

Journal Title: Transactions on Machine Learning and Artificial Intelligence - Year 2017, Vol 5, Issue 3

Abstract

Extraction of useful information from biomedical literature is one of the thrust for the world nowadays due to availability of almost articles on the web in electronic form. Information retrieval (IR) from biomedical literature is finding useful patterns from the unstructured text corpus that satisfies information. In this paper intelligent text analysis is carried out on PubMed articles related to influenza virus. In this context, various algorithms are discussed to reveal the information from PubMed articles, like year wise count of articles containing influenza virus related terms (viz. H1N1, H5N1, and H7N1 etc.), countries with their publication count, which tells about the outbreaks of the diseases in these countries. The articles may be grouped by searching the keyword �influenza virus strain� pattern with the help of regular expressions. Automatic text categorization is another challenging issue for text mining. We applied k-means, fuzzy C-means, and fuzzy C-shell algorithm for automatic categorization of text articles. The association between words based on their cooccurrence is computed which further helps to categorize the documents based on their cooccurrences. The basic k-means clustering algorithm is first applied to cluster the documents, and then to handle the fuzzy nature of words which may belong to more than one cluster, fuzzy c-means clustering is applied to form more accurate clusters. As Fuzzy c-means method clusters the documents which are in linear spaces but not in the circle, spherical, or ellipsoidal spaces. A new method is proposed here, which considers the clusters of documents in the radius of the circle.K

Authors and Affiliations

Deepak Agnihotri, Kesari Verma, Priyanka Tripathi

Keywords

Related Articles

Building An Automatic Speech Recognition System for Home Automation

This paper presents a study on automatic speech recognition (ASR) systems applied to home automation. So a detailed study of the architecture of speech recognition systems was carried out. The objective is to select a sp...

Dialogue Based Decision Making in Online Trading

Software agents, acting on behalf of humans, have been identified as an important solution for future electronic markets. Such agents can make their own decisions based on given prior preferences and the market environme...

Creatinine, Urea and Uric Acid in Hospitalized Patients with and Without Hyperglycemia Analysis using Generalized Additive Model

Hyperglycemia is an important risk factor for heart disease and premature mortality. In hospitalized patients, it is related to an increase in morbidity and development of other disease like kidney disease. To evaluate t...

A Model Driven Architecture Approach to Generate Multidimensional Schemas of Data Warehouses

Over the past decade, the concept of data warehousing has been widely accepted. The main reason for building data warehouses is to improve the quality of information in order to achieve specific business objectives such...

Computing on Encrypted Data into the Cloud though Fully Homomorphic Encryption

Securing Data in the cloud based on Fully Homomorphic Encryption (FHE) is a new and potential form of security that allows computing on encrypted data without decrypted it first. However, a practical FHE solution is not...

Download PDF file
  • EP ID EP275516
  • DOI 10.14738/tmlai.53.3106
  • Views 66
  • Downloads 0

How To Cite

Deepak Agnihotri, Kesari Verma, Priyanka Tripathi (2017). An Emperical Study of Clustering Algorithms to extract Knowledge from PubMed Articles. Transactions on Machine Learning and Artificial Intelligence, 5(3), 13-27. https://europub.co.uk/articles/-A-275516