Ontology Based Automatic Text Mining Using TF and IDF Algorithms for Summarization of Multiple Files
Journal Title: Saudi Journal of Engineering and Technology - Year 2018, Vol 3, Issue 6
Abstract
Abstract: In the present world, due to tremendous development in technology, a huge amount of information is available everywhere. Therefore, it is difficult for the users to understand the main content of the entire document as it takes a lot of time. In this work we use extractive text summarization which uses a method to give the version of summary for one or more file or document. Here we give an approach that maps sentences to nodes of a hierarchical ontology. Ontology explains what exists in a particular domain. For the ontology creation, vocabularies are collected. It is used as background knowledge and helps to find the related meaning of the terms which occur in the source documents. Text mining is the technique from which high quality information is derived from text. Clustering is a significant task. The clustering method groups similar or related terms into a single group. In the first stage, data collection takes place. The pre-processing stage includes stemming and stop words removal.TF-IDF process occurs after which clustering takes place. In the ontology creation, first the determination of the main sub topics of the article of interest is done. We classify sentences to nodes which have a predefined hierarchical ontology. Each ontology node has bag-of-words from a web search. We represent sentences by sub trees that permit to apply measures of similarity and find relations between sentences. The ontology used in this work is not domain-specific; it does not require labelled data. this work can be extended to topics focused on summarization framework to news articles or blogs and to also to various machine learning approaches Keywords: Ontology, Text Summarization, TF-IDF, Files, Documents, Extract, Summary.
Authors and Affiliations
Chinmayee C, Meenakshi Sundaram
Effect of Additive Type and Percent on Soil Plasticity
Abstract:In the current study, the effects of three types of additive (lime, cement and cement kiln dust) on the plasticity of a soil are studied. The results of the study indicate plasticity index are affected by the ad...
Livability and Urban Quality of the SouqWaqif in Doha (State of Qatar)
Abstract: Doha, the capital city of the State of Qatar, has undergone rapid economic growth and urbanization over the past 20 years. In contrast with developed countries, where sustainable development has been implemente...
Productivity Improvement in Micro and Small EnterpriseProducing Exercise-Notebook: A Case Study
Abstract: Micro and Small Enterprises (MSEs) have potential to improve both productivity and quality. MSEs can adapt any change easily because of their dynamic nature. Productivity and Quality improvement programs in Med...
Survey: The Reliability of VANETs for Safety Applications
Abstract:Vehicular ad hoc networks (VANETs) have become a hot research area over the past few years. The main purpose of VANETs is to improve traffic safety, traffic efficiency, and driving comfort. Particularly, traffic...
Smart Home Energy Management System Using Least Square Regression Analysis
Abstract: Smart home is a residence with several electrical and electronic appliances that are capable of communicating with each other and can be controlled remotely from any room in the home or from any location in the...