Temporal Condensation of Tamil News
Journal Title: Engineering and Technology Journal - Year 2021, Vol 6, Issue 7
Abstract
Since the dawn of the Internet, we have been inundated with an excess of information, and the volume of information available online is expected to grow exponentially. This creates a need for summarization and has made it one of the most sought-after topics in natural language processing. It is essential to stay informed about vital happenings, and newspapers have served this purpose for a very long time. Unfortunately, there is a perception among the general public that no news agency today can be unequivocally trusted, so the credibility of news articles is uncertain. One therefore has to read news articles from various sources to get an unbiased view of a topic. When a query related to an event is entered into a search engine such as Google, the search returns an overwhelming number of results, and it is humanly impossible to read all of them. To address these problems, a condensation of news articles covering the Tamilnadu Legislative Assembly election is performed. The news articles were collected from various news sources over a period of two months and translated from Tamil to English. Because the collected articles covered a variety of events, k-means clustering was performed on the dataset to separate the Tamilnadu-related news from the rest. The relevant news articles were then pre-processed to remove ambiguity and translation mistakes. Each article was summarized individually using a linear regression model that gave importance to features such as named entities and the number of words shared with the title. The individual summaries were then summarized together using a BERT extractive summarizer to reduce redundancy. When the generated summary was compared with the introduction of the article, or with its title in the absence of an introduction, a precision of 0.512, a recall of 0.25, and an F-measure of 0.31 were obtained.
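The abstract reports precision, recall, and F-measure but does not say how they were computed. The following is a minimal sketch of one common way to obtain such scores, assuming a unigram-overlap comparison (in the style of ROUGE-1) between a generated summary and a reference text. The function names `tokenize` and `overlap_scores` and the sample sentences are hypothetical and are not taken from the paper.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def overlap_scores(generated, reference):
    """Return (precision, recall, f_measure) from clipped unigram overlap.

    Assumption: this is a ROUGE-1-style comparison; the paper may have
    used a different matching scheme.
    """
    gen_counts = Counter(tokenize(generated))
    ref_counts = Counter(tokenize(reference))

    # Each word counts at most as often as it appears in both texts.
    overlap = sum((gen_counts & ref_counts).values())

    gen_total = sum(gen_counts.values())
    ref_total = sum(ref_counts.values())

    precision = overlap / gen_total if gen_total else 0.0
    recall = overlap / ref_total if ref_total else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Hypothetical example texts, for illustration only.
summary = "the party won the election"
reference = "the party won the state election by a wide margin"
precision, recall, f_measure = overlap_scores(summary, reference)
```

In this sketch a short summary that only uses words from the reference scores high precision but lower recall, the same pattern as the reported figures (precision 0.512, recall 0.25). If the reported values are averages over many articles, the averaged F-measure need not equal the F-measure computed from the averaged precision and recall.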
Authors and Affiliations
Shreenidhi S, Prof. Sridhar Ranganathan