Automatic Representative News Generation using On-Line Clustering

Journal Title: EMITTER International Journal of Engineering Technology - Year 2013, Vol 1, Issue 1

Abstract

The growing number of online news providers produces a large volume of news every day. This volume makes it hard to consume information efficiently, because many articles carry similar content under different titles. This paper presents a new system for automatically generating representative news using on-line clustering. The system keeps the clustering dynamic through centroid updates and new cluster creation. Text mining is used to extract the news contents. The representative news item of each cluster is the article closest to the cluster centroid, measured by Euclidean distance. For the experimental study, we applied the system to 460 news articles in Bahasa Indonesia. The experiment achieved a precision of 70.9%. The errors are mainly caused by imprecise keyword extraction, which produces only one or two keywords for an article. The distribution of the centroids' keywords also affects the clustering results.
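
The clustering idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how articles are turned into vectors or when a new cluster is created, so the keyword-weight vectors, the distance `threshold`, the running-mean centroid update, and the function names `online_cluster` and `representatives` are all assumptions made for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def online_cluster(vectors, threshold):
    """Assign vectors to clusters one at a time.

    Assumed rule: a vector joins its nearest centroid when the distance is
    at most `threshold`; otherwise it starts a new cluster.
    """
    centroids = []  # one centroid vector per cluster
    members = []    # indices of the vectors assigned to each cluster
    for idx, vec in enumerate(vectors):
        best = None
        if centroids:
            dists = [euclidean(vec, c) for c in centroids]
            nearest = min(range(len(dists)), key=dists.__getitem__)
            if dists[nearest] <= threshold:
                best = nearest
        if best is None:
            # New cluster creation: the vector becomes its own centroid.
            centroids.append(list(vec))
            members.append([idx])
        else:
            # Centroid update: incremental mean over the cluster members.
            members[best].append(idx)
            n = len(members[best])
            centroids[best] = [c + (x - c) / n
                               for c, x in zip(centroids[best], vec)]
    return centroids, members


def representatives(vectors, centroids, members):
    """Index of the member closest to each centroid."""
    return [min(m, key=lambda i: euclidean(vectors[i], c))
            for c, m in zip(centroids, members)]


# Hypothetical keyword-weight vectors for six articles.
articles = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.9, 0.2],
    [0.8, 0.0, 0.0],
    [0.0, 1.0, 0.1],
]
centroids, members = online_cluster(articles, threshold=0.5)
reps = representatives(articles, centroids, members)
print(members)  # cluster membership by article index
print(reps)     # representative article index per cluster
```

The threshold controls how readily new clusters appear: a small value splits similar articles into separate clusters, while a large value merges unrelated ones, so it would need tuning on real data.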

Authors and Affiliations

Marlisa Sigita, Ali Ridho Barakbah, Entin Martiana Kusumaningtyas, Idris Winarno

Related Articles

The rSPA Processes of River Water-quality Analysis System for Critical Contaminate Detection, Classification Multiple-water-quality-parameter Values and Real-time Notification

The water quality analysis is one of the most important aspects of designing environmental systems. It is necessary to realize detection and classification processes and systems for water quality analysis. The important...

Determination of Nearest Emergency Service Office using Haversine Formula Based on Android Platform

Emergency Reporting Application is an Android-based application that serves to help the community in reporting the emergency condition. This application allows users to choose and contact the emergency services office, w...

Differential Spatio-temporal Multiband Satellite Image Clustering using K-means Optimization With Reinforcement Programming

Deforestation is one of the crucial issues in Indonesia because Indonesia now has the world's highest deforestation rate. On the other hand, multispectral image delivers a great source of data for studying spatial and temporal...

The Design of Terrestrial Trunked Radio (TETRA) Communication System at Juanda Airport

Nowadays the application of wireless communication systems in the airport area is very important, as it is used to support the services and safety of people. In the beginning, communication was done using Handy Talkie...

Smart I’rab: Smart Aplicasion for Arabic Grammar Learning

Arabic grammar, known as nahwu, is necessary to comprehend the Holy Qur’an, which is written entirely in Arabic. However, many people have trouble studying this skill because there are various kinds of word formation and...

  • EP ID EP170927
  • DOI 10.24003/emitter.v1i1.11

How To Cite

Marlisa Sigita, Ali Ridho Barakbah, Entin Martiana Kusumaningtyas, Idris Winarno (2013). Automatic Representative News Generation using On-Line Clustering. EMITTER International Journal of Engineering Technology, 1(1), 107-114. https://europub.co.uk/articles/-A-170927