Automatic Representative News Generation using On-Line Clustering

Journal Title: EMITTER International Journal of Engineering Technology - Year 2013, Vol 1, Issue 1

Abstract

The increasing number of online news provider has produced large volume of news every day. The large volume can bring drawback in consuming information efficiently because some news contain similar contents but they have different titles that may appear. This paper presents a new system for automatically generating representative news using on-line clustering. The system allows the clustering to be dynamic with the features of centroid update and new cluster creation. Text mining is implemented to extract the news contents. The representative news is obtained from the closest distance to each centroid that calculated using Euclidean distance. For experimental study, we implement our system to 460 news in Bahasa Indonesia. The experiment performed 70.9% of precision ratio. The error is mainly caused by imprecise results from keyword extraction that generates only one or two keywords for an article. The distribution of centroid’s keywords also affects the clustering results.

Authors and Affiliations

Marlisa Sigita, Ali Ridho Barakbah, Entin Martiana Kusumaningtyas, Idris Winarno

Keywords

Related Articles

Application of Artificial Neural Networks in Modeling Direction Wheelchairs Using Neurosky Mindset Mobile (EEG) Device

The implementation of Artificial Neural Network in prediction the direction of electric wheelchair from brain signal input for physical mobility impairment.. The control of the wheelchair as an effort in improving disabl...

Secure Communication and Information Exchange using Authenticated Ciphertext Policy Attribute-Based Encryption in Mobile Ad-hoc Network

MANETs are considered as suitable for commercial applications such as law enforcement, conference meeting, and sharing information in a student classroom and critical services such as military operations, disaster relief...

Moment Invariant Features Extraction for Hand Gesture Recognition of Sign Language based on SIBI

SIBI in Indonesian known as standard of Indonesian sign language. To help deaf people Myo Armband becomes an immersive technology for communication each other. The problem on Myo sensor is unstable clock rate. It causes...

Hybrid Modeling KMeans – Genetic Algorithms in the Health Care Data

K-Means is one of the major algorithms widely used in clustering due to its good computational performance. However, K-Means is very sensitive to the initially selected points which randomly selected, and therefore it do...

Modified GTS Allocation Scheme for IEEE 802.15.4

IEEE 802.15.4 standard is widely used in wireless personal area networks (WPANs). The devices transmit data during two periods: contention access period (CAP) by accessing the channel using CSMA/CA and contention free pe...

Download PDF file
  • EP ID EP170927
  • DOI 10.24003/emitter.v1i1.11
  • Views 134
  • Downloads 0

How To Cite

Marlisa Sigita, Ali Ridho Barakbah, Entin Martiana Kusumaningtyas, Idris Winarno (2013). Automatic Representative News Generation using On-Line Clustering. EMITTER International Journal of Engineering Technology, 1(1), 107-114. https://europub.co.uk/articles/-A-170927