Automatic Representative News Generation using On-Line Clustering
Journal Title: EMITTER International Journal of Engineering Technology - Year 2013, Vol 1, Issue 1
Abstract
The increasing number of online news provider has produced large volume of news every day. The large volume can bring drawback in consuming information efficiently because some news contain similar contents but they have different titles that may appear. This paper presents a new system for automatically generating representative news using on-line clustering. The system allows the clustering to be dynamic with the features of centroid update and new cluster creation. Text mining is implemented to extract the news contents. The representative news is obtained from the closest distance to each centroid that calculated using Euclidean distance. For experimental study, we implement our system to 460 news in Bahasa Indonesia. The experiment performed 70.9% of precision ratio. The error is mainly caused by imprecise results from keyword extraction that generates only one or two keywords for an article. The distribution of centroid’s keywords also affects the clustering results.
Authors and Affiliations
Marlisa Sigita, Ali Ridho Barakbah, Entin Martiana Kusumaningtyas, Idris Winarno
Application of Artificial Neural Networks in Modeling Direction Wheelchairs Using Neurosky Mindset Mobile (EEG) Device
The implementation of Artificial Neural Network in prediction the direction of electric wheelchair from brain signal input for physical mobility impairment.. The control of the wheelchair as an effort in improving disabl...
Secure Communication and Information Exchange using Authenticated Ciphertext Policy Attribute-Based Encryption in Mobile Ad-hoc Network
MANETs are considered as suitable for commercial applications such as law enforcement, conference meeting, and sharing information in a student classroom and critical services such as military operations, disaster relief...
Moment Invariant Features Extraction for Hand Gesture Recognition of Sign Language based on SIBI
SIBI in Indonesian known as standard of Indonesian sign language. To help deaf people Myo Armband becomes an immersive technology for communication each other. The problem on Myo sensor is unstable clock rate. It causes...
Hybrid Modeling KMeans – Genetic Algorithms in the Health Care Data
K-Means is one of the major algorithms widely used in clustering due to its good computational performance. However, K-Means is very sensitive to the initially selected points which randomly selected, and therefore it do...
Modified GTS Allocation Scheme for IEEE 802.15.4
IEEE 802.15.4 standard is widely used in wireless personal area networks (WPANs). The devices transmit data during two periods: contention access period (CAP) by accessing the channel using CSMA/CA and contention free pe...