Classification of Radical Web Content in Indonesia using Web Content Mining and k-Nearest Neighbor Algorithm

Journal Title: EMITTER International Journal of Engineering Technology - Year 2017, Vol 5, Issue 2

Abstract

In a procedural sense, radical content is content that provokes violence and spreads hatred and anti-nationalism. The definition of radical content differs from country to country. In Indonesia, it is most closely associated with provocation and with ethnic and religious hatred, referred to in Indonesian as SARA. SARA content is very difficult to detect because of its large volume, its unstructured form, and the amount of noise, which can lead to multiple interpretations. This problem can threaten religious unity and harmony. Given this situation, a system is needed that can distinguish radical content from non-radical content. We propose a text mining approach that uses a document frequency (DF) threshold and Human Brain for feature extraction. The system is divided into several steps: data collection (included in the preprocessing part), text mining, feature selection, classification to group the data by class label, similarity calculation against the training data, and visualization of radical and non-radical content. The experimental results show that combining 10-fold cross-validation with k-Nearest Neighbor (kNN) as the classification method achieves 66.37% accuracy with k = 7 [1].
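The steps named in the abstract (DF-threshold feature selection, similarity calculation against training data, kNN classification, and cross-validation) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The abstract does not state the term weighting or the similarity measure, so raw term frequency and cosine similarity are assumptions here, and the toy documents, labels, and all function names are hypothetical.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase and split on non-letters (a stand-in for real preprocessing)."""
    return re.findall(r"[a-z]+", text.lower())

def select_by_df(docs, min_df):
    """DF threshold: keep terms that occur in at least `min_df` documents."""
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {term for term, count in df.items() if count >= min_df}

def vectorize(text, vocab):
    """Term-frequency vector restricted to the selected vocabulary."""
    return Counter(t for t in tokenize(text) if t in vocab)

def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def knn_predict(query, train_vecs, train_labels, k):
    """Majority vote among the k training vectors most similar to `query`."""
    ranked = sorted(range(len(train_vecs)),
                    key=lambda i: cosine(query, train_vecs[i]),
                    reverse=True)
    votes = Counter(train_labels[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]

def cross_validate(docs, labels, k, folds, min_df):
    """Fold-wise accuracy; features are selected on the training folds only."""
    correct = 0
    for f in range(folds):
        test_idx = [i for i in range(len(docs)) if i % folds == f]
        train_idx = [i for i in range(len(docs)) if i % folds != f]
        vocab = select_by_df([docs[i] for i in train_idx], min_df)
        train_vecs = [vectorize(docs[i], vocab) for i in train_idx]
        train_labels = [labels[i] for i in train_idx]
        for i in test_idx:
            pred = knn_predict(vectorize(docs[i], vocab),
                               train_vecs, train_labels, k)
            correct += pred == labels[i]
    return correct / len(docs)

# Hypothetical toy corpus; the paper's data are Indonesian web pages.
DOCS = [
    "provoke hatred violence group",
    "violence hatred provoke enemy",
    "hatred enemy violence attack",
    "provoke attack hatred group",
    "weather market football news",
    "football news market prices",
    "market prices weather report",
    "news report football weather",
]
LABELS = ["radical"] * 4 + ["non-radical"] * 4

accuracy = cross_validate(DOCS, LABELS, k=3, folds=4, min_df=2)
```

The abstract reports k = 7 with 10-fold cross-validation. The toy corpus is far too small for those settings, so the call above uses k = 3 and 4 folds purely to keep the example runnable.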

Authors and Affiliations

Muh. Subhan, Amang Sudarsono, Ali Ridho Barakbah

  • EP ID EP320663
  • DOI 10.24003/emitter.v5i2.214

How To Cite

Muh. Subhan, Amang Sudarsono, Ali Ridho Barakbah (2017). Classification of Radical Web Content in Indonesia using Web Content Mining and k-Nearest Neighbor Algorithm. EMITTER International Journal of Engineering Technology, 5(2), 328-348. https://europub.co.uk/articles/-A-320663