Classification of Radical Web Content in Indonesia using Web Content Mining and k-Nearest Neighbor Algorithm

Journal Title: EMITTER International Journal of Engineering Technology - Year 2017, Vol 5, Issue 2

Abstract

Radical content, in its procedural meaning, is content that provokes violence and spreads hatred and anti-nationalism. The definition of radical content differs from country to country, and Indonesia is a particular case: there, radical content is most closely associated with provocation and with ethnic and religious hatred, known in Indonesian as SARA. SARA content is very difficult to detect because of its large volume, its unstructured form, and the amount of noise, which can lead to multiple interpretations. This problem can threaten unity and religious harmony. A system is therefore required that can distinguish radical from non-radical content. In this system, we propose a text mining approach that uses a DF threshold and Human Brain for feature extraction. The system is divided into several steps: data collection, which is part of preprocessing; text mining; feature selection; classification, which groups the data by class label; similarity calculation on the training data; and visualization of the radical or non-radical content. The experimental results show that combining 10-fold cross-validation with k-Nearest Neighbor (kNN) as the classification method achieves 66.37% accuracy with k = 7 [1].
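The pipeline named in the abstract (document-frequency thresholding, kNN classification, k-fold cross-validation) can be illustrated with a minimal sketch. This is not the authors' implementation: the whitespace tokenizer, the cosine similarity measure, the round-robin fold assignment, the toy documents, and all function names are assumptions made for illustration, and the "Human Brain" feature extraction step is not modeled.

```python
import math
from collections import Counter

# Hypothetical toy corpus; the paper's real data set is not reproduced here.
DOCS = ["attack them now", "hate and attack", "peace and harmony", "harmony for all"]
LABELS = ["radical", "radical", "non-radical", "non-radical"]

def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for real preprocessing)."""
    return text.lower().split()

def select_by_df(docs, min_df):
    """DF threshold: keep terms that occur in at least min_df documents."""
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))  # count each term once per document
    return sorted(t for t, n in df.items() if n >= min_df)

def vectorize(doc, vocab):
    """Term-frequency vector restricted to the selected vocabulary."""
    counts = Counter(tokenize(doc))
    return [counts[t] for t in vocab]

def cosine(a, b):
    """Cosine similarity; 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def knn_predict(train_vecs, train_labels, query, k):
    """Majority vote among the k training vectors most similar to query."""
    ranked = sorted(zip(train_vecs, train_labels),
                    key=lambda pair: cosine(pair[0], query), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

def cross_validate(docs, labels, k, folds, min_df):
    """Fraction of held-out documents classified correctly across all folds."""
    correct = 0
    for f in range(folds):
        train_idx = [i for i in range(len(docs)) if i % folds != f]
        test_idx = [i for i in range(len(docs)) if i % folds == f]
        train_docs = [docs[i] for i in train_idx]
        vocab = select_by_df(train_docs, min_df)  # fit on training folds only
        train_vecs = [vectorize(d, vocab) for d in train_docs]
        train_labels = [labels[i] for i in train_idx]
        for i in test_idx:
            pred = knn_predict(train_vecs, train_labels,
                               vectorize(docs[i], vocab), k)
            correct += pred == labels[i]
    return correct / len(docs)
```

With a corpus of realistic size, the configuration reported in the abstract would correspond to calling `cross_validate(docs, labels, k=7, folds=10, min_df=...)`; the DF threshold value is not given in the abstract, so `min_df` is left open here.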

Authors and Affiliations

Muh. Subhan, Amang Sudarsono, Ali Ridho Barakbah

Related Articles

Traffic Analysis of Quality of Service (QoS) for Video Conferencing between Main Campus and Sub Campus in Laboratory Scale

Recently, in distance learning systems, video conferencing has become one of the expected course material delivery systems for creating a virtual class, such that lecturers and students who are separated by a long distance can...

Performance Analysis of The Effect on Insertion Guide Vanes For Rectangular Elbow 90° Cross Section

The use of an elbow or curved pipe in a piping installation causes a loss of pressure (pressure drop), which could affect the power of the pump that drives the fluid and decrease the energy efficiency of the system. The pressure...

Impression Generation of Indonesian Cultural Paintings for Mobile Application with Culture Dependent Color-Impression Metric Creation Contents

A painting is a complex image reflecting the artist's observations and feelings about the environment. This condition extends the need for a painting impression generation system, since common people with lack of art experi...

Covert Communication in MIMO-OFDM System Using Pseudo Random Location of Fake Subcarriers

Multiple-Input Multiple-Output Orthogonal Frequency Division Multiplexing (MIMO-OFDM) is the most used wireless transmission scheme in the world. However, its security is an interesting problem to discuss if we want to...

Comparison of The Data-Mining Methods in Predicting The Risk Level of Diabetes

Diabetes mellitus is an illness that occurs as a consequence of an excessively high glucose level in the blood because the body cannot release or use insulin normally. The purpose of this research is to compare the two methods i...

  • EP ID EP320663
  • DOI 10.24003/emitter.v5i2.214

How To Cite

Muh. Subhan, Amang Sudarsono, Ali Ridho Barakbah (2017). Classification of Radical Web Content in Indonesia using Web Content Mining and k-Nearest Neighbor Algorithm. EMITTER International Journal of Engineering Technology, 5(2), 328-348. https://europub.co.uk/articles/-A-320663