A new approach to filtering spam SMS: Motif Patterns

Abstract

Every widely adopted technology brings problems of its own, and the Short Message Service (SMS), a mainstay of mobile communication, is no exception. The most important of these problems is spam: unwanted messages spread across the mobile network. Spam messages obstruct mobile traffic and waste recipients' time. This study proposes a novel feature extraction method for SMS spam filtering, the motif pattern method, which builds patterns from comparisons of the UTF-8 codes of characters. In the proposed method, the arrangement of the values that fall within a window of a defined size (PB) over the character codes of an SMS is treated as a motif pattern. The frequencies of these motifs in the SMS are used as the feature vector, and the set of possible motif types depends on the chosen PB. The method was tested on three benchmark datasets, with success rates of 93.76%, 90.07% and 94.29%, respectively. These results indicate that the proposed method is an effective way to extract features from SMS messages for spam filtering. The motif method may also be applicable to other text mining and natural language processing tasks.
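The abstract does not spell out how the character codes inside a window are compared, so the following Python sketch is only one plausible reading, not the authors' definition. It assumes that a motif is the sequence of comparisons (`<`, `=`, `>`) between adjacent character codes in a window of size PB, which gives 3^(PB-1) possible motif types, and that the feature vector holds the relative frequency of each type. The function name `motif_features`, the parameter `pb`, the use of Python's `ord` (Unicode code points rather than UTF-8 bytes), and the normalization are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product


def motif_features(text, pb=3):
    """Return relative motif frequencies for `text` using window size `pb`.

    Assumption (not from the paper): a motif is the string of '<', '=', '>'
    comparisons between adjacent character codes inside one window.
    """
    if pb < 2:
        raise ValueError("pb must be at least 2")

    # Character codes of the message (Unicode code points).
    codes = [ord(ch) for ch in text]

    # Fixed ordering of all 3**(pb - 1) possible motifs, so every message
    # maps to a vector of the same length.
    motifs = ["".join(p) for p in product("<=>", repeat=pb - 1)]

    counts = Counter()
    for i in range(len(codes) - pb + 1):
        window = codes[i:i + pb]
        motif = "".join(
            "<" if a < b else ">" if a > b else "="
            for a, b in zip(window, window[1:])
        )
        counts[motif] += 1

    total = sum(counts.values())
    # Messages shorter than the window yield an all-zero vector.
    return [counts[m] / total if total else 0.0 for m in motifs]
```

Under these assumptions, `motif_features(message, pb=3)` returns a nine-element vector for any message, which could then be passed to an ordinary classifier. A larger PB produces more motif types and therefore a longer, sparser vector.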

Authors and Affiliations

Yılmaz Kaya, Cüneyt Özdemir

  • EP ID EP490326
  • DOI 10.29109/http-gujsc-gazi-edu-tr.372880

How To Cite

Yılmaz Kaya, Cüneyt Özdemir (2018). A new approach to filtering spam SMS: Motif Patterns. Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji, 6(2), 436-450. https://europub.co.uk/articles/-A-490326