A new approach to filtering spam SMS: Motif Patterns
Journal Title: Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji - Year 2018, Vol 6, Issue 2
Abstract
As any technology becomes widespread, it brings problems with it. The Short Message Service (SMS), which is widely used in mobile technologies, is no exception. The most important problem with SMS is unwanted messages, known as spam, that spread across the mobile network. Spam messages impede mobile traffic and keep people busy unnecessarily. In this study, a novel feature extraction method, the motif pattern method, is proposed for filtering SMS spam. It uses patterns formed by comparing the UTF-8 codes of characters. In the proposed method, the arrangement of the values that fall within a window of a defined size (PB) over the Unicode values of the SMS characters is treated as a motif pattern. The frequencies of these motifs in the SMS are used as the feature vector. The motif types depend on the specified PB. Three benchmark datasets were used to test the motif pattern method, and the success rates were 93.76%, 90.07%, and 94.29%, respectively. These results indicate that the proposed method is a successful feature extraction method for spam filtering of SMS messages. The motif method may also be applicable in other areas of text mining and natural language processing.
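The abstract does not spell out how a motif is encoded, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that a motif is the sequence of pairwise comparisons (up, down, equal) between neighbouring character codes inside a sliding window of PB characters, and that the feature vector holds the relative frequency of each possible motif. The function names `motif_of` and `motif_features`, the symbols `U`/`D`/`E`, and the use of Python's `ord` (Unicode code points rather than UTF-8 bytes) are all illustrative choices.

```python
from collections import Counter
from itertools import product


def motif_of(window):
    """Encode a window of character codes as a string of pairwise comparisons.

    Assumed encoding (not taken from the paper): 'U' if the next code is
    larger, 'D' if it is smaller, 'E' if the two codes are equal.
    """
    symbols = []
    for left, right in zip(window, window[1:]):
        if right > left:
            symbols.append("U")
        elif right < left:
            symbols.append("D")
        else:
            symbols.append("E")
    return "".join(symbols)


def motif_features(text, pb=3):
    """Return the relative frequency of every motif for window size pb."""
    if pb < 2:
        raise ValueError("pb must be at least 2 to compare neighbouring codes")

    # Unicode code points of the message characters (an assumption; the
    # abstract mentions both UTF-8 codes and Unicode values).
    codes = [ord(ch) for ch in text]

    # Slide a window of pb codes over the message and count each motif.
    counts = Counter(
        motif_of(codes[i:i + pb]) for i in range(len(codes) - pb + 1)
    )
    total = sum(counts.values())

    # Under this encoding there are 3 ** (pb - 1) motif types, so the set of
    # motif types, and hence the vector length, depends on pb.
    vocabulary = ["".join(p) for p in product("DEU", repeat=pb - 1)]
    return {m: (counts[m] / total if total else 0.0) for m in vocabulary}


if __name__ == "__main__":
    features = motif_features("WIN a FREE prize now!!!", pb=3)
    for motif, freq in features.items():
        print(motif, round(freq, 3))
```

Under these assumptions the vector has a fixed length for a given PB regardless of message length, so it can be passed directly to a standard classifier. Other encodings, such as ranking the values inside each window, would fit the abstract's description equally well.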
Authors and Affiliations
Yılmaz Kaya, Cüneyt Özdemir
Investigation of Power Generation System Driven by Wind and Sea Flow Energy
Wind and sea are renewable sources of energy with high energy potential. The sea/oceans have more than one type of energy such as wind, wave, tide, flow. The purpose of this work is to design a system that will convert m...
A Survey of Hyper-parameter Optimization Methods in Convolutional Neural Networks
Convolutional neural networks (CNNs) are a special type of multi-layer artificial neural network in which convolution is used instead of matrix multiplication in at least one layer. Although satisfactory r...
Measurement and Mapping of Long-Term and Continuous Electromagnetic Pollution Levels in a Selected Pilot Region
Mobile phones, mobile devices, the number of users in our daily lives and usage times are increasing rapidly. This rapid increase in both telephone conversations, mobile internet use and mobile systems as well as the rap...
Investigation of Single Bolted Connections of Thin Wall U Steel Profiles with Eccentric Behaviour
There is no Turkish standard for bolted connections of thin-walled steel elements. Calculations made according to the Eurocode and AISI standards take different approaches and predict different behaviours for the same sample. The...
Tent Map based Optimization Method
In real life, some problems cannot be solved using mathematical methods. Meta-heuristic optimization methods are usually used to solve these problems. One of the solutions used to increase the performance of meta-heu...