slugComparative Study on Email Spam Classifier Using Feature Selection Techniques

Abstract

Now a days e-mail is very popular because the way of communication method is very easy and due to these reasons some advertisers and social networks sent messages,for advertising their product,that are unwanted for users and not requested by the users, these messages are called spam messages. Sometimes these spam messages are harmful for system after clicking the link which is sent by the spammer or advertisement companies. It takes lots of time for deleting and also occupy memory for storage. Due to these we need filtering the messages and this is the important task for separating the genuine messages from the junk messages. Even though number of researchers are uses different classification techniques for classify the spam, still 100% accuracy in spam classification are question mark, In this paper, we used spam data set which are collected from UCI repository. Initially, various classification algorithms are applied over this dataset using CLEMENTINE data mining tool. This data set is divided into two parts one is training data set and the other one is testing data set. After that most of the data is used for training and a smaller portion of the data is used for testing. After a model has been processed by using the training set, we test the model and identify the results. This process is done by different data set. Finally, best classifier for email spam i.e neural network is identified based on the Training and testing accuracy of various models. In this thesis we also use feature selection method selecting features of spam data set which removes the redundant , irrelevant and noisy data. It improves the data quality and also increase the accuracy of the resulting models.

Authors and Affiliations

Yukti Kesharwani, Shrikant Lade, Dayashakar Pandey

Keywords

Related Articles

A Study of Data Mining Tools in Knowledge Discovery Process

Data mining, the extraction of hidden predictive information from large databases, is a powerful new technology with great potential to help companies focus on the most important information in their data warehouses. It...

A Rule Based Punjabi Dialect Conversion System

Recently there is dramatically increase in availability of informal dialectal language on web. People are sharing their ideas, thoughts and emotions, usually in the form of blogs and partially informal articles across t...

Experimental Investigation to Increase the Efficiency of the Air Compressor by Changing the Coolants in Inter Cooler

The compressed air usage is increasing fastly now days. But the efficiency of compressor is low due to the many reasons like location, elevation, length of pipe lines, inter cooler performance, even atmosphere condition...

A Review on Use of Scrap Tyre Rubber Wastes in Concrete

In the whole world the volume of tyre rubber wastes is increasing at a fast rate .An estimated 1000 million tyres reach the end of their useful lives every year and 5000 millions more are expected to be discarded in a r...

Design and Implementation of a Single Stage AC-DC Energy Harvesting Converter for Microgenerator

In this project, a direct ac-dc power electronic convertor topology is proposed for efficient and optimum energy harvesting from low – voltage micro generators. The convertor utilizes the bidirectional current - conduct...

Download PDF file
  • EP ID EP18343
  • DOI -
  • Views 328
  • Downloads 12

How To Cite

Yukti Kesharwani, Shrikant Lade, Dayashakar Pandey (2014). slugComparative Study on Email Spam Classifier Using Feature Selection Techniques. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 2(6), -. https://europub.co.uk/articles/-A-18343