An Adaptive Classification approach to filter spam E-mail using Vector Space Model

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 8

Abstract

Most previous data mining studies have concentrated on structured data, such as relational, transactional, and data warehouse data. In practice, however, a substantial share of the available information is stored in text databases, which consist of large collections of web documents from various sources, such as news articles, research papers, e-books, digital libraries, e-mails, and Web pages. Moreover, this volume keeps growing and is already on the order of terabytes. Among the many services the Internet provides, e-mail is one of the most useful and widely used, and spam is a problem closely tied to it. Among the various approaches developed to stop spam e-mail, filtering is an important and popular one. In this paper, we categorize incoming e-mail as spam or non-spam using the KNNC classification method with a Vector Space Model, applied in an adaptive manner to improve accuracy. To measure the accuracy of spam classification, we used two datasets, a personal dataset and the Ling Spam Corpus (Lemm dataset), and applied KNNC classification to both. With the adaptive approach, we obtained nearly 95% precision for spam and 86.6% precision for non-spam, with an accuracy of 83% on the personal dataset and 80% on the Lemm dataset. Based on these results and a review of related work, we propose that an adaptive approach using the vector space model with the KNNC classification method provides better accuracy for filtering spam mail on both smaller and larger datasets.
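
The general idea of nearest-neighbour classification over a vector space model can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `KnnSpamFilter`, the whitespace tokenizer, the smoothed TF-IDF weighting, the similarity-weighted vote, and the toy messages are all assumptions made for illustration. The abstract does not define "adaptive"; here it is assumed to mean that newly labeled messages are added to the stored examples over time.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


class KnnSpamFilter:
    """Toy k-nearest-neighbour classifier over TF-IDF vectors (hypothetical)."""

    def __init__(self, k=3):
        self.k = k
        self.docs = []        # stored examples as (term counts, label)
        self.df = Counter()   # document frequency of each term

    def learn(self, text, label):
        """Store one labeled message. Calling this after deployment is the
        'adaptive' step assumed in this sketch."""
        counts = Counter(tokenize(text))
        self.docs.append((counts, label))
        self.df.update(counts.keys())

    def _vector(self, counts):
        """Weight raw term counts by a smoothed inverse document frequency."""
        n = len(self.docs)
        return {
            term: count * (math.log((1 + n) / (1 + self.df[term])) + 1.0)
            for term, count in counts.items()
        }

    @staticmethod
    def _cosine(a, b):
        """Cosine similarity between two sparse vectors stored as dicts."""
        dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
        norm_a = math.sqrt(sum(w * w for w in a.values()))
        norm_b = math.sqrt(sum(w * w for w in b.values()))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

    def predict(self, text):
        """Label a message by a similarity-weighted vote of its k nearest
        stored examples (assumes at least one example has been learned)."""
        query = self._vector(Counter(tokenize(text)))
        scored = sorted(
            ((self._cosine(query, self._vector(counts)), label)
             for counts, label in self.docs),
            key=lambda pair: pair[0],
            reverse=True,
        )
        votes = Counter()
        for similarity, label in scored[: self.k]:
            votes[label] += similarity
        return votes.most_common(1)[0][0]


# Toy training data, invented for this example.
TRAINING = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("cheap pills free offer", "spam"),
    ("meeting agenda for monday", "ham"),
    ("please review the project report", "ham"),
    ("lunch on monday with the team", "ham"),
]

spam_filter = KnnSpamFilter(k=3)
for text, label in TRAINING:
    spam_filter.learn(text, label)

print(spam_filter.predict("claim your free money"))
print(spam_filter.predict("project meeting on monday"))
```

Calling `spam_filter.learn(...)` again with a freshly labeled message extends the stored examples without retraining, which is one reason nearest-neighbour methods suit an incremental setting. A real filter would also need stemming or lemmatization, stop-word handling, and evaluation on held-out mail.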

Authors and Affiliations

Nakul Dave, Uttam Chauhan and Avani Dave

How To Cite

Nakul Dave, Uttam Chauhan and Avani Dave (2012). An Adaptive Classification approach to filter spam E-mail using Vector Space Model. International Journal of Management, IT and Engineering, 2(8), -. https://europub.co.uk/articles/-A-18502