An Adaptive Classification approach to filter spam E-mail using Vector Space Model

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 8

Abstract

Most previous data mining studies have concentrated on structured data, such as relational, transactional, and data warehouse data. In practice, however, a substantial portion of the available information is stored in text databases, which consist of large collections of documents from various sources, such as news articles, research papers, e-books, digital libraries, e-mails, and Web pages. Moreover, this body of text keeps growing and is already on the order of terabytes. Among the many services the Internet provides, e-mail is one of the most useful and widely used, and spam is a problem closely tied to it. Among the various approaches developed to stop spam e-mail, filtering is an important and popular one. In this paper, we categorize incoming e-mail as spam or non-spam using the KNNC classification method with a Vector Space Model, applied in an adaptive manner to improve accuracy. To measure spam classification accuracy, we applied KNNC classification to two datasets: a personal dataset and the Ling Spam Corpus (Lemm dataset). With the adaptive approach, we obtained nearly 95% precision on spam and 86.6% precision on non-spam, with an accuracy of 83% on the personal dataset and 80% on the Lemm dataset. Based on these results and a review of related work, we propose that an adaptive approach using the Vector Space Model with KNNC classification provides better accuracy for filtering spam e-mail on both smaller and larger datasets.
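The general idea of nearest-neighbour classification over a vector space model can be illustrated with a minimal sketch. This is not the authors' implementation: the TF-IDF weighting, the cosine similarity measure, the value k=3, the toy messages, and every function name below are assumptions chosen for illustration, and the abstract does not specify how the adaptive step works, so it is not modelled here.

```python
import math
from collections import Counter

def tokenize(text):
    # Naive whitespace tokenizer (assumption; a real filter would also lemmatize).
    return text.lower().split()

def build_idf(token_lists):
    # Smoothed inverse document frequency for every term seen in training.
    n = len(token_lists)
    df = Counter()
    for tokens in token_lists:
        df.update(set(tokens))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}

def vectorize(tokens, idf):
    # Sparse TF-IDF vector as a dict; terms unseen in training are dropped.
    tf = Counter(t for t in tokens if t in idf)
    return {term: count * idf[term] for term, count in tf.items()}

def cosine(a, b):
    # Cosine similarity between two sparse vectors; 0.0 if either is empty.
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def knn_classify(text, training, idf, k=3):
    # Majority vote among the k training vectors most similar to the query.
    query = vectorize(tokenize(text), idf)
    scored = sorted(
        ((cosine(query, vec), label) for vec, label in training),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]

# Hypothetical toy messages, not taken from either dataset named in the abstract.
labelled = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("cheap loans win cash", "spam"),
    ("meeting agenda for monday", "nonspam"),
    ("project report attached", "nonspam"),
    ("lunch on monday with team", "nonspam"),
]
token_lists = [tokenize(text) for text, _ in labelled]
idf = build_idf(token_lists)
training = [(vectorize(tokens, idf), label)
            for tokens, (_, label) in zip(token_lists, labelled)]

print(knn_classify("claim your free money", training, idf))
print(knn_classify("monday project meeting", training, idf))
```

Each message becomes a weighted term vector, and a new message takes the majority label of its k most similar training messages.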

Authors and Affiliations

Nakul Dave, Uttam Chauhan and Avani Dave

How To Cite

Nakul Dave, Uttam Chauhan and Avani Dave (2012). An Adaptive Classification approach to filter spam E-mail using Vector Space Model. International Journal of Management, IT and Engineering, 2(8), -. https://europub.co.uk/articles/-A-18502