An Adaptive Classification approach to filter spam E-mail using Vector Space Model

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 8

Abstract

Most previous data mining studies have concentrated on structured data, such as relational, transactional, and data warehouse data. In practice, however, a substantial portion of the available information is stored in text databases, which consist of large collections of documents from various sources, such as news articles, research papers, e-books, digital libraries, e-mails, and Web pages. Moreover, this information keeps growing and is now on the order of terabytes in size. Among the many services the Internet provides, e-mail is one of the most useful and widely used, and spam is a problem closely tied to it. Among the various approaches developed to stop spam e-mail, filtering is an important and popular one. In this paper, we categorize incoming e-mail as spam or non-spam using KNNC classification with the Vector Space Model, applied in an adaptive manner to improve accuracy. To measure spam classification accuracy, we applied KNNC classification to two datasets: a personal dataset and the Ling Spam Corpus (Lemm dataset). Using the adaptive approach, we obtained nearly 95% precision for spam and 86.6% precision for non-spam, with an accuracy of 83% on the personal dataset and 80% on the Lemm dataset. Based on these results and a review of related work, we propose that the adaptive approach using the Vector Space Model with KNNC classification filters spam e-mail accurately for both smaller and larger datasets.
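The general idea of nearest-neighbor classification in a vector space can be sketched as follows. This is a minimal illustration, not the authors' implementation: it reads KNNC as k-nearest-neighbor classification, and the TF-IDF weighting, cosine similarity, whitespace tokenizer, and the `VectorSpaceKNN` class are all assumptions chosen for the example. The abstract does not say how the adaptive step works; the sketch assumes it means adding newly labeled messages to the stored examples through the hypothetical `add_example` method.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (an assumed, very simple tokenizer)."""
    return text.lower().split()


class VectorSpaceKNN:
    """Hypothetical k-NN text classifier over TF-IDF vectors."""

    def __init__(self, k=3):
        self.k = k
        self.docs = []             # list of (term-count Counter, label)
        self.doc_freq = Counter()  # number of stored documents containing each term

    def add_example(self, text, label):
        """Store one labeled message; calling this later is the assumed 'adaptive' step."""
        counts = Counter(tokenize(text))
        self.docs.append((counts, label))
        self.doc_freq.update(counts.keys())

    def _weights(self, counts):
        """Turn raw term counts into TF-IDF weights using a smoothed IDF."""
        n = len(self.docs)
        return {
            term: tf * (math.log((1 + n) / (1 + self.doc_freq[term])) + 1.0)
            for term, tf in counts.items()
        }

    @staticmethod
    def _cosine(a, b):
        """Cosine similarity between two sparse vectors stored as dicts."""
        dot = sum(w * b.get(term, 0.0) for term, w in a.items())
        norm_a = math.sqrt(sum(w * w for w in a.values()))
        norm_b = math.sqrt(sum(w * w for w in b.values()))
        if norm_a == 0.0 or norm_b == 0.0:
            return 0.0
        return dot / (norm_a * norm_b)

    def classify(self, text):
        """Return the majority label among the k most similar stored messages."""
        if not self.docs:
            raise ValueError("no training examples stored")
        query = self._weights(Counter(tokenize(text)))
        scored = [
            (self._cosine(query, self._weights(counts)), label)
            for counts, label in self.docs
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        votes = Counter(label for _, label in scored[: self.k])
        return votes.most_common(1)[0][0]


# Toy messages invented for illustration; they are not from either dataset.
clf = VectorSpaceKNN(k=3)
for text, label in [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("cheap money offer free", "spam"),
    ("meeting agenda for monday", "ham"),
    ("project report attached for review", "ham"),
    ("lunch on monday with the team", "ham"),
]:
    clf.add_example(text, label)

print(clf.classify("claim your free money"))
```

Recomputing the weights at query time keeps the sketch short and means that any message added later immediately affects the IDF values; a practical filter would more likely cache the document vectors.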

Authors and Affiliations

Nakul Dave, Uttam Chauhan and Avani Dave

How To Cite

Nakul Dave, Uttam Chauhan and Avani Dave (2012). An Adaptive Classification approach to filter spam E-mail using Vector Space Model. International Journal of Management, IT and Engineering, 2(8), -. https://europub.co.uk/articles/-A-18502