Techniques for text classification: Literature review and current trends

Journal Title: Webology - Year 2015, Vol 12, Issue 2

Abstract

Automated classification of text into predefined categories has long been considered a vital method for managing and processing the vast and continuously growing number of documents in digital form. This web information, popularly known as digital or electronic information, takes the form of documents, conference material, publications, journals, editorials, web pages, e-mail, etc. People now largely access information from these online sources rather than being limited to paper sources such as books, magazines and newspapers. The main problem is that this enormous body of information lacks organization, which makes it difficult to manage. Text classification is recognized as one of the key techniques for organizing such digital data. In this paper we study the existing work in the area of text classification, which allows a fair evaluation of the progress made in this field to date. We have investigated the papers to the best of our knowledge and have tried to summarize the existing information in a comprehensive and succinct manner. The studies are summarized in tabular form by publication year, considering numerous key perspectives. The main emphasis is on the steps involved in the text classification process, viz. the document representation methods, feature selection methods, data mining methods and the evaluation technique used by each study to obtain results on a particular dataset.
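
The four steps named in the abstract (document representation, feature selection, a data mining method, and evaluation) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy documents and labels, the bag-of-words representation, document-frequency feature selection, a multinomial Naive Bayes classifier with add-one smoothing, accuracy as the evaluation measure, and all function names.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Document representation (assumed): lowercase bag-of-words tokens."""
    return text.lower().split()


def select_features(docs, k):
    """Feature selection (assumed): keep the k terms with the highest
    document frequency; ties are broken alphabetically."""
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    ranked = sorted(df.items(), key=lambda item: (-item[1], item[0]))
    return {term for term, _ in ranked[:k]}


def train_naive_bayes(docs, labels, vocab):
    """Data mining method (assumed): multinomial Naive Bayes with
    add-one smoothing, restricted to the selected vocabulary."""
    class_counts = Counter(labels)
    term_counts = defaultdict(Counter)
    for doc, label in zip(docs, labels):
        term_counts[label].update(t for t in tokenize(doc) if t in vocab)
    model = {}
    for label, n_docs in class_counts.items():
        total = sum(term_counts[label].values())
        log_prior = math.log(n_docs / len(docs))
        log_like = {
            t: math.log((term_counts[label][t] + 1) / (total + len(vocab)))
            for t in vocab
        }
        model[label] = (log_prior, log_like)
    return model


def predict(model, doc):
    """Return the label with the highest log-posterior score."""
    scores = {}
    for label, (log_prior, log_like) in model.items():
        scores[label] = log_prior + sum(
            log_like[t] for t in tokenize(doc) if t in log_like
        )
    return max(sorted(scores), key=scores.get)


# Toy data, invented for this sketch.
train_docs = [
    "cheap pills buy now",
    "buy cheap watches now",
    "meeting agenda for monday",
    "project meeting notes attached",
]
train_labels = ["spam", "spam", "ham", "ham"]
test_docs = ["buy cheap pills", "monday project meeting"]
test_labels = ["spam", "ham"]

vocab = select_features(train_docs, k=6)
model = train_naive_bayes(train_docs, train_labels, vocab)
predictions = [predict(model, doc) for doc in test_docs]

# Evaluation (assumed): accuracy on the held-out documents.
accuracy = sum(p == y for p, y in zip(predictions, test_labels)) / len(test_labels)
print(predictions, accuracy)
```

Each function corresponds to one step, so a different representation, selection criterion, classifier or evaluation measure could be swapped in without touching the others.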

Authors and Affiliations

Rajni Jindal, Ruchika Malhotra and Abha Jain

How To Cite

Rajni Jindal, Ruchika Malhotra and Abha Jain (2015). Techniques for text classification: Literature review and current trends. Webology, 12(2), -. https://europub.co.uk/articles/-A-687753